Mathematical Modelling of Continuum Physics 3031208137, 9783031208133

This monograph provides a comprehensive and self-contained treatment of continuum physics, illustrating a systematic app

421 109 9MB

English Pages 1017 [1018] Year 2023

Report DMCA / Copyright

DOWNLOAD PDF FILE

Table of contents :
Preface
Contents
Part I Basic Principles and Balance Equations
1 Kinematics
1.1 Frames of Reference and Configurations
1.2 Deformation
1.2.1 Effects on Lengths, Areas, and Volumes
1.2.2 Some Identities Involving the Deformation Gradient
1.3 Motion
1.3.1 Stretching and Spin Tensors
1.4 Rotations and Angular Velocities
1.4.1 Active Rotations
1.5 Transport Relations for Convecting Sets
1.5.1 Identities About the Total Time Derivative of Spatial Gradients
1.5.2 Transport Relations for Vorticity and Velocity Gradient
1.6 Transport Relations for Non-Convecting Sets
1.6.1 Transport Theorems for Discontinuous Fields
1.7 Kinematics of Singular Surfaces
1.7.1 Geometrical Conditions of Compatibility
1.7.2 Kinematical Conditions of Compatibility
1.7.3 Jumps at Acoustic Waves
1.8 Transport Theorems for Surface Integrals
1.8.1 Transport Theorems for Discontinuous Fields
1.9 Objectivity
1.9.1 Transformation Rules for Kinematic Fields
1.9.2 Time Derivatives Relative to Generic Frames of Reference
1.10 Objective Time Derivatives
1.10.1 General Form of Objective Derivatives
1.10.2 Rivlin–Eriksen Tensors
1.10.3 Spins and Angular Velocities Under Euclidean Transformations
2 Balance Equations
2.1 Balance of Mass
2.2 Balance of Linear Momentum
2.3 Balance of Angular Momentum
2.4 Balance of Energy
2.4.1 First Law of Thermodynamics
2.4.2 Balance of Virtual Power
2.5 Material Forms of the Balance Equations
2.5.1 Equation of Motion in Pre-Stressed Materials
2.5.2 Lie Derivative of the Cauchy Stress
2.5.3 Eshelby Stress and Linear-Angular Momentum
2.6 Balance of Entropy
2.6.1 Material Formulation of the Second Law
2.6.2 Third Law of Thermodynamics
2.6.3 Exergy
2.6.4 Entropy Production in Stochastic Kinetics
2.7 Second Law and Phase-Field Models
2.8 Bernoulli's Law and Balance Equations for Fluids
2.8.1 Bernoulli's Law
2.8.2 Variational Derivation of the Equation of Motion
2.9 Balance in a Control Volume
2.10 Basic Principles in Electromagnetism
2.10.1 Balance Equations in Free Space
2.10.2 Balance Laws in Matter
2.11 The Invariant Form of Maxwell's Equations
2.12 Maxwell's Equations for the Fields at the Rest Frame
2.13 Material Description of Electromagnetic Fields
2.14 Formulations of Electromagnetism in Matter
2.15 Poynting's Theorem
2.16 Balance Equations in Electromagnetism
2.16.1 Forces, Torques, and Energy Supply
2.16.2 Angular Momentum and Magnetic Moment
2.16.3 Maxwell Stress Tensor
2.16.4 Balance of Entropy
2.17 Conservation Laws Across a Singular Surface
2.17.1 Imbalance Laws Across a Singular Surface
2.17.2 Jump Conditions
2.17.3 Rankine–Hugoniot Condition
2.17.4 Interaction Between Discontinuities of Different Order
2.17.5 Eigenvector Form of Weak Discontinuities
2.17.6 Jump Conditions of Integrals
2.17.7 Jump Conditions in the Reference Configuration
2.18 Balance Laws for Discontinuous Electromagnetic Fields
2.18.1 Boundary Conditions
Part II Constitutive Models of Simple Materials
3 Generalities on Constitutive Models
3.1 Constitutive Equations
3.2 Objectivity
3.2.1 Dependence on the Velocity Gradient
3.2.2 Dependence on the Deformation Gradient
3.2.3 Thermoelastic Variables
3.2.4 Thermo-Viscous Variables
3.2.5 Dependence on Histories
3.2.6 Variables in Rate-Type Equations
3.2.7 Objectivity and Invariants
3.3 Objectivity and Euclidean Invariants in Electromagnetism
3.4 Consistency with the Second Law
3.4.1 Exploitation of the Second Law
3.4.2 Other Formulations of the Second Law
3.4.3 Remarks on Onsager's Reciprocal Relations
3.5 Entropy Equation and Gibbs Equations
3.6 Second Law and Representation of Constitutive Functions
4 Solids
4.1 Thermoelastic Solids
4.1.1 Linear Theory
4.1.2 Free Energy and Invariants
4.2 Elastic Solids
4.2.1 Material Symmetries
4.2.2 Elastic Waves
4.2.3 Linear Theory
4.2.4 Elastostatics and Compatibility of Strains
4.2.5 Vibrating Strings, Chains, and Phonons
4.3 Internal Constraints
4.4 Hyperelastic Solids and Rubber-Like Materials
4.4.1 Compressible Mooney-Rivlin Solid
4.5 Modelling of Dissipative Solids
4.5.1 A Model for Viscosity and Heat Conduction
4.6 Modelling via Dissipation Potentials
4.6.1 Convex Dissipation Potential
4.6.2 Model with a Scalar Internal Variable
4.6.3 Model with a Vector Internal Variable
5 Fluids
5.1 Elastic Fluids
5.1.1 Water Wave Theories
5.2 Thermoelastic Fluids
5.2.1 Gibbs Relations and Entropy Equation
5.2.2 Conjugate Variables and Maxwell's Relations
5.2.3 Specific Heats
5.3 Ideal Gas
5.3.1 Specific Heats and Entropy Functions
5.4 Models of Real Gases
5.4.1 Van der Waals Model
5.4.2 Peng-Robinson Model
5.5 Heat-Conducting, Viscous Fluids
5.5.1 Stokesian Fluids
5.5.2 Boundary Conditions
5.5.3 Incompressible Fluids
5.5.4 Oberbeck–Boussinesq Approximation
5.6 Newtonian Fluids
5.6.1 Models with Thermal Expansion and Pressure-Dependent Viscosities
5.6.2 Fluids with Pressure-Dependent Viscosities
5.6.3 Vorticity and Enstrophy Transport Equation
5.6.4 Viscosity and Energy Decay
5.6.5 Incompressible Newtonian Fluids
5.7 Generalized Newtonian Fluids
5.8 Viscoplastic and Viscoelastic Fluids
5.8.1 Viscoplastic Fluids
5.9 Models of Turbulence
5.9.1 Incompressible Fluids
5.9.2 Compressible Fluids
Part III Non-simple Materials
6 Rate-Type Models
6.1 Rheological Models
6.1.1 Rigid-Perfectly Plastic Model with Kinematic Hardening
6.1.2 Elastic-Perfectly Plastic Solid
6.1.3 Elastic-Plastic Model with Kinematical Hardening
6.1.4 Maxwell-Wiechert Fluid
6.1.5 Bingham Model
6.1.6 Bingham-Maxwell Model
6.1.7 Kelvin-Voigt Solid
6.1.8 Jeffreys' and Burgers' Fluids
6.1.9 Standard Linear Solid
6.1.10 Generalized Models
6.1.11 Relaxation Modulus and Creep Compliance
6.2 Rate-Type Models of Fluids
6.2.1 Rate Equation for Thermo-Viscous Fluids in the Eulerian Description
6.2.2 Invariant Fields and Objective Rates
6.2.3 Rate Equations in the Material Description
6.2.4 Nonlinear Rate-Type Models of Fluids
6.3 Rate-Type Models of Solids
6.3.1 The Kelvin-Voigt-Fourier Solid
6.3.2 Thermo-Viscoelastic Materials
6.3.3 Thermo-Viscoplastic Models
6.4 Higher-Order Rate Models
6.4.1 Burgers' Fluid
6.4.2 Oldroyd-B Fluid
6.4.3 White-Metzner Fluid
6.4.4 Wave Features in Higher-Order Rate Models
6.5 Wave Equation in Hereditary Fluids
6.6 Further Thermoelastic Models
6.6.1 Maxwell-Cattaneo Equation; a Generalized Form
6.6.2 Temperature-Rate Dependent Thermoelastic Materials
6.6.3 Thermoelasticity Based on an Integral Variable
7 Materials with Memory
7.1 Materials with Fading Memory
7.1.1 Fading Memory Space of Histories
7.1.2 Difference and Summed Histories
7.1.3 Properties About Histories and Norms
7.2 Thermoelastic Materials with Memory
7.3 Rigid Heat Conductors
7.3.1 Models Dependent on Thermal Histories
7.3.2 Models Dependent on Summed Thermal Histories
7.3.3 Moore–Gibson–Thompson Temperature Equation
7.4 Linear Viscoelasticity
7.4.1 Linear Viscoelastic Solids
7.4.2 Thermodynamic Restrictions for the Linear Viscoelastic Solid
7.4.3 Free Energies and Minimal States
7.4.4 Viscoelastic Solids with Unbounded Relaxation Functions
7.4.5 Nonlinear Viscoelastic Models
7.4.6 Some Examples
7.5 Viscous Fluids with Memory
7.5.1 Incompressible Viscoelastic Fluids
7.5.2 Compressible Viscoelastic Fluids
7.5.3 Maxwell-Like Viscoelastic Fluids
7.5.4 Acceleration Waves in Viscous Fluids with Memory
7.6 Electromagnetic Materials with Memory
7.6.1 Dielectrics with Memory
7.6.2 Conductors with Memory
7.6.3 Polarizable Conductors with Memory
7.6.4 Magneto-Viscoelasticity
7.6.5 Rate Equations in the Eulerian Description
7.6.6 Solids
7.7 Causality and Kramers–Kronig Relations
7.8 Memory Models via Fractional Derivatives
7.8.1 Preliminaries on Fractional Derivatives
7.8.2 Fractional Derivatives
7.8.3 Heat Conduction via Fractional Derivatives
7.8.4 Wave Propagation Properties
7.9 Viscoelastic Models of Fractional Order
8 Aging and Higher-Order Grade Materials
8.1 Aging
8.2 Aging of Thermoelastic Solids
8.3 Aging of Rate-Type Materials
8.3.1 Aging Properties
8.4 Aging of Thermo-Viscoelastic Materials
8.5 An Aging Model of Viscous Fluid with Memory
8.6 Damage
8.7 Fluids of Higher-Order Grade
8.7.1 Second Grade Fluid
8.7.2 Third Grade Fluid
8.8 Interaction Effects via Materials of Higher Order
8.9 Modelling via the Interstitial Working
8.9.1 Modelling via the Extra-Entropy Flux
8.10 Materials of Korteweg Type
8.11 Hyperstress and Materials of Higher-Order Grade
8.11.1 Solids of Higher-Order Grade
8.12 A New Scheme Associated with the Hyperstress
9 Mixtures
9.1 Kinematics
9.2 Balance Equations
9.2.1 Balance Equations in the Eulerian Description
9.2.2 Mass Density in the Reference Configuration and Incompressibility
9.3 Second Law of Thermodynamics
9.3.1 Principle of Phase Separation
9.3.2 Remarks on the Thermodynamic Restrictions
9.4 Balance Equations for the Whole Mixture
9.4.1 Stoichiometry Requirements on the Mass Growths
9.4.2 Balance Equation for the Diffusion Flux
9.5 Constitutive Models for Fluid Mixtures
9.5.1 Extent of Reaction and Law of Mass Action
9.5.2 Non-reacting Mixtures with Several Temperatures
9.5.3 Reacting Mixtures with a Single Temperature
9.5.4 Gibbs Equations
9.5.5 Second-Law Inequalities
9.6 Mixtures of Ideal Gases
9.6.1 Chemical Potentials
9.6.2 Mixing Entropy and Mixing Free Energy
9.7 Diffusion
9.7.1 Dynamic Diffusion Equation
9.7.2 Fourier-Like and Rate-Type Diffusion Equations
9.7.3 Maxwell–Stefan Diffusion Model
9.7.4 Coupled Effects Among Diffusion and Heat Conduction
9.7.5 Evolution Problems
9.8 Solid Mixtures
9.8.1 Constitutive Assumptions and Thermodynamic Restrictions
9.9 Immiscible Mixtures
9.10 Entropy Inequality and Models for the Whole Mixture
9.10.1 Gibbs Equations and Restrictions on the Diffusion Flux
9.10.2 Gibbs–Duhem Equation
10 Micropolar Media
10.1 Kinematics of Micropolar Media
10.1.1 Orientational Momentum
10.2 Balance Laws
10.2.1 Balance Laws in the Spatial Description
10.2.2 Balance Laws in the Material Description
10.2.3 Thermoviscous Micropolar Media
10.3 Liquid Crystals
10.3.1 Modelling of Thermotropic Liquid Crystals
10.3.2 Director Field, Energy, and Objectivity
10.4 Nematics
10.4.1 Relation to Other Models Involving the Director Field
10.5 Smectics and Cholesterics
10.6 Mixtures of Micropolar Constituents
10.6.1 Second Law Inequality
10.7 Nanofluids
10.7.1 Brownian Diffusion and Thermophoresis
10.7.2 Model Approximations and Nanofluid Properties
11 Porous Materials
11.1 Porous Materials as Mixtures
11.2 Constitutive Models of Porous Materials
11.2.1 Darcy's Law
11.2.2 Darcy-Like Equations
11.3 Special Models of Porous Materials
11.4 Materials with Voids
11.5 Porous Media with Double Porosity
11.5.1 Porous Material with Undeformable Solid
11.5.2 Porous Material with a Thermoelastic Solid
12 Electromagnetism of Continuous Media
12.1 A Thermodynamic Setting for Electromagnetic Solids
12.2 Electroelasticity
12.3 Electroelastic Materials
12.4 Dielectrics with Polarization Gradient
12.5 Fluids in Electromagnetic Fields
12.6 Magnetoelasticity
12.7 Micromagnetics and Magnetism in Rigid Bodies
12.7.1 Rate Equations and Thermodynamic Restrictions
12.7.2 Evolution Equations of Magnetization
12.8 Ferrofluids
12.8.1 Magneto-Rheological Fluids
12.9 Magneto-, Electro-, and Mechanical-Optical Effects
12.9.1 Magneto-Optical Effects
12.9.2 Frequency Dependence, Closed Processes, Memory Functionals
12.9.3 Electro-Optical Effects
12.9.4 Mechanical-Optical Effects
12.9.5 Magnetocaloric and Electro-Caloric Effects
12.10 Plasmas
12.10.1 One-Fluid Plasma Theory and Magnetohydrodynamics
12.11 Chiral Media and Optical Activity
12.11.1 Wave Solutions in Chiral Media
12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites)
12.12.1 Magnetization Model with Damping Effects
Part IV Hysteresis and Phase Transitions
13 Plasticity
13.1 Qualitative Aspects of the Stress–Strain Curve
13.1.1 Yield Criteria
13.2 Rate-Independent Scheme of Plastic Flow
13.2.1 Consistency of Mises–Hill Model with a Defect Energy
13.3 A Temperature-Dependent Model
13.4 Gradient Theories of Plasticity
13.5 Model of Plasticity via the Kröner Decomposition
13.6 A Decomposition-Free Thermodynamic Scheme
13.6.1 Duhem-Like Solids
13.6.2 Hyperelastic and Hypoelastic Solids
13.6.3 Dissipative Duhem-Like Solids
13.6.4 Elastic–Plastic Models
13.6.5 One-Dimensional Models
13.6.6 The Helmholtz Free Energy
13.6.7 Some Hysteretic Models
13.6.8 Models with Additive Decomposition of the Strain Rate
13.7 Constitutive Models of Polymeric Foams
14 Superconductivity and Superfluidity
14.1 Superconductors
14.1.1 Discoveries About Superconductivity
14.1.2 The London Equations
14.1.3 An Alternative to London Equations
14.1.4 A Nonlocal Model
14.1.5 Phase Transition Curve
14.1.6 Ginzburg–Landau Theory
14.2 Superfluids
14.2.1 History of Superfluidity and Properties of Superfluids
14.2.2 First and Second Sound
14.2.3 Discontinuity Waves
14.2.4 Mass Fractions and Chemical Potentials
14.2.5 Rotation of Superfluids
14.2.6 The Ginzburg–Landau Theory
15 Ferroics
15.1 Evolution Function and Hysteretic Effects in Ferroelectrics
15.1.1 One-Dimensional Models of Ferroelectric Hysteresis
15.2 Ginzburg–Landau–Devonshire Theory
15.2.1 Landau Free Energy
15.2.2 Landau–Devonshire Free Energy
15.3 Ferromagnetism
15.3.1 Thermodynamic Approach to Magnetic Hysteresis
15.3.2 One-Dimensional Models of Ferromagnetic Hysteresis
15.3.3 Curie–Weiss Law
16 Phase Transitions
16.1 Jump Conditions at Interfaces
16.2 Phase Transitions at Sharp Interfaces with T= - p 1
16.2.1 The Stefan Model
16.3 Liquid–Vapour Transition
16.3.1 Continuity of the Gibbs Free Energy
16.3.2 Viscous Phases and Surface Tension
16.3.3 The Phase Rule
16.4 Solid–Fluid Transition
16.4.1 Transition in a Finite Layer—A Mixture Model
16.4.2 Evolution of the Mass Fraction
16.5 Solidification-Melting of Binary Alloys
16.6 Brine Channels Formation in Sea Ice
16.6.1 Transition Models via Minimization of Functionals
16.7 Phase-Field Model of Liquid-Solid Transitions
16.8 Phase Transitions in SMA
Appendix A Notes on Vectors and Tensors
A.1 Vector and Tensor Algebra
A.1.1 Eigenvalues and Eigenvectors
A.1.2 Length and Induced Norm
A.1.3 Representation Formulae for Vectors and Tensors
A.2 Isotropic Tensors
A.3 Differentiation
A.4 Integration
A.5 Harmonic Waves and Complex-Valued Functions
A.6 Fourier Transform
Appendix References
Index
Recommend Papers

Mathematical Modelling of Continuum Physics
 3031208137, 9783031208133

  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Citation preview

Modeling and Simulation in Science, Engineering and Technology

Angelo Morro Claudio Giorgi

Mathematical Modelling of Continuum Physics

Modeling and Simulation in Science, Engineering and Technology

Series Editors Nicola Bellomo Department of Mathematical Sciences Politecnico di Torino Turin, Italy

Tayfun E. Tezduyar Department of Mechanical Engineering Rice University Houston, TX, USA

Editorial Board Kazuo Aoki National Taiwan University Taipei, Taiwan Yuri Bazilevs School of Engineering Brown University Providence, RI, USA Mark Chaplain School of Mathematics and Statistics University of St. Andrews St. Andrews, UK Pierre Degond Department of Mathematics Imperial College London London, UK Andreas Deutsch Center for Information Services and High-Performance Computing Technische Universität Dresden Dresden, Sachsen, Germany Livio Gibelli Institute for Multiscale Thermofluids University of Edinburgh Edinburgh, UK Miguel Ángel Herrero Departamento de Matemática Aplicada Universidad Complutense de Madrid Madrid, Spain Thomas J. R. Hughes Institute for Computational Engineering and Sciences The University of Texas at Austin Austin, TX, USA

Petros Koumoutsakos Computational Science and Engineering Laboratory ETH Zürich Zürich, Switzerland Andrea Prosperetti Cullen School of Engineering University of Houston Houston, TX, USA K. R. Rajagopal Department of Mechanical Engineering Texas A&M University College Station, TX, USA Kenji Takizawa Department of Modern Mechanical Engineering Waseda University Tokyo, Japan Youshan Tao Department of Applied Mathematics Donghua University Shanghai, China Harald van Brummelen Department of Mechanical Engineering Eindhoven University of Technology Eindhoven, Noord-Brabant The Netherlands

Angelo Morro Claudio Giorgi •

Mathematical Modelling of Continuum Physics

Angelo Morro DIBRIS University of Genova Genova, Italy

Claudio Giorgi DICATAM University of Brescia Brescia, Italy

ISSN 2164-3679 ISSN 2164-3725 (electronic) Modeling and Simulation in Science, Engineering and Technology ISBN 978-3-031-20813-3 ISBN 978-3-031-20814-0 (eBook) https://doi.org/10.1007/978-3-031-20814-0 Mathematics Subject Classification: 35Q30, 35Q74, 35Q79, 70B10, 74-XX, 76-XX, 78A25, 80A05, 80A17, 80A22 © The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 This work is subject to copyright. All rights are solely and exclusively licensed by the Publisher, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors, and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. This book is published under the imprint Birkhäuser, www.birkhauser-science.com by the registered company Springer Nature Switzerland AG The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland

Preface

This book provides a unified treatment of continuum physics. A systematic approach to the balance equations is elaborated for wide-ranging classes of materials (classical continua, micropolar continua, mixtures, electromagnetic continua) following the lines of the encyclopedic handbook articles of Truesdell and Toupin and Truesdell and Noll. As is standard in Rational Thermodynamics, the constitutive properties are required to obey the objectivity principle and to be consistent with the second law of thermodynamics. Yet here a rather new approach is developed by viewing the entropy production as a constitutive function per se as is the case for the entropy and the entropy flux. While this does not determine any new result for simple materials, it proves conceptually and practically advantageous in the modelling of nonlinear phenomena such as those occurring in hysteretic continua, e.g. in plasticity, electromagnetism, and physics of shape-memory alloys. The book is suitable for engineers, physicists, and mathematicians. The derivations of the sought results are fairly detailed through careful proofs. Though a wide variety of subjects are examined, the contents are developed so as to get a self-contained and consistent presentation of the various topics. Part I reviews the kinematics of continuous bodies and illustrates the general setting of balance laws. Kinematics treats essential preliminaries to continuum physics such as reference and current configurations, transport relations, singular surfaces, objectivity and objective time derivatives. Next, a chapter on balance equations develops the balance laws of mass, linear momentum, angular momentum, energy, entropy (Clausius–Duhem inequality), and the balance laws in electromagnetism. Part II is first devoted to the general requirements of constitutive models. In this sense, emphasis is given to the application of objectivity (the constitutive equations must be invariant under changes of frame) and consistency with the second law of thermodynamics (the constitutive equations must satisfy the restrictions placed by the entropy inequality). Next, a review is given of common models of simple materials, namely materials described by the first-order gradient of deformation, velocity, and temperature. In this framework, detailed descriptions are given of

v

vi

Preface

(thermoelastic, elastic, and dissipative) solids and (elastic, thermoelastic, viscous, and Newtonian) fluids. A wide variety of constitutive models is investigated in Part III, consisting of separate chapters of non-simple materials. The chapter on rate-type models reviews the rheological devices and next shows some schemes within the Eulerian and the Lagrangian description with emphasis on objective time derivatives. The chapter on materials with memory begins with a general setting of (fading) memory and next shows memory models for thermoelasticity, heat conduction, viscoelasticity, electromagnetic solids, and modelling via fractional derivatives. Next, the modelling of aging (thermoelastic, rate-type, and thermo-viscoelastic) materials is exhibited in connection with the thermodynamic restrictions. Also, materials of higher-order grade are examined in the form of fluids and solids or models via interstitial working. The possible consistency of the hyperstress with the standard balance laws is shown to hold. A chapter is devoted to mixtures; in addition to balance equations for the constituents and the whole mixture, some aspects are investigated such as models of diffusion, Soret and Dufour effects, immiscible mixtures, and models for the whole mixture. Micropolar media are modelled as materials with a physical internal structure. Each point of the continuum is viewed as a body with a finite number of degrees of freedom. The balance laws are then reviewed so that the internal degrees of freedom show up in the balance of orientational momentum and energy. The model, so established, is then applied to the description of liquid crystals and nanofluids. Porous materials are described as mixtures (solid–liquid or solid–void). Also, porous materials with double porosity are established. The chapter on electromagnetism of continuous media describes a number of phenomena. The interaction of the electromagnetic field with deformation is examined within electroelasticity, magnetoelasticity, and plasma theory. This in turn allows us to examine magneto-, electro-, and mechanical-optical effects. Memory effects are modelled in quite a general setting. Nonlinearity effects are especially framed within micromagnetics and ferrofluids. Chiral media and ferrites are investigated in detail also to show the optical activity effects on linearly- and circularly-polarized waves. Finally, superconductivity and superfluidity are developed in a common framework of mixtures of reacting fluids; normal electrons and superconducting electrons in one case, normal fluid and superfluid in the other. Hysteretic effects and phase transitions are developed in Part IV. Hysteresis is modelled in ferroelectrics, ferromagnetism, and plasticity. In all of these contexts, the modelling is performed by having recourse to the entropy production as a (non-negative) constitutive function. It follows that the free energy governs the anhysteretic behaviour, while the entropy production characterizes the hysteretic properties. Phase transitions are described in different ways. A transition may occur at a sharp interface between two different phases; the jump conditions across the interface govern the transition. Instead, a transition may occur in a diffuse region where the pertinent fields change continuously; in essence, the transition region is occupied by a mixture of constituents in different phases. A scheme for phase transitions and hysteretic effects in shape-memory alloys is also outlined.

Preface

vii

Notation We use mostly direct notation and the recourse to index notation is made only when, otherwise, the pertinent expression might become ambiguous. The symbols of continuum mechanics are used, e.g. in the books of Truesdell and Noll [426] and Gurtin et al [214]. Yet difficulties arise whenever any well-established symbol has a different meaning in continuum mechanics and in electromagnetism. For instance, E is the Green-St Venant strain tensor in continuum mechanics and the electric field in electromagnetism while D is the stretching tensor in continuum mechanics and the electric displacement in electromagnetism. To avoid the introduction of newly defined symbols, we have maintained the standard symbols whenever the pertinent chapter makes it clear which meaning has to be assigned to the symbol. There are places where the stretching tensor and the electric displacement occur simultaneously; in such cases, we use D for the stretching tensor. Another difficulty arises in connection with the vector x; it is currently used for angular velocity and vorticity. To avoid any ambiguities, we have maintained the symbol x for the angular velocity and have used - for the vorticity (i.e. r v). Throughout, lightface letters indicate scalars, while boldface letters indicate vectors or tensors. In the mechanical context, lowercase (uppercase) boldface letters indicate vectors (tensors). In electromagnetism, following the literature, the (electric and magnetic) vectors are denoted by uppercase letters, e.g. E; B. Moreover, Lin is the set of all tensors, Lin+ is the set of all tensors with positive determinant, Sym is the set of all symmetric tensors, Skw is the set of all skew tensors, Psym is the set of all symmetric, positive definite tensors, Orth is the set of all orthogonal tensors, Orth+ is the set of all rotations (all orthogonal tensors with positive determinant). For the benefit of the reader an Appendix “Notes on vectors and tensors” reviews the essential contents of algebra and analysis of vector and tensor functions. Genova, Italy Brescia, Italy

Angelo Morro Claudio Giorgi

Contents

Part I 1

Basic Principles and Balance Equations

Kinematics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.1 Frames of Reference and Configurations . . . . . . . . . . . . . . . 1.2 Deformation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.2.1 Effects on Lengths, Areas, and Volumes . . . . . . . . . 1.2.2 Some Identities Involving the Deformation Gradient . 1.3 Motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.3.1 Stretching and Spin Tensors . . . . . . . . . . . . . . . . . . 1.4 Rotations and Angular Velocities . . . . . . . . . . . . . . . . . . . . . 1.4.1 Active Rotations . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.5 Transport Relations for Convecting Sets . . . . . . . . . . . . . . . . 1.5.1 Identities About the Total Time Derivative of Spatial Gradients . . . . . . . . . . . . . . . . . . . . . . . . 1.5.2 Transport Relations for Vorticity and Velocity Gradient . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.6 Transport Relations for Non-Convecting Sets . . . . . . . . . . . . 1.6.1 Transport Theorems for Discontinuous Fields . . . . . 1.7 Kinematics of Singular Surfaces . . . . . . . . . . . . . . . . . . . . . 1.7.1 Geometrical Conditions of Compatibility . . . . . . . . . 1.7.2 Kinematical Conditions of Compatibility . . . . . . . . . 1.7.3 Jumps at Acoustic Waves . . . . . . . . . . . . . . . . . . . . 1.8 Transport Theorems for Surface Integrals . . . . . . . . . . . . . . . 1.8.1 Transport Theorems for Discontinuous Fields . . . . . 1.9 Objectivity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.9.1 Transformation Rules for Kinematic Fields . . . . . . . 1.9.2 Time Derivatives Relative to Generic Frames of Reference . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1.10 Objective Time Derivatives . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . . . .

3 3 5 10 16 18 20 22 24 26

..

28

. . . . . . . . . . .

. . . . . . . . . . .

30 32 34 36 38 40 46 47 49 51 53

.. ..

56 62

. . . . . . . . . .

ix

x

Contents

1.10.1 General Form of Objective Derivatives . . . . . . . . . . . . 1.10.2 Rivlin–Eriksen Tensors . . . . . . . . . . . . . . . . . . . . . . . . 1.10.3 Spins and Angular Velocities Under Euclidean Transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2

Balance Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.1 Balance of Mass . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.2 Balance of Linear Momentum . . . . . . . . . . . . . . . . . . . 2.3 Balance of Angular Momentum . . . . . . . . . . . . . . . . . . 2.4 Balance of Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.4.1 First Law of Thermodynamics . . . . . . . . . . . . . 2.4.2 Balance of Virtual Power . . . . . . . . . . . . . . . . 2.5 Material Forms of the Balance Equations . . . . . . . . . . . 2.5.1 Equation of Motion in Pre-Stressed Materials . . 2.5.2 Lie Derivative of the Cauchy Stress . . . . . . . . 2.5.3 Eshelby Stress and Linear-Angular Momentum 2.6 Balance of Entropy . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.6.1 Material Formulation of the Second Law . . . . . 2.6.2 Third Law of Thermodynamics . . . . . . . . . . . . 2.6.3 Exergy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.6.4 Entropy Production in Stochastic Kinetics . . . . 2.7 Second Law and Phase-Field Models . . . . . . . . . . . . . . 2.8 Bernoulli’s Law and Balance Equations for Fluids . . . . 2.8.1 Bernoulli’s Law . . . . . . . . . . . . . . . . . . . . . . . 2.8.2 Variational Derivation of the Equation of Motion . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.9 Balance in a Control Volume . . . . . . . . . . . . . . . . . . . 2.10 Basic Principles in Electromagnetism . . . . . . . . . . . . . . 2.10.1 Balance Equations in Free Space . . . . . . . . . . . 2.10.2 Balance Laws in Matter . . . . . . . . . . . . . . . . . 2.11 The Invariant Form of Maxwell’s Equations . . . . . . . . . 2.12 Maxwell’s Equations for the Fields at the Rest Frame . . 2.13 Material Description of Electromagnetic Fields . . . . . . . 2.14 Formulations of Electromagnetism in Matter . . . . . . . . 2.15 Poynting’s Theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 2.16 Balance Equations in Electromagnetism . . . . . . . . . . . . 2.16.1 Forces, Torques, and Energy Supply . . . . . . . . 2.16.2 Angular Momentum and Magnetic Moment . . . 2.16.3 Maxwell Stress Tensor . . . . . . . . . . . . . . . . . . 2.16.4 Balance of Entropy . . . . . . . . . . . . . . . . . . . . . 2.17 Conservation Laws Across a Singular Surface . . . . . . . 2.17.1 Imbalance Laws Across a Singular Surface . . . 2.17.2 Jump Conditions . . . . . . . . . . . . . . . . . . . . . . 2.17.3 Rankine–Hugoniot Condition . . . . . . . . . . . . .

63 69 70

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

73 73 76 82 85 88 90 93 97 99 100 103 107 109 109 110 111 113 114

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . .

115 116 119 120 124 128 134 135 139 143 144 146 154 155 160 161 162 163 164

Contents

xi

2.17.4 Interaction Between Discontinuities of Different Order . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2.17.5 Eigenvector Form of Weak Discontinuities . . . . 2.17.6 Jump Conditions of Integrals . . . . . . . . . . . . . . 2.17.7 Jump Conditions in the Reference Configuration 2.18 Balance Laws for Discontinuous Electromagnetic Fields . 2.18.1 Boundary Conditions . . . . . . . . . . . . . . . . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

165 167 168 169 171 173

3

Generalities on Constitutive Models . . . . . . . . . . . . . . . . . . . . 3.1 Constitutive Equations . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2 Objectivity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2.1 Dependence on the Velocity Gradient . . . . . . . . . 3.2.2 Dependence on the Deformation Gradient . . . . . . 3.2.3 Thermoelastic Variables . . . . . . . . . . . . . . . . . . . 3.2.4 Thermo-Viscous Variables . . . . . . . . . . . . . . . . . 3.2.5 Dependence on Histories . . . . . . . . . . . . . . . . . . 3.2.6 Variables in Rate-Type Equations . . . . . . . . . . . . 3.2.7 Objectivity and Invariants . . . . . . . . . . . . . . . . . . 3.3 Objectivity and Euclidean Invariants in Electromagnetism . 3.4 Consistency with the Second Law . . . . . . . . . . . . . . . . . . 3.4.1 Exploitation of the Second Law . . . . . . . . . . . . . 3.4.2 Other Formulations of the Second Law . . . . . . . . 3.4.3 Remarks on Onsager’s Reciprocal Relations . . . . 3.5 Entropy Equation and Gibbs Equations . . . . . . . . . . . . . . 3.6 Second Law and Representation of Constitutive Functions

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . .

177 177 178 181 181 182 184 185 186 187 188 189 192 193 197 199 201

4

Solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.1 Thermoelastic Solids . . . . . . . . . . . . . . . . . . . . . . . . 4.1.1 Linear Theory . . . . . . . . . . . . . . . . . . . . . . 4.1.2 Free Energy and Invariants . . . . . . . . . . . . . 4.2 Elastic Solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4.2.1 Material Symmetries . . . . . . . . . . . . . . . . . . 4.2.2 Elastic Waves . . . . . . . . . . . . . . . . . . . . . . 4.2.3 Linear Theory . . . . . . . . . . . . . . . . . . . . . . 4.2.4 Elastostatics and Compatibility of Strains . . 4.2.5 Vibrating Strings, Chains, and Phonons . . . . 4.3 Internal Constraints . . . . . . . . . . . . . . . . . . . . . . . . . 4.4 Hyperelastic Solids and Rubber-Like Materials . . . . . 4.4.1 Compressible Mooney-Rivlin Solid . . . . . . . 4.5 Modelling of Dissipative Solids . . . . . . . . . . . . . . . . 4.5.1 A Model for Viscosity and Heat Conduction 4.6 Modelling via Dissipation Potentials . . . . . . . . . . . .

. . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . .

203 203 208 211 213 215 219 223 233 235 239 243 245 250 252 256

Part II

Constitutive Models of Simple Materials

. . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . .

xii

Contents

4.6.1 4.6.2 4.6.3 5

Fluids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.1 Elastic Fluids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.1.1 Water Wave Theories . . . . . . . . . . . . . . . . . . 5.2 Thermoelastic Fluids . . . . . . . . . . . . . . . . . . . . . . . . . 5.2.1 Gibbs Relations and Entropy Equation . . . . . 5.2.2 Conjugate Variables and Maxwell’s Relations 5.2.3 Specific Heats . . . . . . . . . . . . . . . . . . . . . . . 5.3 Ideal Gas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.3.1 Specific Heats and Entropy Functions . . . . . . 5.4 Models of Real Gases . . . . . . . . . . . . . . . . . . . . . . . . 5.4.1 Van der Waals Model . . . . . . . . . . . . . . . . . . 5.4.2 Peng-Robinson Model . . . . . . . . . . . . . . . . . 5.5 Heat-Conducting, Viscous Fluids . . . . . . . . . . . . . . . . 5.5.1 Stokesian Fluids . . . . . . . . . . . . . . . . . . . . . . 5.5.2 Boundary Conditions . . . . . . . . . . . . . . . . . . 5.5.3 Incompressible Fluids . . . . . . . . . . . . . . . . . . 5.5.4 Oberbeck–Boussinesq Approximation . . . . . . 5.6 Newtonian Fluids . . . . . . . . . . . . . . . . . . . . . . . . . . . 5.6.1 Models with Thermal Expansion and Pressure-Dependent Viscosities . . . . . . . . 5.6.2 Fluids with Pressure-Dependent Viscosities . . 5.6.3 Vorticity and Enstrophy Transport Equation . . 5.6.4 Viscosity and Energy Decay . . . . . . . . . . . . . 5.6.5 Incompressible Newtonian Fluids . . . . . . . . . 5.7 Generalized Newtonian Fluids . . . . . . . . . . . . . . . . . . 5.8 Viscoplastic and Viscoelastic Fluids . . . . . . . . . . . . . . 5.8.1 Viscoplastic Fluids . . . . . . . . . . . . . . . . . . . . 5.9 Models of Turbulence . . . . . . . . . . . . . . . . . . . . . . . . 5.9.1 Incompressible Fluids . . . . . . . . . . . . . . . . . . 5.9.2 Compressible Fluids . . . . . . . . . . . . . . . . . . .

Part III 6

Convex Dissipation Potential . . . . . . . . . . . . . . . . . . . 256 Model with a Scalar Internal Variable . . . . . . . . . . . . . 258 Model with a Vector Internal Variable . . . . . . . . . . . . . 261 . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . .

263 263 266 273 275 276 279 280 282 283 283 285 286 293 295 296 299 300

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

301 303 305 306 307 309 311 311 314 314 316

Non-simple Materials

Rate-Type Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.1 Rheological Models . . . . . . . . . . . . . . . . . . . . . . . . . 6.1.1 Rigid-Perfectly Plastic Model with Kinematic Hardening . . . . . . . . . . . . . . . . . . . . . . . . . . 6.1.2 Elastic-Perfectly Plastic Solid . . . . . . . . . . . . 6.1.3 Elastic-Plastic Model with Kinematical Hardening . . . . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . 321 . . . . . . . 322 . . . . . . . 323 . . . . . . . 323 . . . . . . . 324

Contents

6.2

6.3

6.4

6.5 6.6

7

xiii

6.1.4 Maxwell-Wiechert Fluid . . . . . . . . . . . . . . . . . 6.1.5 Bingham Model . . . . . . . . . . . . . . . . . . . . . . . 6.1.6 Bingham-Maxwell Model . . . . . . . . . . . . . . . . 6.1.7 Kelvin-Voigt Solid . . . . . . . . . . . . . . . . . . . . . 6.1.8 Jeffreys’ and Burgers’ Fluids . . . . . . . . . . . . . . 6.1.9 Standard Linear Solid . . . . . . . . . . . . . . . . . . . 6.1.10 Generalized Models . . . . . . . . . . . . . . . . . . . . 6.1.11 Relaxation Modulus and Creep Compliance . . . Rate-Type Models of Fluids . . . . . . . . . . . . . . . . . . . . 6.2.1 Rate Equation for Thermo-Viscous Fluids in the Eulerian Description . . . . . . . . . . . . . . . 6.2.2 Invariant Fields and Objective Rates . . . . . . . . 6.2.3 Rate Equations in the Material Description . . . 6.2.4 Nonlinear Rate-Type Models of Fluids . . . . . . Rate-Type Models of Solids . . . . . . . . . . . . . . . . . . . . 6.3.1 The Kelvin-Voigt-Fourier Solid . . . . . . . . . . . . 6.3.2 Thermo-Viscoelastic Materials . . . . . . . . . . . . 6.3.3 Thermo-Viscoplastic Models . . . . . . . . . . . . . . Higher-Order Rate Models . . . . . . . . . . . . . . . . . . . . . 6.4.1 Burgers’ Fluid . . . . . . . . . . . . . . . . . . . . . . . . 6.4.2 Oldroyd-B Fluid . . . . . . . . . . . . . . . . . . . . . . . 6.4.3 White-Metzner Fluid . . . . . . . . . . . . . . . . . . . 6.4.4 Wave Features in Higher-Order Rate Models . . Wave Equation in Hereditary Fluids . . . . . . . . . . . . . . Further Thermoelastic Models . . . . . . . . . . . . . . . . . . . 6.6.1 Maxwell-Cattaneo Equation; a Generalized Form . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.6.2 Temperature-Rate Dependent Thermoelastic Materials . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6.6.3 Thermoelasticity Based on an Integral Variable

Materials with Memory . . . . . . . . . . . . . . . . . . . . . . . 7.1 Materials with Fading Memory . . . . . . . . . . . . . 7.1.1 Fading Memory Space of Histories . . . . 7.1.2 Difference and Summed Histories . . . . . 7.1.3 Properties About Histories and Norms . . 7.2 Thermoelastic Materials with Memory . . . . . . . . 7.3 Rigid Heat Conductors . . . . . . . . . . . . . . . . . . . 7.3.1 Models Dependent on Thermal Histories 7.3.2 Models Dependent on Summed Thermal Histories . . . . . . . . . . . . . . . . . . . . . . . 7.3.3 Moore–Gibson–Thompson Temperature Equation . . . . . . . . . . . . . . . . . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

325 326 327 328 329 332 335 337 338

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

339 342 345 350 354 354 356 358 359 360 363 365 367 368 369

. . . . . . 370 . . . . . . 372 . . . . . . 375 . . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

381 381 384 387 390 392 396 396

. . . . . . . . . . . 401 . . . . . . . . . . . 405

xiv

Contents

7.4

7.5

7.6

7.7 7.8

7.9 8

Linear Viscoelasticity . . . . . . . . . . . . . . . . . . . . . . . . . . 7.4.1 Linear Viscoelastic Solids . . . . . . . . . . . . . . . . . 7.4.2 Thermodynamic Restrictions for the Linear Viscoelastic Solid . . . . . . . . . . . . . . . . . . . . . . . 7.4.3 Free Energies and Minimal States . . . . . . . . . . . 7.4.4 Viscoelastic Solids with Unbounded Relaxation Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.4.5 Nonlinear Viscoelastic Models . . . . . . . . . . . . . 7.4.6 Some Examples . . . . . . . . . . . . . . . . . . . . . . . . Viscous Fluids with Memory . . . . . . . . . . . . . . . . . . . . . 7.5.1 Incompressible Viscoelastic Fluids . . . . . . . . . . 7.5.2 Compressible Viscoelastic Fluids . . . . . . . . . . . . 7.5.3 Maxwell-Like Viscoelastic Fluids . . . . . . . . . . . 7.5.4 Acceleration Waves in Viscous Fluids with Memory . . . . . . . . . . . . . . . . . . . . . . . . . . Electromagnetic Materials with Memory . . . . . . . . . . . . 7.6.1 Dielectrics with Memory . . . . . . . . . . . . . . . . . . 7.6.2 Conductors with Memory . . . . . . . . . . . . . . . . . 7.6.3 Polarizable Conductors with Memory . . . . . . . . 7.6.4 Magneto-Viscoelasticity . . . . . . . . . . . . . . . . . . 7.6.5 Rate Equations in the Eulerian Description . . . . 7.6.6 Solids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Causality and Kramers–Kronig Relations . . . . . . . . . . . . Memory Models via Fractional Derivatives . . . . . . . . . . 7.8.1 Preliminaries on Fractional Derivatives . . . . . . . 7.8.2 Fractional Derivatives . . . . . . . . . . . . . . . . . . . . 7.8.3 Heat Conduction via Fractional Derivatives . . . . 7.8.4 Wave Propagation Properties . . . . . . . . . . . . . . . Viscoelastic Models of Fractional Order . . . . . . . . . . . . .

Aging and Higher-Order Grade Materials . . . . . . . . 8.1 Aging . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.2 Aging of Thermoelastic Solids . . . . . . . . . . . . . . 8.3 Aging of Rate-Type Materials . . . . . . . . . . . . . . 8.3.1 Aging Properties . . . . . . . . . . . . . . . . . 8.4 Aging of Thermo-Viscoelastic Materials . . . . . . . 8.5 An Aging Model of Viscous Fluid with Memory 8.6 Damage . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.7 Fluids of Higher-Order Grade . . . . . . . . . . . . . . 8.7.1 Second Grade Fluid . . . . . . . . . . . . . . . 8.7.2 Third Grade Fluid . . . . . . . . . . . . . . . . 8.8 Interaction Effects via Materials of Higher Order 8.9 Modelling via the Interstitial Working . . . . . . . . 8.9.1 Modelling via the Extra-Entropy Flux . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . 406 . . . . . 407 . . . . . 409 . . . . . 416 . . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

424 429 431 433 435 438 442

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

. . . . . . . . . . . . . . .

449 452 454 457 462 469 475 479 480 482 483 484 488 490 492

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

. . . . . . . . . . . . . .

497 497 499 501 504 506 511 512 514 515 518 521 523 526

Contents

8.10 Materials of Korteweg Type . . . . . . . . . . . . . . . . 8.11 Hyperstress and Materials of Higher-Order Grade . 8.11.1 Solids of Higher-Order Grade . . . . . . . . . 8.12 A New Scheme Associated with the Hyperstress . 9

xv

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

528 534 535 538

Mixtures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.1 Kinematics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.2 Balance Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.2.1 Balance Equations in the Eulerian Description . 9.2.2 Mass Density in the Reference Configuration and Incompressibility . . . . . . . . . . . . . . . . . . . 9.3 Second Law of Thermodynamics . . . . . . . . . . . . . . . . . 9.3.1 Principle of Phase Separation . . . . . . . . . . . . . 9.3.2 Remarks on the Thermodynamic Restrictions . . 9.4 Balance Equations for the Whole Mixture . . . . . . . . . . 9.4.1 Stoichiometry Requirements on the Mass Growths . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.4.2 Balance Equation for the Diffusion Flux . . . . . 9.5 Constitutive Models for Fluid Mixtures . . . . . . . . . . . . 9.5.1 Extent of Reaction and Law of Mass Action . . 9.5.2 Non-reacting Mixtures with Several Temperatures . . . . . . . . . . . . . . . . . . . . . . . . . 9.5.3 Reacting Mixtures with a Single Temperature . 9.5.4 Gibbs Equations . . . . . . . . . . . . . . . . . . . . . . . 9.5.5 Second-Law Inequalities . . . . . . . . . . . . . . . . . 9.6 Mixtures of Ideal Gases . . . . . . . . . . . . . . . . . . . . . . . 9.6.1 Chemical Potentials . . . . . . . . . . . . . . . . . . . . 9.6.2 Mixing Entropy and Mixing Free Energy . . . . 9.7 Diffusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.7.1 Dynamic Diffusion Equation . . . . . . . . . . . . . . 9.7.2 Fourier-Like and Rate-Type Diffusion Equations . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.7.3 Maxwell–Stefan Diffusion Model . . . . . . . . . . 9.7.4 Coupled Effects Among Diffusion and Heat Conduction . . . . . . . . . . . . . . . . . . . . . . . . . . 9.7.5 Evolution Problems . . . . . . . . . . . . . . . . . . . . 9.8 Solid Mixtures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9.8.1 Constitutive Assumptions and Thermodynamic Restrictions . . . . . . . . . . . . . . . . . . . . . . . . . . 9.9 Immiscible Mixtures . . . . . . . . . . . . . . . . . . . . . . . . . . 9.10 Entropy Inequality and Models for the Whole Mixture . 9.10.1 Gibbs Equations and Restrictions on the Diffusion Flux . . . . . . . . . . . . . . . . . . . 9.10.2 Gibbs–Duhem Equation . . . . . . . . . . . . . . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

545 545 550 554

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

556 558 560 561 561

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

567 568 572 575

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

. . . . . . . . .

576 578 579 582 583 584 588 590 590

. . . .

. . . .

. . . .

. . . . . . 591 . . . . . . 593 . . . . . . 595 . . . . . . 598 . . . . . . 600 . . . . . . 601 . . . . . . 605 . . . . . . 607 . . . . . . 607 . . . . . . 609

xvi

10 Micropolar Media . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.1 Kinematics of Micropolar Media . . . . . . . . . . . . . . 10.1.1 Orientational Momentum . . . . . . . . . . . . . 10.2 Balance Laws . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.2.1 Balance Laws in the Spatial Description . . 10.2.2 Balance Laws in the Material Description . 10.2.3 Thermoviscous Micropolar Media . . . . . . . 10.3 Liquid Crystals . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.3.1 Modelling of Thermotropic Liquid Crystals 10.3.2 Director Field, Energy, and Objectivity . . . 10.4 Nematics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.4.1 Relation to Other Models Involving the Director Field . . . . . . . . . . . . . . . . . . . 10.5 Smectics and Cholesterics . . . . . . . . . . . . . . . . . . . 10.6 Mixtures of Micropolar Constituents . . . . . . . . . . . 10.6.1 Second Law Inequality . . . . . . . . . . . . . . . 10.7 Nanofluids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10.7.1 Brownian Diffusion and Thermophoresis . . 10.7.2 Model Approximations and Nanofluid Properties . . . . . . . . . . . . . . . . . . . . . . . . .

Contents

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

. . . . . . . . . . .

611 611 616 618 618 622 625 629 630 632 634

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

641 643 645 649 650 652

. . . . . . . . . 654 . . . . . . . . . .

. . . . . . . . . .

. . . . . . . . . .

. . . . . . . . . .

. . . . . . . . . .

. . . . . . . . . .

. . . . . . . . . .

. . . . . . . . . .

659 659 661 664 665 667 670 672 673 676

12 Electromagnetism of Continuous Media . . . . . . . . . . . . . . 12.1 A Thermodynamic Setting for Electromagnetic Solids . 12.2 Electroelasticity . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.3 Electroelastic Materials . . . . . . . . . . . . . . . . . . . . . . . 12.4 Dielectrics with Polarization Gradient . . . . . . . . . . . . 12.5 Fluids in Electromagnetic Fields . . . . . . . . . . . . . . . . 12.6 Magnetoelasticity . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.7 Micromagnetics and Magnetism in Rigid Bodies . . . . 12.7.1 Rate Equations and Thermodynamic Restrictions . . . . . . . . . . . . . . . . . . . . . . . . . 12.7.2 Evolution Equations of Magnetization . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

. . . . . . . .

681 682 684 688 695 699 703 710

11 Porous Materials . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11.1 Porous Materials as Mixtures . . . . . . . . . . . . . . . . . . 11.2 Constitutive Models of Porous Materials . . . . . . . . . 11.2.1 Darcy’s Law . . . . . . . . . . . . . . . . . . . . . . . 11.2.2 Darcy-Like Equations . . . . . . . . . . . . . . . . . 11.3 Special Models of Porous Materials . . . . . . . . . . . . . 11.4 Materials with Voids . . . . . . . . . . . . . . . . . . . . . . . . 11.5 Porous Media with Double Porosity . . . . . . . . . . . . . 11.5.1 Porous Material with Undeformable Solid . . 11.5.2 Porous Material with a Thermoelastic Solid .

. . . . . . . 712 . . . . . . . 719

Contents

xvii

12.8 Ferrofluids . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.8.1 Magneto-Rheological Fluids . . . . . . . . . . . . . . . . . 12.9 Magneto-, Electro-, and Mechanical-Optical Effects . . . . . . 12.9.1 Magneto-Optical Effects . . . . . . . . . . . . . . . . . . . . 12.9.2 Frequency Dependence, Closed Processes, Memory Functionals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.9.3 Electro-Optical Effects . . . . . . . . . . . . . . . . . . . . . 12.9.4 Mechanical-Optical Effects . . . . . . . . . . . . . . . . . . 12.9.5 Magnetocaloric and Electro-Caloric Effects . . . . . . 12.10 Plasmas . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12.10.1 One-Fluid Plasma Theory and Magnetohydrodynamics . . . . . . . . . . . . . . . . . . . . 12.11 Chiral Media and Optical Activity . . . . . . . . . . . . . . . . . . . 12.11.1 Wave Solutions in Chiral Media . . . . . . . . . . . . . . 12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites) . 12.12.1 Magnetization Model with Damping Effects . . . . . . Part IV

. . . .

. . . .

. . . .

724 731 732 732

. . . . .

. . . . .

. . . . .

737 744 748 750 754

. . . . .

. . . . .

. . . . .

756 775 778 780 785

Hysteresis and Phase Transitions

13 Plasticity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.1 Qualitative Aspects of the Stress–Strain Curve . . . . . . . . 13.1.1 Yield Criteria . . . . . . . . . . . . . . . . . . . . . . . . . . 13.2 Rate-Independent Scheme of Plastic Flow . . . . . . . . . . . 13.2.1 Consistency of Mises–Hill Model with a Defect Energy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13.3 A Temperature-Dependent Model . . . . . . . . . . . . . . . . . 13.4 Gradient Theories of Plasticity . . . . . . . . . . . . . . . . . . . . 13.5 Model of Plasticity via the Kröner Decomposition . . . . . 13.6 A Decomposition-Free Thermodynamic Scheme . . . . . . . 13.6.1 Duhem-Like Solids . . . . . . . . . . . . . . . . . . . . . . 13.6.2 Hyperelastic and Hypoelastic Solids . . . . . . . . . 13.6.3 Dissipative Duhem-Like Solids . . . . . . . . . . . . . 13.6.4 Elastic–Plastic Models . . . . . . . . . . . . . . . . . . . 13.6.5 One-Dimensional Models . . . . . . . . . . . . . . . . . 13.6.6 The Helmholtz Free Energy . . . . . . . . . . . . . . . 13.6.7 Some Hysteretic Models . . . . . . . . . . . . . . . . . . 13.6.8 Models with Additive Decomposition of the Strain Rate . . . . . . . . . . . . . . . . . . . . . . . 13.7 Constitutive Models of Polymeric Foams . . . . . . . . . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

793 793 795 798

. . . . . . . . . . . .

. . . . . . . . . . . .

. . . . . . . . . . . .

. . . . . . . . . . . .

. . . . . . . . . . . .

800 801 803 805 809 811 813 817 818 819 822 824

14 Superconductivity and Superfluidity . . . . . . . . . . . 14.1 Superconductors . . . . . . . . . . . . . . . . . . . . . . 14.1.1 Discoveries About Superconductivity . 14.1.2 The London Equations . . . . . . . . . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . .

. . . . . 827 . . . . . 829 . . . .

. . . .

. . . .

. . . .

835 835 835 837

xviii

Contents

14.1.3 An Alternative to London Equations . . . 14.1.4 A Nonlocal Model . . . . . . . . . . . . . . . . 14.1.5 Phase Transition Curve . . . . . . . . . . . . . 14.1.6 Ginzburg–Landau Theory . . . . . . . . . . . 14.2 Superfluids . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14.2.1 History of Superfluidity and Properties of Superfluids . . . . . . . . . . . . . . . . . . . . 14.2.2 First and Second Sound . . . . . . . . . . . . 14.2.3 Discontinuity Waves . . . . . . . . . . . . . . . 14.2.4 Mass Fractions and Chemical Potentials 14.2.5 Rotation of Superfluids . . . . . . . . . . . . . 14.2.6 The Ginzburg–Landau Theory . . . . . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

843 845 846 848 850

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. . . . . .

850 854 857 860 864 866

15 Ferroics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15.1 Evolution Function and Hysteretic Effects in Ferroelectrics . . . . . . . . . . . . . . . . . . . . . . . . . . 15.1.1 One-Dimensional Models of Ferroelectric Hysteresis . . . . . . . . . . . . . . . . . . . . . . . . 15.2 Ginzburg–Landau–Devonshire Theory . . . . . . . . . . 15.2.1 Landau Free Energy . . . . . . . . . . . . . . . . . 15.2.2 Landau–Devonshire Free Energy . . . . . . . . 15.3 Ferromagnetism . . . . . . . . . . . . . . . . . . . . . . . . . . 15.3.1 Thermodynamic Approach to Magnetic Hysteresis . . . . . . . . . . . . . . . . . . . . . . . . 15.3.2 One-Dimensional Models of Ferromagnetic Hysteresis . . . . . . . . . . . . . . . . . . . . . . . . 15.3.3 Curie–Weiss Law . . . . . . . . . . . . . . . . . . .

. . . . . . . . . 869 . . . . . . . . . 869 . . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

. . . . .

874 883 885 890 895

. . . . . . . . . 898 . . . . . . . . . 901 . . . . . . . . . 907

16 Phase Transitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16.1 Jump Conditions at Interfaces . . . . . . . . . . . . . . . . . . . . 16.2 Phase Transitions at Sharp Interfaces with T ¼ p1 . . . . 16.2.1 The Stefan Model . . . . . . . . . . . . . . . . . . . . . . . 16.3 Liquid–Vapour Transition . . . . . . . . . . . . . . . . . . . . . . . 16.3.1 Continuity of the Gibbs Free Energy . . . . . . . . . 16.3.2 Viscous Phases and Surface Tension . . . . . . . . . 16.3.3 The Phase Rule . . . . . . . . . . . . . . . . . . . . . . . . 16.4 Solid–Fluid Transition . . . . . . . . . . . . . . . . . . . . . . . . . . 16.4.1 Transition in a Finite Layer—A Mixture Model . 16.4.2 Evolution of the Mass Fraction . . . . . . . . . . . . . 16.5 Solidification-Melting of Binary Alloys . . . . . . . . . . . . . 16.6 Brine Channels Formation in Sea Ice . . . . . . . . . . . . . . .

. . . . . . . . . . . . .

. . . . . . . . . . . . .

. . . . . . . . . . . . .

. . . . . . . . . . . . .

. . . . . . . . . . . . .

909 909 912 914 915 917 923 925 925 927 931 932 936

Contents

xix

16.6.1 Transition Models via Minimization of Functionals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 939 16.7 Phase-Field Model of Liquid-Solid Transitions . . . . . . . . . . . . . 941 16.8 Phase Transitions in SMA . . . . . . . . . . . . . . . . . . . . . . . . . . . . 944 Appendix A: Notes on Vectors and Tensors . . . . . . . . . . . . . . . . . . . . . . . 951 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 991 Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1007

Part I

Basic Principles and Balance Equations

Chapter 1

Kinematics

This chapter deals with the kinematics of deformable bodies. Both deformation and motion of a body are developed by using the reference configuration; the position vector in the reference configuration is the operative label of the points of the body. The basic relations so determined for deformation and motion are essential to the next chapters. Attention is addressed to the topics of objectivity and objective time derivatives, thus establishing a general framework that proves remarkable in the description of material properties in terms of time derivatives. This framework shows the connection between various known objective time derivatives (Jaumann, Green–Naghdi, Cotter–Rivlin, Oldroyd, Truesdell). Transport relations are obtained for convecting (or non-convecting) sets, thus establishing basic properties for the derivation of (local) balance equations and jump conditions for discontinuous fields. Hence, the kinematical and the geometric relations are derived for singular surfaces which provide a general setting for the investigation of discontinuity waves. Moreover, the transport theorems for surface integrals are established thus leading, in particular, to the convected time derivative.

1.1 Frames of Reference and Configurations A frame of reference or observer F is an arbitrary set of rigidly fixed axes relative to which the position of points is determined. For simplicity, the chosen fixed axes are taken to be orthogonal and hence the corresponding unit vectors are an orthonormal basis; denote by {e1 , e2 , e3 } the chosen orthonormal basis. The time at which an event takes place can be specified relative to a particular event taken as a reference. An event is then a pair {P, t} consisting of a point P, in the three-dimensional Euclidean space E , and a time t. A chosen position O, origin, is taken as reference and hence we can identify each position P with the position vector x relative to the origin, © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_1

3

4

1 Kinematics

Fig. 1.1 A motion χ maps the point, labelled by the position vector X ∈ R, to the position vector x = χ(X, t)

x = P − O. Hence, a frame of reference F may be viewed as the Cartesian product of the set {E , O, {e1 , e2 , e3 }} and the real time axis R. A change of frame is a 1–1 mapping of space time onto itself such that distances, time intervals, and temporal order are preserved. An event {x, t} and its image {x∗ , t ∗ } under a change of frame are related by rigid transformations and a time shift. Chosen possibly different origins O, O ∗ , for the two observers, F , F ∗ , we can express a change of frame by t ∗ = t − a, x∗ = c(t) + Q(t)x, where a ∈ R, c(t) = O − O ∗ , and Q(t) is a rotation tensor, det Q(t) = 1. Bodies occupy regions of the Euclidean space E . To describe the evolution of a body, we need to label the points.1 This is accomplished by labelling the points of the body by the position in a reference configuration (or placement) R ⊂ E ; sometimes the body is identified with the reference configuration [216]. Upon choosing an origin, the positions in R are associated with a position vector; to make it apparent that we are dealing with the reference configuration the vector is denoted by a uppercase boldface letter, most often X. By definition, the reference configuration is independent of time. We denote by ∇ the gradient operator in E and by ∇R the gradient operator in R (Fig. 1.1). The motion of the body is described by saying the position of any point in E as a function of time. Granted a choice of origin also for the position in the current configuration, the current position vector, say x, is then a function of the point (at) X and time t. Hence, a motion of the body is a (smooth) function χ that assigns to each position (vector) in R and time t a position x = χ(X, t) in space; we say that X is a material vector while x is a spatial vector. Symbolically, χ(R, t) ⊂ E denotes the current configuration (at time t). The reference configuration R might be the configuration at a chosen initial time t0 but need not be so. If X is fixed then χ(X, t) describes the motion of the pertinent point in time. Hence χ(X, ˙ t) := ∂t χ(X, t),

1

χ(X, ¨ t) := ∂t2 χ(X, t),

The term points is used, instead of particles, to make it clear that no internal structure is ascribed.

1.2 Deformation

5

are the velocity and acceleration of the point X at time t. If, instead, t is fixed then χ is a mapping from R to the current configuration, at time t. For brevity, throughout X ∈ R means that X is the position vector of a point in R. Likewise, x ∈ D ⊂ E means that x is the position vector of a point in D.

1.2 Deformation To begin with, we restrict attention to a fixed time t and hence χ(X, t), as a function of X, describes the deformation at time t. Hence, the dependence on the parameter t is understood and we let x = χ(X). It is assumed that χ(X) is a 1–1 mapping so that two material points, X  = Y, cannot occupy the same position in the current configuration, x  = y. Assume also that χ is differentiable. Hence, there is a tensor F(X) such that χ(Y) − χ(X) = F (Y − X) + o(|Y − X|), F being a tensor function of the position X and possibly of t; F is said to be the deformation gradient. To within o(|Y − X|), F maps material vectors, Y − X, to spatial vectors, χ(Y) − χ(X). If χ depends linearly on X then F is independent of X, χ(Y) − χ(X) = F (Y − X), and the deformation is said to be homogeneous. Hence, in general, a deformation approaches the corresponding homogeneous deformation, with F = F(X); the closer Y is to X the more χ approaches a homogeneous deformation. In components,2 there is a matrix F ∈ R3×3 such that, for any two points X, Y ∈ R, χh (Y) − χh (X) = Fh K (X)(Y K − X K ) + o(|Y − X|),

h = 1, 2, 3.

The entries {Fh K } are the components of the deformation gradient F = ∇R χ,

Fh K = ∂ X K χh .

To determine the effects of deformation on lengths, areas, and volumes, it is convenient to establish some properties of the deformation gradient. First we observe that the invertibility of χ implies J := det F = 0. 2

As in well-known textbooks [152, 428, 429], to make the notation more explicit, throughout in suffix notation, we use capital indices for quantities related to the reference configuration.

6

1 Kinematics

Indeed since the trivial deformation x=X gives J = 1 we may regard the deformation as a continuous process and hence we assume J > 0. Also, we let

j = det F−1

and hence J j = 1. As any invertible tensor, the deformation gradient F satisfies the polar decomposition F = RU = VR, where R ∈ Orth while U, V ∈ Psym. To show this property, we first observe that FT F ∈ Psym in that, for any vector w, w · FT Fw = (Fw) · (Fw) ≥ 0 and, since F is invertible, Fw = 0

⇐⇒

w = 0.

√ We can then define3 U = FT F. Since FT F ∈ Psym then the 3eigenvalues, say {μi }, T } be the eigenvectors and then F F = are positive; let {N i i=1 μi Ni ⊗ Ni . Hence √ 3 √ U = FT F is defined as U = i=1 λi Ni ⊗ Ni , λi = μi . Hence also U is invert3 ible and U−1 = i=1 (1/λi )Ni ⊗ Ni . We can then write F = RU and take it as the definition of R; we find that R = FU−1 ,

R T R = U−1 FT FU−1 = 1

and, if J > 0 then det R = 1. Accordingly, R is a rotation if J > 0 and is orthogonal if merely J  = 0. Given U : U2 = FT F the tensor R is unique in that by ˆ RU = RU ˆ it follows R = R. Likewise, since FFT ∈ Psym we can define V ∈ Psym such that FFT = V2 . 3

The uniqueness of U is proved in [216], p. 32.

1.2 Deformation

7

˜ It follows det R ˜ = 1 if J > 0. Now by Let F = VR. ˜ =R ˜R ˜ T VR ˜ = R( ˜ R ˜ T VR) ˜ F = VR and the uniqueness of the decomposition RU, it follows ˜ = R, R

V = RUR T .

We can then write the following statement. Theorem 1.1 (Polar decomposition) If F is an invertible tensor with det F > 0 then there are unique, symmetric, positive-definite tensors U and V and a rotation R such that F = RU = VR. We say that F = RU and F = VR are the right and left polar decompositions of F. A positive-definite symmetric tensor represents a state of pure stretches along three mutually orthogonal axes. Therefore, the polar decomposition means that any (homogeneous) deformation may be viewed as the result of a pure stretch U and a rotation R or the same rotation R followed by the stretch V. Accordingly, U is called the right stretch tensor and V the left stretch tensor. In calculations, it proves more convenient to use the right and left Cauchy–Green tensors C = U2 = FT F,

B = V2 = FFT .

When F is a rotation, F = R, FT F = 1. Also F = 1 if the deformation is the identity transformation x = X. It is then useful to use the Green-St. Venant strain tensor E = 21 (C − 1). The vector u = x − X = χ(X) − X is called the displacement of the point (at X). The displacement gradient is the tensor4 H = ∇R u = F − 1. Hence C = (H + 1)T (H + 1) = 1 + HT + H + HT H and E = 21 (H + HT ) + 21 HT H. 4

To follow the standard notation in continuum mechanics, the symbols B, H, E, and D are used. When B, H, E, and D are used within electromagnetism they denote the magnetic induction, the magnetic field, the electric field, and the electric displacement.

8

1 Kinematics

It is useful sometimes to consider the linear part E of E, i.e. E := 21 (HT + H); we refer to E as the strain tensor. Since E = E + HT H then for small deformations, that is when H 1, we can use the approximation E  E. Further E = symH = sym[∇u(1 + H)]. The infinitesimal strain tensor is defined by ε := sym∇u. Hence, for small deformations E  E  ε. It is worth observing that C and B admit the same principal invariants. By direct calculations, it follows that tr C = tr FT F = F · F,

tr B = tr FFT = F · F,

tr C2 = tr (FT FFT F) = (FT F) · (FT F), tr B2 = tr (FFT FFT ) = (FT F) · (FT F), det C = det FT F = (det F)2 ,

det B = det FFT = (det F)2 .

Hence, we find the common values of the principal invariants I1 = tr C = tr B, I2 = 21 [(tr C)2 − tr C2 ] = 21 [(tr B)2 − tr B2 ], I3 = det C = det B. By the polar decomposition F = RU = VR, it follows that, associated with a deformation gradient F, there are various rotation-independent tensors related to U and V. The right Cauchy–Green deformation tensor (or Green’s deformation tensor) is defined by C = U2 = FT F. The inverse C−1 is denoted by F , F = C−1 = F−1 F−T . As to V, the left Cauchy–Green deformation tensor is denoted by B and defined by B = V2 = FFT . The inverse B−1 is denoted by C ,

1.2 Deformation

9

C = B−1 = F−T F−1 . In the literature, F is called the Finger tensor (and denoted by f). The inverse C of B was introduced by Cauchy and is denoted by c; as with tensors, we prefer to use a C instead of c). In addition, C is called Piola tensor, and Finger tensor in capital letter (C the fluid dynamics literature. We avoid the use of Piola tensor for C because the Piola, or Piola-Kirchhoff, tensor is a well-known stress tensor of continuum mechanics. We now examine the spectral representation of F. The common eigenvalues of C = U2 and B = V2 are λi2 , i = 1, 2, 3. Hence C=

3

2 i=1 λi Ni

⊗ Ni ,

B=

3

2 i=1 λi ni

⊗ ni

and U=

3

i=1 λi Ni

⊗ Ni ,

V=

3

i=1 λi ni

⊗ ni =

3

i=1 λi RNi

⊗ RNi .

Since ni = RNi , we can write R=

3

j=1 n j

⊗ Nj;

R is the active rotation R as described in Sect. 1.4.1. Hence, F is represented in the form 3

F = RU = (

j=1 n j

3  ⊗ N j )( i=1 λi Ni ⊗ Ni ) = 3j=1 λ j n j ⊗ N j .

From the spectral representation, we can derive the matrix representation. If the deformation is a rotation, then  U = 1. Let R be given by a rotation of the angle θ around n3 = N3 . Since R = 3j=1 n j ⊗ N j , we find Rhk = Nh · RNk = Nh · n j N j · Nk = Nh · nk . It follows that



cos θ − sin θ R = ⎣ sin θ cos θ 0 0

⎤ 0 0⎦ . 1

The Green-St Venant strain tensor E describes the deviation of the present configuration from the reference one, where F = 1 and then also C = 1. Hence, a better description of deformation might involve E. Now F−T (C − 1)F−1 = F−T (FT F − 1)F−1 = 1 − F−T F−1 = 1 − B−1 . Hence, letting

E A := 21 (1 − B−1 )

10

1 Kinematics

we have

E A = F−T EF−1 .

The tensor E A is called the Eulerian Almansi strain tensor.

1.2.1 Effects on Lengths, Areas, and Volumes Distances and Lengths Look at two position vectors X, Y ∈ R and the corresponding spatial position vectors x = χ(X), y = χ(Y) in the deformed region χ(R) ⊂ E . It is y − x = F(Y − X) + o(|Y − X|), F being evaluated at X. To within o(|Y − X|), we have y − x = F(Y − X)

(1.1)

and hence (y − x) · (y − x) = [F(Y − X)] · [F(Y − X)] = (Y − X) · FT F(Y − X) = (Y − X) · C(Y − X).

Then the distance l = |y − x| between the points in the deformed region is given by l 2 = (Y − X) · C(Y − X);

(1.2)

the length l depends on the material vector Y − X and not merely on the length |Y − X|. That is why the right Cauchy–Green tensor C is viewed as a metric tensor. If we are interested also in the direction of y − x, then we consider (1.1) and observe that the unit vector e = (Y − X)/|Y − X| is mapped to y−x = Fe. |Y − X| The ratio of the lengths

|y − x| = |Fe|, |Y − X|

is consistent with the action of deformation via the right Cauchy–Green tensor C, |y − x|2 = |Y − X|2 e · Ce = |Y − X|2 e · FT Fe = |Y − X|2 |Fe|2 .

1.2 Deformation

11

The deviation of the deformation from the identity deformation is well described by E. To get a detailed effect of E, we let y − x = F(Y − X) and find |y − x|2 − |Y − X|2 = (Y − X) · (C − 1)(Y − X). Since Y − X = F−T (y − x) we find |y − x|2 − |Y − X|2 = F−1 (y − x) · (C − 1)F−1 (y − x) = (y − x) · F−T (C − 1)F−1 (y − x).

Hence |y − x|2 − |Y − X|2 = (Y − X) · 2E(Y − X) = (y − x) · 2E A (y − x). The Green-St. Venant tensor E yields the difference |y − x|2 − |Y − X|2 in terms of the vector Y − X while the Eulerian Almansi tensor, E A or e, yields the difference in terms of y − x. We now examine the length of a curve. Let C be a curve in R represented parametrically as ˆ Y = Y(λ), λ ∈ [a, b]. By deformation, C is mapped to C given by Y(λ) → y(λ) = χ(Y(λ)). The line integral of a continuous function f defined on C reads b

b

a

a

∫ f (y(λ))|y (λ)|dλ = ∫ f (y(χ(λ)))|χ (Y(λ))|dλ where  means derivative with respect to the parameter λ. Since χ (Y(λ)) = F(Y(λ))Y (λ) then b

b

a

a

∫ f (y(λ))|y (λ)|dλ = ∫ f (χ(Y(λ)))|F(Y(λ))Y (λ)|dλ. Let f = 1 and observe that b

L = ∫ |y (λ)|dλ a

is the length of C while

b a

|Y (λ)|dλ is the length of C. Hence b

L = ∫ |F(Y(λ))Y (λ)|dλ. a

By means of the polar decomposition, we can write

12

1 Kinematics

|F Y | = (R U Y · R U Y )1/2 (R T R U Y · U Y )1/2 = |U Y |. As a consequence b

L = ∫ |U(Y(λ))Y (λ)|dλ, a

so that the length is unchanged if F is (locally) a pure rotation, U = 1. This shows that U  = 1 does not preserve lengths. A deformation that preserves the distance between points is said to be rigid; χ is rigid if |χ(Y) − χ(X)| = |Y − X|, ∀ Y, X ∈ R. (1.3) By (1.3), it follows [χ(Y) − χ(X)] · [χ(Y) − χ(X)] = (Y − X) · (Y − X). Differentiation with respect to X and Y yields FT (Y)F(X) = 1. Letting X = Y we find

FT (X)F(X) = 1

and hence F is a rotation, say R, at any point of the body. Consequently F(Y)FT (Y)F(X) = F(Y) implies F(X) = F(Y). Hence, in a rigid deformation F is a rotation R, independent of the position. Moreover, by integration of ∇R χ = R we have χ(Y) = χ(X) + R(Y − X).

(1.4)

It is of interest to examine the (approximate) description of small rigid deformations. By (1.4), since R = F then letting χ(Y) = Y + u(Y), we have u(Y) − u(X) = (−1 + F)(Y − X). By FT F = C = U2 = 1, it follows (1 + HT )(1 + H) = 1,

HT + H = 0

1.2 Deformation

13

where HT H has been neglected in the last relation. In this approximation F − 1 = H ∈ Skw. As a further consequence, it follows the projection property [u(Y) − u(X)] · (Y − X) = 0,

∀ Y, X ∈ R.

Volumes To establish the effect on volumes, we consider a sub-region P ⊆ R and let χ(P) ⊂ E be the region produced by deformation. Let f be a function defined at the points of the body so that f (y) = f (χ(Y)) =: F(Y), where Y ∈ P. We can view (Y1 , Y2 , Y3 ) → yi = χi (Y1 , Y2 , Y3 ), i = 1, 2, 3,

Y → y = χ(Y),

as a change of variables. Hence ∫ f (y)dv = ∫ F(Y)J (Y)dv R .

χ(P)

P

Let f = 1. Then it follows volχ(P) = ∫ 1 dv = ∫ J (Y)dv R . χ(P)

P

Now, for any X ∈ P, we can write ∫ J (Y)d V = ∫ J (X)dv R + ∫[J (Y) − J (X)]dv R . P

P

P

Granted the continuity of J , if d is the diameter of P, we have ∫ J (Y)d V = [J (X) + O(d)] ∫ 1 d V P

P

and hence vol χ(P) = [J (X) + O(d)]vol P. At the leading order, as d → 0, vol χ(P) = J vol P. If J = 1 the volume is constant. Now, since F = 1 + H then J = det[1 + H] = 1 + tr H + o(|H|).

(1.5)

14

1 Kinematics

In the approximation of small deformations |o(H)| < |tr H| and hence the requirement J = 1 implies (1.6) tr H = tr E = 0, ∇R · u = 0.

Deformation of Normals and Areas Consider a surface S ⊂ R and let S = χ(S) be the deformed surface. Let X be (the position vector of) a point in S and let x = χ(X). A curve C ⊂ S is represented parametrically as ˆ ˆ 0 ) = X. Y = Y(λ), Y(λ ˆ  (λ0 ) is tangent to C at X. Via the deformation function χ, the image The vector Y of C is a curve C in S, C = χ(C), described by ˆ yˆ (λ) = χ(Y(λ)),

yˆ (λ0 ) = x.

Hence yˆ  (λ0 ) is tangent to C at yˆ (λ0 ) = x. The derivative with respect to λ of the composite function yˆ (λ) results in ˆ  (λ0 ); yˆ  (λ0 ) = FY this is the relation between corresponding tangent vectors in S and S = χ(S). This means that if t R and t are corresponding tangent vectors (not necessarily unit vectors) then t = Ft R . At the chosen points (X and x) in S and S the (not necessarily unit) normals n R and n are subject to n · t = 0, n R · t R = 0, for any tangent vectors t R , t; the directions of n R , n are unique. Now, 0 = n · t = n · Ft R = (FT n) · t R . Hence we can define n R = FT n,

n = F−T n R .

Quite naturally, it is technically more convenient to have unit normals n R , n and to know the relation between areas in S and S. This is accomplished as follows. The surface S is described parametrically by Y = φ(u, v) while S = χ(S) is described by y = χ(φ(u, v)), where (u, v) ∈ D ⊂ R2 . Hence we have nR =

∂u φ × ∂v φ , |∂u φ × ∂v φ|

n=

∂u χ(φ) × ∂v χ(φ) . |∂u χ(φ) × ∂v χ(φ)|

1.2 Deformation

15

In view of the definition of surface integral we can write ∫ g n da = ∫ g(u, v)∂u χ(φ(u, v)) × ∂v χ(φ(u, v)) dudv, S

D

∫ G n R da R = ∫ G(u, v)∂u φ(u, v) × ∂v φ(u, v) dudv, S

D

To compute ∂u χ(φ(u, v)) × ∂v χ(φ(u, v)) we observe that [∂u χ(φ(u, v)) × ∂v χ(φ(u, v))]i = i hk Fh P Fk Q ∂u φ P ∂v φ Q . Now, by i hk Fi N Fh P Fk Q =  N P Q J we obtain

i hk Fh P Fk Q = J FN−1 i N P Q .

Consequently it follows [∂u χ(φ(u, v)) × ∂v χ(φ(u, v))]i = J FN−1 i  N P Q ∂u φ P ∂v φ Q , whence ∂u χ(φ(u, v)) × ∂v χ(φ(u, v)) = J F−T [∂u φ(u, v) × ∂v φ(u, v)]. Thus ∫ g n da = ∫ g(u, v)∂u χ(φ(u, v)) × ∂v χ(φ(u, v)) dudv S

D

= ∫ g(u, v)J F−T [∂u χ(φ(u, v)) × ∂v χ(φ(u, v))]dudv = ∫ g J F−T n R da R . D

S

This result indicates how to express the integral of gn on S in terms of an integral on S. Moreover, let g = 1 and denote by A, A the areas of S, S. Hence we observe that, at the leading order when D shrinks to a point, the desired relations for normals and areas follow in the form n A = J F−T n R A,

A = J |F−T n R | A.

(1.7)

Two remarks are in order. Mnemonically we might say that the connection between the integrals on S and χ(S) is obtained by letting n da = J F−T n R da R .

16

1 Kinematics

This relation is known as Nanson formula.5 The second remark is about the relation between areas. By (1.7), we might say that j = J |F−T n R | is the areal Jacobian. In [216], via a different procedure and the choice e3 = n R , the equivalent result j = |Fe1 × Fe2 | is obtained. We have proved that, for any function g on χ(R), ∫ g n da = ∫ g J F−T n R da R . S

S

If g is replaced with a vector g, or likewise with a tensor, then we have ∫ g · n da = ∫ J g · (F−T n R )da R = ∫ J (gF−T ) · n R da R . S

S

S

Hence, letting

g R := J g F−T

(1.8)

we have ∫ g · n da = ∫ g R · n R da R . S

S

As a comment, let S, S be closed surfaces and denote by P, P the regions bounded by S, S. In view of the divergence theorem we can write ∫ ∇ · g dv = ∫ ∇ R · g R dv R .

P

P

Since ∫P ∇ · g dv = ∫P J ∇ · g dv R then the arbitrariness of P implies ∇ R · g R = J ∇ · g.

(1.9)

1.2.2 Some Identities Involving the Deformation Gradient Given an invertible matrix F, the entries of F −1 depend on the entries of F such that −1 −1 ∂ Fm K FM−1 k = −FK k FM m .

Since FM−1 k Fk P = δ M P we observe −1 −1 0 = ∂ Fm K (FM−1 k Fk P ) = (∂ Fm K FM k )Fk P + FM k δkm δ K P 5

See, e.g., [429], p. 20; [147], p. 71.

(1.10)

1.2 Deformation

17

whence, upon multiplying by FP−1 l , −1 −1 δkl (∂ Fm K FM−1 k ) = −FP l FM k δkm δ K P .

The result (1.10) follows. The assumed invertibility of the deformation gradient implies that (1.10) holds for the entries of the deformation gradient matrix. In dealing with the connection between fields in R and χ(R) the identities ∂ X K (J FK−1 i ) = 0,

∂xk (J −1 Fk H ) = 0

(1.11)

prove convenient.6 A direct proof can be given by viewing J FK−1 i as a function of the entries Fm H and observing −1 −1 −1 ∂ X K (J FK−1 i ) = ∂ Fm H (J FK i )∂ X K Fm H = [(∂ Fm H J )FK i + J ∂ Fm H FK i ]∂ X K Fm H .

Since, by (A.16),

∂ Fm H J = J FH−1 m,

in view of (1.10) we have −1 −1 −1 −1 ∂ X K (J FK−1 i ) = (J FH m FK i − J FH i FK m )∂ X K Fm H .

Now ∂ X K Fm H = ∂ X K ∂ X H f m = ∂ X H Fm K . Hence, for each m and i, in the product −1 −1 −1 (J FH−1 m FK i − J FH i FK m )∂ X K Fm H −1 −1 −1 the factor (J FH−1 m FK i − J FH i FK m ) is skew in H K whereas ∂ X K Fm H is symmetric; the sum over H and K vanishes, and hence ∂ X K (J FK−1 i ) = 0, which completes the proof. The second identity in (1.11) follows by duality, that is by interchanging x and X. Identity (1.11) follows also from (1.9). By the relation between g and g R we have −1 −1 ∇R · g R = ∂ X K (J gi Fi−T ) = J ∂ X K gi FK−1 K i + gi ∂ X K (J FK i ) = J ∇ · g + gi ∂ X K (J FK i ).

By (1.9) we have ∇R · g R = J ∇ · g and hence a nonzero gi implies (1.11).

6

These identities were first given by Piola and Jacobi [429], p. 18.

18

1 Kinematics

1.3 Motion A motion is described by a function χ on R × R such that x = χ(X, t) is the position (vector) at time t of the point at X in the reference configuration R. At any time t, χ(X, t) describes a deformation such that R → Rt ⊂ E . Consistently the function is assumed to be invertible in that X = χ−1 (x, t) associates with each time t and spatial position x ∈ Rt the position X in the reference configuration. The function χ is assumed to be differentiable, on R; the deformation gradient F = ∇R χ(X, t) is then a function on R × R. Moreover it is assumed that J = det F > 0. The function χ is assumed to be a C 2 function with respect to time t. For any set of points in P ⊆ R we let Pt = χt (P) = χ(P, t) denote the region occupied by the points of P at time t; this can be the case of sub-regions, surfaces, curves. We then say that P deforms to Pt , at time t, and that a time-dependent spatial set Pt convects with the body if there is a set P of material points such that Pt = χt (P) for all t. Let ϕ be a scalar, vector, or tensor quantity. By ϕ = φ(x, t) we mean a function providing ϕ, at time t, in a spatial domain D ∈ E . If ϕ is defined on a set of points of the body then φ(x, t) = φ(χ(X, t)) =: (X, t), where x ∈ Pt , X ∈ P ⊆ R; we say that φ is the spatial description of ϕ while  is the material description. The connection between gradients in the two descriptions is given by the chain rule so that ∂ X K  = ∂xi φ ∂ X K χi ,

∇R  = ∇φ F = FT ∇φ.

Things are more involved about the time derivative in that the derivatives ∂t φ(x, t),

∂t (X, t)

are in general unequal. Indeed, by means of the chain rule ∂t (X, t) = ∂t φ(χ(X, t), t) = ∂t φ(x, t) + ∂xi φ(x, t) ∂t χi (X, t). Since ∂t χ(X, t) is the velocity, v, of the point X then ∂t  = ∂t φ + v · ∇φ. To simplify the notation, customarily only one symbol is used, here φ, and we let φ˙ stand for ∂t  so that (1.12) φ˙ = ∂t φ + v · ∇φ.

1.3 Motion

19

It is common to say that φ˙ is the material time derivative7 and ∂t φ is the spatial time derivative. By definition, φ˙ is the derivative at X fixed namely the derivative observed by following the motion of the material point X. Instead, ∂t φ is the time derivative relative to an observer fixed in space, at the point x. The difference between the two derivatives, v · ∇φ, is called the convective part and is due to the motion of the continuum. Denote by v(x, t) the velocity field in the spatial description. The spatial tensor field L = ∇v, L i j = ∂x j vi , is called the velocity gradient. To find the connection with the velocity gradient in the material description we assume χ is a C 2 function and observe that F˙i K = ∂t Fi K (X, t) = ∂ X K ∂t χi (X, t) = ∂x j vi (x, t)∂ X K χ j (X, t) = L i j F j K . Hence we have the identity

F˙ = LF.

To determine the material time derivative of J = det F we apply (A.17) to obtain J˙ = J tr (F˙ F−1 ) = J tr L. Since tr L = ∇ · v then

J˙ = J ∇ · v.

(1.13)

The inverse j = 1/J satisfies (1/J )˙ = −(1/J 2 ) J˙ and hence ( j)˙ = − j∇ · v.

(1.14)

Since˙ = ∂t + v · ∇ then (1.14) can be written ∂t j + ∇ · ( jv) = 0. If ∇ · v = 0 then J is constant and the volume is constant. Conversely if J is constant then (1.13) becomes ˙ 0 = J FK−1 i Hi K . 7 In the literature other adjectives are used for the material time derivative: advective, convective, hydrodynamic, Lagrangian, substantial, total.

20

1 Kinematics

In the approximation of small deformations J FK−1 i → δi K and hence ˙ = ∇R · u˙ = ∇R ˙· u. 0  tr H If u(X, t0 ) = 0 and hence ∇R · u = 0 at some time t0 then, consistent with (1.6), it follows ∇R · u = 0.

1.3.1 Stretching and Spin Tensors Let v(x, t) be the spatial description of the velocity field v. If, as we assume it to be, v is differentiable then v(y) − v(x) = L(x)(y − x) + o(|y − x|), the dependence of v and L on t being understood and not written. Let D and W be the symmetric and skew parts of L, D = 21 (L + LT ),

W = 21 (L − LT ).

Then L = D + W and v(y) − v(x) = W(x)(y − x) + D(x)(y − x) + o(|y − x|). The tensors W and D are called the spin and the stretching. It is worth establishing a connection between L and the tensors U, V, R of the polar decomposition. Substitution of F with RU yields ˙ −1 = (RU ˙ + RU)U ˙ −1 R T = RR ˙ T + RUU ˙ −1 R T . L = FF Now, time differentiation of RR T = 1 results in ˙ T = −(RR ˙ T )T . ˙ T = −RR RR ˙ T ∈ Skw. Yet, in general W = RR ˙ T ; indeed This implies that RR ˙ −1 )]R T , D = R[sym(UU

˙ T + R[skw(UU ˙ −1 )]R T . W = RR

The stretching D (as well as the spin W and the velocity gradient L), maps spatial vectors to spatial vectors, while the Green-St.Venant strain E = 21 (C − 1) maps material vectors to material vectors. Nevertheless, there is a direct connection ˙ By direct substitutions, we have between D and E.

1.3 Motion

21

˙ −1 + F−1 F˙ T )F = FT F˙ + F˙ T F = 2 E. ˙ 2FT DF = FT (L + LT )F = FT (FF Hence, we get the identity

˙ FT DF = E.

A motion χ is rigid if, at each time t, for any two points X, Y of the body ∂t |χ(Y, t) − χ(X, t)| = 0. The time derivative yields (y − x) · [v(y, t) − v(x, t)] = 0, being understood that x = χ(X, t), y = χ(Y, t). The gradient with respect to x and next the gradient with respect to y result in v(x, t) − v(y, t) − LT (x, t)(y − x) = 0, L(y, t) + LT (x, t) = 0. The continuity of L allows us to let y = x and to obtain L ∈ Skw. Back to L(y, t) = L(x, t) it follows that L = W is independent of the position and hence L = W(t). Thus v(y, t) = v(x, t) + W(t)(y − x). We know8 that there is a 1–1 correspondence between skew tensors and axial vectors. Let w be the axial vector associated to W; we have W = w × and w j = − 21  j pq W pq = − 14  j pq (∂xq v p − ∂x p vq ) = 21 (∇ × v) j . Thus, in a rigid motion v(y, t) = v(x, t) + 21 (∇ × v) × (y − x), ∇ × v being a function of time t. The vector  = ∇×v is called the vorticity. Hence we can also write9 W = 21  × . 8

See Appendix A. In the literature the vorticity ∇ × v is denoted by ω. To avoid ambiguities with the angular velocity, throughout we denote the vorticity ∇ × v by  rather than by ω. 9

22

1 Kinematics

In a rigid motion, the vorticity is independent of the position. A motion is irrotational if the spin W, as well as the vorticity , is zero at any point and time. For any velocity field, the velocity gradient L and the vorticity  satisfy the identity L  = D . (1.15) The proof follows by observing that L  = W  + D  = 21  ×  + D  = D .

1.4 Rotations and Angular Velocities Consider a conventionally fixed frame of reference associated with orthonormal base vectors (e1 , e2 , e3 ) and a rotating frame of reference associated with orthonormal base vectors (k1 , k2 , k3 ). Let R(t) be the time-dependent rotation matrix defined by Ri j (t) = ki (t) · e j .

(1.16)

Hence, the two bases are related by R in the form ki = (ki · e j )e j = Ri j e j . For any vector u, we have u · ki = u · Ri j e j = Ri j u · e j . Letting u i := u · ki and U j := u · e j we conclude that u i = Ri j U j . This transformation of vector components shows that the result of the rotation is geometrically that of the rotation of axes. We can then say that the definition (1.16) characterizes passive rotations or rotations of the axes. Owing to the orthonormal property of the two bases, R is orthogonal δi j = ki · k j = Ri h R jk eh · ek = Rik R jk . Upon time differentiation, it follows k˙ i = R˙ i j e j = R˙ i j R Tjh kh = Oi h kh , where

(1.17)

1.4 Rotations and Angular Velocities

23

O = R˙ R T ∈ Skw. Let ω :=

1 i 2

Since kh · ki = δi h ,

ki × k˙ i .

(1.18)

kh · k˙ i = −k˙ h · ki

it follows that   ω × kh = − 21 i kh × (ki × k˙ i ) = − 21 i [(kh · k˙ i )ki − (kh · ki )k˙ i ]   = 21 i [(kh · k˙ i )ki − (kh · ki )k˙ i ] = 21 i [(k˙ h · ki )ki + δhi k˙ i ) = k˙ h . Thus, we have

k˙ h = ω × kh ,

h = 1, 2, 3.

(1.19)

Equation (1.19) is sometimes called Poisson relation (of kinematics) for the rate of change of the corotational unit vectors. Indeed, by (1.19), we state that ω is the angular velocity of {k j } relative to {e j }. This view is consistent with the classical relation  du  du = + ω × u, space dt dt body where (du/dt)body is the derivative of u as seen from the rotating frame. Formally, u˙ = u˙ i ki + u i k˙ i = u˙ i ki + u i ω × ki = u˙ i ki + ω × u, thus showing that

 du dt

space

˙ = u,

 du dt

body

= u˙ i ki .

In view of (1.17), we find the connection between the angular velocity ω and the rotation matrix R. Upon substitution of k˙ i from (1.17) into (1.18), it follows ω=

1 i ki 2

× Oi h kh = 21 li h Oi h kl ,

whence ω · kl = 21 li h Oi h . Relative to the (fixed) basis {e j } we have ω = ωh kh = ωh Rh j e j = 21 Rh j hi p Oi p e j , whence ω · e j = 21 R Tjh hi p Oi p .

(1.20)

24

1 Kinematics

It is worth remarking that in both cases, ω involves 21 hi p Oi p which is the opposite of the axial vector (h-component) of O. Sometimes the angular velocity is defined via the right-hand rule, i.e. when the fingers of the right hand curl in the direction of the rotation, the thumb points in the direction of the angular velocity vector. This means that, relative to the right-handed basis (e1 , e2 , e3 ), in a counterclockwise rotation in the plane {e1 , e2 }, the angular velocity is directed along k3 = e3 . Let θ be the angle of rotation, then ⎡

⎤ cos θ sin θ 0 R = ⎣ − sin θ cos θ 0 ⎦ , 0 0 1



⎤ 0 10 R˙ R T = θ˙ ⎣ −1 0 0 ⎦ . 0 00

Hence, as expected, the relation ω · ki = 21 i jk ( R˙ R T ) jk results in ω = θ˙ k3 . The matrix R so defined determines the new coordinates of a point once the axes are rotated; R is said to represent a passive rotation (or passive transformation).

1.4.1 Active Rotations Sometimes in continuum physics, and usually in linear algebra, rotations are defined by transforming points, or rotating vectors, instead of axes. Hence, in the change of basis {ei } → {ki }, we might define the rotation R in the form Rei = ki . This definition characterizes R as an active rotation. We now establish some properties of active rotations and the connection between the matrices of passive and active rotations. Consider the transformation {ei } → {ki }; depending on the representation, we can write R ei = ki . ki = Rik ek , Hence, R is the matrix of the passive rotation. Concerning R , we observe that the orthonormality of {ki } implies δi j = ki · k j = R ei · R e j = ei · R T R e j ; the orthonormality of {ei } implies that R is orthogonal, R T R = 1. Since ki = Ri j e j then inner multiplication by ek of

1.4 Rotations and Angular Velocities

25

R ei = Ri j e j implies Rki = ek · R ei = Rik ; the matrix R associated with R , in the basis {ei }, is the transpose of R. Now observe that   ˙ ei . ω = 21 i ki × k˙ i = 21 i R ei × R ˙ j p Rhp we find Re p )h = eh · R e p = Rhp then letting A j h = R Since (R ω=

1 i R ei 2

˙ ei = el 1 lh j Rhp R ˙ j p = el 1 lh j A j h = aA , ×R 2 2

(1.21)

˙ T. aA being the axial vector of A = RR As a check of consistency of (1.20) and (1.21), we now show that the same result for ω follows by computation via the passive rotation matrix R and the active rotation matrix R, namely (1.22) ω = 21 Rhl hi p Oi p el = 21 lh j A j h el . We start with the passive matrix R, recall that R = RT , and observe ˙ ki Rkp . ωl = 21 Rhl hi p Oi p = 21 Rhl hi p R˙ ik R pk = 21 Rlh hi p R Since Rlh hi p Rkp = l jk R ji then

˙ ki Rkp = l jk R ˙ ki R ji = l jk Ak j , Rlh hi p R

and the conclusion follows. In addition, we observe that ˙ R T )ki . k˙ i = (R Borrowing from the evolution of a triad of eigenvectors [219, 225], we may refer to ˙ R T as the twirl tensor of the triad {ki }. Hence, in terms of the twirl tensor, A =R we have k˙ i = A ki = ω × ki .

26

1 Kinematics

1.5 Transport Relations for Convecting Sets We now compute the effects of motion on integrals over sets convecting with the body. Theorem 1.2 (Reynolds’ transport) Let P be any sub-region of the reference configuration R and let Pt = χt (P). Let φ and φ R be the spatial and material descriptions of a field and let ∂t φ R , ∂t J be continuous functions. Then d ∫ φ dv = ∫ (φ˙ + φ ∇ · v)dv. dt Pt Pt

(1.23)

Proof By the change of variables x → X = χ−1 (x, t), we have ∫ φ(x, t) dv = ∫ φ R (X, t)J (X, t) dv R .

Pt

P

The sub-region P is time independent, and hence by the continuity of ∂t φ R (X, t) and ∂t J (X, t), we deduce d ∫ φ R J dv R = ∫[∂t φ R J + φ R ∂t J ]dv R . dt P P In view of (1.13), we find d ∫ φ R J dv R = ∫(φ˙ + φ∇ · v)J dv R = ∫ (φ˙ + φ∇ · v)dv, dt P P Pt 

whence (1.23) follows. Since φ˙ = ∂t φ + v · ∇φ, then φ˙ + φ∇ · v = ∂t φ + v · ∇φ + φ∇ · v. Now, for any scalar, vector, or tensor φ, we have v · ∇φ + φ∇ · v = ∇ · (φv) := ∂xi (φvi ). With this meaning of ∇ · (φv) in mind, we can write (1.23) in the form d ∫ φ dv = ∫ [∂t φ + ∇ · (φv)]dv. dt Pt Pt

(1.24)

Moreover, by means of the divergence theorem, we can write the alternative form

1.5 Transport Relations for Convecting Sets

27

d ∫ φ dv = ∫ ∂t φ dv + ∫ φ v · n da. dt Pt Pt ∂Pt

(1.25)

Some comments are (1.25) shows   in order about the transport theorem. Equation that the time rate of Pt φ dv consists of two contributions; Pt ∂t φ dv is the time  rate determined by the derivative of φ within the region Pt while ∂Pt φ v · n da is the contribution due to the flux of φ across the boundary ∂Pt . Now observe that  vol Pt = Pt dv. Hence, letting φ = 1 in (1.25) and (1.24), we find d vol Pt = ∫ v · n da = ∫ ∇ · v dv. dt Pt ∂Pt A motion is said to be isochoric if d vol Pt = 0, dt for all spatial regions convecting with the body. Thus, the motion is isochoric if ∇ · v = 0 or rather the flux of v across the boundary is zero. We now let C be a closed material curve and let Ct = χ(C, t) be the corresponding spatial curve determined by motion. Let X = X(λ), λ ∈ [a, b], be the representation of C and hence x = x(λ, t) = χ(X(λ), t) be the representation of Ct . The integral b

(t) = ∫ v(x, t) · dx := ∫ v(x(λ, t), t) · ∂λ x(λ, t)dλ Ct

a

is the circulation of v around Ct . To compute the derivative of  we observe that b

˙ (t) = ∫ ∂t [v(x(λ, t), t) · ∂λ x(λ, t)]dλ, a

where ∂t is meant as the derivative at fixed λ. Therefore ∂t v(x(λ, t), t) = ∂t v(x, t) + ∂xi v(x, t)vi (x, t) = v˙ (x, t) where x stands for x(λ, t). Now ∂t ∂λ x(λ, t) = ∂λ ∂t x(λ, t) = ∂λ v(x(λ, t), t). Since Ct is closed, then b

∫ v · ∂λ vdλ = 21 [v2 (x(b, t), t) − v2 (x(a, t), t)] = 0. a

Hence, it follows the circulation-transport relation

28

1 Kinematics b

˙ (t) = ∫[˙v(x(λ, t), t) · ∂λ x(λ, t)]dλ. a

The motion is said to preserve circulation if ˙ = 0 for every closed material curve and time t. If v˙ is the gradient of a potential, v˙ = ∇ϕ, then b

b

b

a

a

a

∫[˙v(x(λ, t), t) · ∂λ x(λ, t)]dλ = ∫[∇ϕ(x(λ, t), t) · ∂λ x(λ, t)]dλ = ∫ ∂λ ϕ(λ, t)dλ = 0.

This result traces back to Kelvin: if the acceleration is the gradient of a potential then the motion preserves circulation.

1.5.1 Identities About the Total Time Derivative of Spatial Gradients In applications, we are faced with the total time derivative of spatial gradients, say ˙ for any C 2 function φ(x, t), x ∈ , t ∈ R, with scalar, vector, or tensor values. ∇φ, The evaluation is simplified by using the identity ˙ = ∇ φ˙ − LT ∇φ. ∇φ

(1.26)

To prove (1.26), we observe that, for any C 2 function φ, ˙ = ∂ ∇φ + (v · ∇)∇φ = ∇∂ φ + (v · ∇)∇φ. ∇φ t t Since v · ∇ ∇φ = ∇(v · ∇φ) − (LT ∇)φ we have

˙ = ∇∂ φ + ∇(v · ∇φ) − (LT ∇)φ ∇φ t

whence we obtain (1.26). Equivalently, if φ is a scalar, then (1.26) can also be written in the form ˙ = ∇ φ˙ − ∇φ L. ∇φ If φ is a vector, w, or a tensor, A, we can write the identity (1.26) in the forms ˙ = ∇w ˙ − (LT ∇)w, ∇w ˙ = ∇A ˙ − (LT ∇)A, ∇A

∂xi˙w j = ∂xi w˙ j − L ki ∂xk w j .

(1.27)

∂xi ˙A jk = ∂xi A˙ jk − L pi ∂x p A jk .

(1.28)

1.5 Transport Relations for Convecting Sets

29

If higher-order gradients are involved, we can iterate the procedure by starting from (1.26). For definiteness, look at second-order gradients. Let φ = ∇ψ and assume ψ is a C 3 function on  × R. By ˙ − (LT ∇)(∇ψ), ˙ = ∇ ∇ψ ∇∇ψ in indicial notation, we have ∂xi ∂˙xk ψ = ∂xi ∂x˙k ψ − L hi ∂xh ∂xk ψ = ∂xi ∂xk ψ˙ − L hk ∂xi ∂xh ψ − (∂xi L hk )∂xh ψ − L hi ∂xh ∂xk ψ = ∂xi ∂xk ψ˙ − (∂xi ∂xk vh )∂xh ψ − (L hk ∂xh ∂xi ψ + L hi ∂xh ∂xk ψ). Thus, in direct notation, we can write the identity for the second-order gradient in the form ˙ = ∇∇ ψ˙ − ∇ψ(∇∇v) − 2 sym(LT ∇)∇ψ). ∇∇ψ (1.29) We now apply (1.27) to w = v and find T ∂x˙i v j = ∂xi v˙ j − L ik ∂xk v j

whence

L˙ =  − L L,

 ji := ∂xi v˙ j .

(1.30)

¨ By Iteration of the procedure allows us to obtain L.  T L¨ ji = ∂x˙i v j ˙ = (∂xi v˙ j )˙− (L jk L ki )˙ = ∂xi v¨ j − L ik ∂xk v˙ j − L˙ jk L ki − L jk L˙ ki and use of (1.30), we have T T ∂xk v˙ j − (∂xk v˙ j − L kp ∂x p v j )L ki − L jk (∂xi v˙k − L iTp ∂x p vk ) L¨ ji = ∂xi v¨ j − L ik

whence

¨ =  − 2  L − L  − 2 L L L, L

 ji := ∂xi v¨ j .

(1.31)

¨ Time differentiation of F˙ = LF and use of (1.30) It is also of interest to evaluate F. yield ˙ + LF˙ = (∇ v˙ − LL)F + LLF. F¨ = LF Hence

¨ −1 = ∇ v˙ . FF

Alternatively, we might have observed that

(1.32)

30

1 Kinematics −1 ¨ ∂xi v˙ j = ∂xi χ¨ j = ∂ X K χ¨ j FK−1 i = F j K FK i ,

¨ −1 . ∇ v˙ = FF

1.5.2 Transport Relations for Vorticity and Velocity Gradient The material time derivative is the derivative as seen by an observer in motion with the velocity of the continuum. In applications, we are often required to compute the total time derivative in particular of velocity, vorticity, and velocity gradient. We first apply the relation (1.12) to the velocity field v and observe that (v · ∇)v = Lv. Hence, it follows v˙ = ∂t v + Lv. Now (L − LT )v = 2Wv,

Lv = 2Wv + LT v.

Furthermore LT v = ∇ 21 v2 ,

2Wv =  × v.

Hence, we obtain v˙ = ∂t v + ∇ 21 v2 +  × v.

(1.33)

The curl of (1.33) and the identity ∇ × ∇v2 = 0 imply that ∇ × v˙ = ∂t  + ∇ × ( × v). Now, in suffix notation [∇ × ( × v)]i = i jk kpq ∂x j ( p vq ) = ∂x j ( i v j ) − ∂x j ( j vi ). Moreover ∇ ·  = ∇ · ∇ × v = 0. Hence, it follows ∇ × ( × v) = (v · ∇) + (∇ · v) − L. Consequently, we obtain a transport relation for  in the form  ˙ − L + (∇ · v) = ∇ × v˙ .

(1.34)

1.5 Transport Relations for Convecting Sets

31

An analogous transport relation can be derived for the spin tensor W. Since 2W = ˙ −1 − F−T F˙ T , then L − LT = FF ˙ −1 − F−T F˙ T )F = FT F˙ − F˙ T F. 2FT WF = FT (FF In view of (1.32), we obtain ˙ ¨ −1 − F−T F¨ T )F = FT [∇ v˙ − (∇ v˙ )T ]F, 2FT WF = FT F¨ − F¨ T F = FT (FF whence

˙ FT WF = FT 21 [∇ v˙ − (∇ v˙ )T ]F.

(1.35)

Two remarkable consequences follow from (1.35). First assume the acceleration is the gradient of a scalar function ϕ, v˙ = ∇ϕ. Hence ∇ v˙ − (∇ v˙ )T = ∇∇ϕ − (∇∇ϕ)T = 0. By (1.35), it follows that FT WF is constant in time, FT WF = K(X),

W = F−T (X, t)K(X)F−1 (X, t).

If W(X, t0 ) = 0 at some time t0 , then K(X) = 0 and W(X, t) = 0 at any time t. If, further, W(X, t0 ) = 0 for any X ∈ R, then W = 0,

=0

at any point X and time t. This allows us to conclude that10 if the acceleration is the gradient of a scalar field, then the motion is irrotational, provided that it is at some point in time. The transport relation for the spin W is a further consequence of (1.35). Direct evaluation of the time derivative and substitution of F˙ = LF gives ˙ ˙ + LT W + WL)F. 2FT WF = 2FT (W Moreover LT W + WL = DW + WD. Hence, (1.35) becomes

10

The result is due to Lagrange and Cauchy.

32

1 Kinematics

˙ + DW + WD = 1 (∇ v˙ − (∇ v˙ )T ). W 2

(1.36)

The transport relation (1.34) for the vorticity  is expected to be equivalent to the transport relation (1.36) for the spin W. To prove the expected equivalence, we multiply (1.34) by − 21  pqi to obtain W˙ pq + 21  pqi [L i j j − (∇ · v) i ] = 21 (∂xq v˙ p − ∂x p v˙q ). Hence, we have to show that 1  [L i j j 2 pqi

− (∇ · v) i ] = D pk Wkq + W pk Dkq .

Since W = 0 and Wkq = − 21 kqi i , then we need to prove that A pq := 21  pqi [Di j j − (∇ · v)ωi ] − 21 qki Dkp i + 21  pki Dkq i = 0. By definition, A pq is skew, A pq = −Aq p . Hence A pq = 0, for every pair p, q, if and only if h = 1, 2, 3.  pqh A pq = 0, Now 2 pqh A pq = (δqq δhi − δqi δhq )(Di j j − ∇ · v i ) +(δ pk δhi − δ pi δhk )Dkp i + (δqk δhi − δqi δhk )Dkq i = 0 identically. The equivalence of (1.34) and (1.36) is then proved.



1.6 Transport Relations for Non-Convecting Sets There are cases where the boundary of a set (sub-region, surface) is not convecting with the body so that the velocity ν of the boundary need not be equal to the velocity v of the material points. We then consider a sub-region  and establish the following property (Fig. 1.2).

Fig. 1.2 The region (t) is not convected by the body

1.6 Transport Relations for Non-Convecting Sets

33

Theorem 1.3 (generalized transport) Let  ⊂ E be a time-dependent region. If φ and ∂t φ are continuous on  then d ∫ φ dv = ∫ ∂t φ dv + ∫ φ ν · n da. dt   ∂ Proof We compute the derivative as the limit of the difference quotient d 1

∫ φ dv = lim ∫ φ(x, t + h) dv − ∫ φ(x, t) dv . h→0 dt (t) h (t+h) (t) Upon some rearrangements, we have ∫

(t+h)

φ(x, t + h) dv − ∫ φ(x, t) dv = ∫ [φ(x, t + h) − φ(x, t)]dv (t)

+

(t)



(t+h)

φ(x, t + h) dv − ∫ φ(x, t + h) dv (t)

Now φ(x, t + h) − φ(x, t) = ∂t φ(x, ξ)h,

ξ ∈ (t, t + h).

By the uniform continuity of ∂t φ, we can write |∂t φ(x, ξ) − ∂t φ(x, t)| ≤ (h) where (h) → 0 as h → 0. Consequently, it follows lim

h→0

1 ∫ [φ(x, t + h) − φ(x, t)] dv = ∫ ∂t φ(x, t) dv. h (t) (t)

The boundary ∂(t) moves with velocity ν, possibly not continuous. Also, in view of the continuity of φ(x, ·), we have ∫

(t+h)

φ(x, t + h) dv − ∫ φ(x, t + h) dv = h ∫ φ(x, t)ν · n da + o(h) (t)

∂(t)

where |o(h)| ≤ δ(h)h and δ(h) → 0 as h → 0. Hence the conclusion follows.



As a remark, this generalized transport holds, in particular, if ν is continuous on ∂. In such a case, we might consider a continuous field ν on  of a fictitious underlying continuum and say that Reynolds’ transport holds so that also d ∫ φ dv = ∫[φ˙ + φ∇ · ν]dv dt   holds. Yet this view is hardly operative because ν is not the real velocity of the continuum.

34

1 Kinematics

Fig. 1.3 The surface σ(t) divides  into the regions + , − and forms a common boundary between them

1.6.1 Transport Theorems for Discontinuous Fields Let  ⊂ E be a region possibly dependent on the time t ∈ R. A surface σ(t) divides  into the regions + , − and forms a common boundary between them so that  = − ∪ σ ∪ + , − ∩ + = ∅. Let S± = ∂± \ σ and hence ∂ = S+ ∪ S− . Let m be the unit normal of σ, directed towards + . Thus, on σ the outward (unit) normal of − is m whereas the outward normal to + is −m. Let ν be the velocity of (the points of) σ and n be the unit normal to ∂ (Fig. 1.3). Let φ(·, ·) be a function on  × R, whose values may be scalars, vectors, or tensors of any order, such that φ(·, t) is continuous within the regions + and − and let φ(x, t) have definite limits φ+ and φ− as x approaches a point on σ from paths entirely within the regions + and − , respectively φ± (y, t) = lim φ(x, t),

x ∈ ± .

x→y

The surface σ(t) is said to be singular with respect to φ(·, t) at time t if [[φ]](y, t) := φ− (y, t) − φ+ (y, t) = 0,

y ∈ σ(t).

By the divergence theorem, for a vector function w ∈ C 1 (D), D ⊂ E , we have ∫ ∇ · w dv = ∫ w · n da, ∂D

D

n being the unit outward normal to ∂ D. A generalization is required if the domain, , contains a singular surface. Theorem 1.4 (divergence, singular surface) If w is a continuously differentiable vector function in − and + and σ is a singular surface for w then ∫ ∇ · w dv = ∫ w · n da + ∫[[w]] · m da.

\σ

∂

σ

(1.37)

1.6 Transport Relations for Non-Convecting Sets

35

Proof Observe ∫ ∇ · w dv = ∫ ∇ · w dv + ∫ ∇ · w dv. +

\σ

−

In ± , ∇ · w is continuous and the divergence theorem applies to obtain ∫ ∇ · w dv = ∫ w · n da − ∫ w+ · m da,

+

S+

σ

∫ ∇ · w dv = ∫ w · n da + ∫ w− · m da.

−

S−

σ



Summation provides the result.

Theorem 1.5 (transport of discontinuous fields) If φ and the velocity v are continuously differentiable functions in − and + while ν is the velocity of σ then d ∫ φ dv = ∫ {∂t φ + ∇ · (φv)}dv + ∫[[φ(ν − v)]] · m da. dt \σ σ \σ

(1.38)

Proof Since φ and v are C 1 functions in − and + we apply the generalized transport theorem to obtain d ∫ φ dv = ∫ ∂t φ dv + ∫ φv · n da − ∫ φ+ ν · m da, dt + σ + S+ d ∫ φ dv = ∫ ∂t φ dv + ∫ φv · n da + ∫ φ− ν · m da. dt − σ − S− By adding these relations, we find d ∫ φ dv = ∫ ∂t φ dv + ∫ φ v · n da + ∫[[φ]]ν · m da. dt \σ σ \σ ∂ As to the integral over ∂, we apply the divergence theorem to obtain ∫ φ v · n da = ∫ ∇ · (φv) dv − ∫[[φv]] · m da.

∂

\σ

σ

Substitution and the observation that [[φ]]ν = [[φν]] lead to the result (1.38).



The general relation (1.38) reduces to Reynolds’ transport relations (1.23) or (1.24)–(1.25) if ν = v, even though [[φ]] = 0, that is if the discontinuity surface σ

36

1 Kinematics

moves with the velocity of the continuum. Instead, if [[φ]] = 0 then [[φν]] = 0 and ∫[[φ(ν − v)]] · m da = − ∫ φ[[v]] · m da. σ

σ

The normal speed ν · m is called the speed of displacement and is a measure of the speed with which the surface σ(t) traverses in space. The quantity U = (ν − v) · m

(1.39)

is called the local speed of propagation; it is a measure of the normal speed of the surface σ(t) with respect to the material points that are instantaneously situated upon it.

1.7 Kinematics of Singular Surfaces Let P be a region occupied by the body in the reference configuration R. As with  in the spatial description, we let a surface (t) divide P into the regions P+ , P− so that P = P− ∪  ∪ P+ , P− ∩ P+ = 0. Following is an outline of the geometrical and kinematical properties of (t) as a singular surface11 for a function φ. The surface (t) is represented by the relation X = Y( , , t), where and  are a pair of surface parameters. Hence, a point on the surface (t) is identified by and . The velocity U of a point on (t) is defined by U R ( , , t) = ∂t Y( , , t). Let G(X, t) = 0 represent the surface (t). The unit normal to (t) is given by nR =

∇R G . |∇R G|

We call the quantity U R , defined by U R := U R · n R = −

∂t G , |∇R G|

the speed of propagation of the surface (t). Hence, U R is a measure of the speed with which the surface (t) traverses the material. The speed of propagation is an inherent property of the surface and all possible velocities U R of the surface have the 11

See, e.g. [429], p. 173–194; [89, 163, 289], Sect. 8.8.

1.7 Kinematics of Singular Surfaces

37

same normal component U R . A surface (t) that is singular with respect to some quantity and has nonzero speed of propagation U R (X, t) = 0, is said to be a wave. We now examine the corresponding surface σ(t) induced by the motion of the continuum x = χ(X, t), X ∈ R. Letting G(X, t) = G(χ−1 (x, t), t) =: g(x, t) we say that g(x, t) = 0 is the representation of the surface σ(t), in the current configuration χ(R, t). The two surfaces (t) and σ(t) are the duals of each other; they are the material representation and the spatial representation of the singular surface. The surface σ(t) has the parametric representation x = y(δ, γ, t) where δ and γ are a pair of surface parameters. The unit normal n to the surface σ(t) is given by n=

∇g . |∇g|

The normal speed u n of σ(t), called the speed of displacement is given by un = −

∂t g . |∇g|

We may select (the sign of) g so that n points in the direction of propagation. The speed of displacement u n is a measure of the speed with which the surface σ(t) traverses in space; it is also independent of the choice of parameterization of σ(t). Since g(χ(X, t), t) = G(X, t) (1.40) we have ∇g F = ∇R G. Consequently FT whence the absolute value gives

∇g |∇g| = ∇R G |∇g|

(1.41)

38

1 Kinematics

|∇g| 1 = T . |∇R G| |F n| We now determine the connection between the normals n, n R . By (1.41), we have n R = FT n

|∇g| FT n = T . |∇R G| |F n|

Likewise, we have the dual relation n = F−T n R

|∇R G| F−T n R = −T . |∇g| |F n R |

To find the connection between the speeds u n , U R , we look at (1.40) and observe ∂t G = ∂t g + x˙ · ∇g. As a consequence UR = −

∂t G ∂t g + x˙ · ∇g 1 =− T = T (u n − x˙ · n), |∇R G| |F n| |∇g| |F n|

whence UR =

1 U, |FT n|

U = u n − x˙ · n.

Consistent with (1.39),12 the quantity U is called the local speed of propagation and is the speed of the surface σ(t) with respect to the material points.

1.7.1 Geometrical Conditions of Compatibility Let φ(·, t) be a function which is continuous in the interiors of + and − and let σ(t) be the common boundary between them. The theory of singular surfaces rests upon Lemma 1.1 (Hadamard) Let φ be defined and continuously differentiable on σ and let φ+ and ∇φ+ be the limit of φ and ∇φ as σ is approached from + . If x = x(s) is a smooth curve upon σ, then dx dφ+ = ∇φ+ · . ds ds

12

Where m stands for n and ν · m for u n .

1.7 Kinematics of Singular Surfaces

39

The proof is immediate. If φ+ is the function on the + side of σ, then differentiation of φ+ = φ+ (x(s)) yields the result. Applying Hadamard’s lemma to each side of σ, we have dx dφ+ = ∇φ+ · , ds ds



dx dφ− = ∇φ− · . ds ds

Subtraction gives d dx dx [[φ]] = [[∇φ]] · = [[∇φ · ]]. ds ds ds This formula means that the tangential derivative of the jump is the jump of the tangential derivative. Since the values of φ in + and − are unrelated to one another, the limiting values of the normal derivatives of φ on the two sides of σ need not be connected and hence [[dφ/dn]] is unrestricted. Likewise Hadamard’s lemma, letting x(δ, γ, t) be the parameterization of a curve on σ, we have ∂γ [[φ]] = [[∇φ]] · ∂γ x. In particular, if φ is continuous across σ or [[φ]] is constant, then [[∇φ]] · ∂γ x = 0. Thus, we have Theorem 1.6 (Maxwell) If [[φ]] = 0 or [[φ]] is constant on σ, then [[∇φ]] = bn,

b = [[n · ∇φ]] = [[

dφ ]]. dn

If φ = w, a vector, then [[∇w]] = n ⊗ b,

b = [[n · ∇w]] = [[

dw ]]. dn

Hence, we have [[∇ × w]] = n × b,

[[∇ · w]] = n · b,

b = n[[∇ · w]] − n × [[∇ × w]].

This is phrased by saying that the longitudinal and transversal jumps of the gradient of a continuous vector are the jumps of its divergence and curl, respectively (Weingarten’s first theorem) ([425, 443], Sect. 11.4). Assume now σ is a singular surface for the second-order derivatives of φ, that is [[∇φ]] = 0, whereas [[∇∇φ]] = 0. By Hadamard’s lemma

40

1 Kinematics

∂γ [[∇φ]] = [[∇∇φ]]∂γ x. The assumption [[∇φ]] = 0 implies that [[∇∇φ]] t = 0 for every vector t tangent to σ. Hence there is a vector w such that [[∇∇φ]] = w ⊗ n. The symmetry of ∇∇φ implies that w = a n and then [[∇∇φ]] = a n ⊗ n. We can view a as the amplitude of the discontinuity. Indeed, inner multiplying by n ⊗ n gives d 2φ a = [[ 2 ]]. dn Given a vector, φ say, for each component φi , we have a replaced by ai and hence [[∇∇φ]] = a n ⊗ n. We then find that a is the second-order normal derivative so that [[∇∇φ]] = [[

d 2φ ]] n ⊗ n. dn 2

If a n the singularity is said to be longitudinal, if a ⊥ n transversal. The dual properties hold across (t), with (X, t) = (χ−1 (x, t)) = φ(x, t), in the reference configuration. If [[]] = 0 or [[]] is constant, then [[∇ R ]] = B n R ,

B = [[n R · ∇ R ]] = [[

d ]]. dn R

If [[∇ R ]] = 0, whereas [[∇ R ∇ R ]] = 0, then [[∇ R ∇ R ]] = A n R ⊗ n R .

1.7.2 Kinematical Conditions of Compatibility Let g(x, t) = 0 be the equation of the moving surface σ(t) and let x = x(δ, γ, t) be a parameterization of the points of σ(t). Differentiation with respect to t gives

1.7 Kinematics of Singular Surfaces

41

∂t g + u · ∇g = 0,

u = ∂t x(δ, γ, t);

u is the velocity of the surface point identified by the pair of parameters δ, γ. Dividing by |∇g| and letting ∇g un = u · n = u · |∇g| we obtain un = −

∂t g . |∇g|

Since the right-hand side involves the function g alone, and then is independent of the selected parameterization, the quantity u n too is independent of the parameterization. Accordingly, all possible velocities u of the moving surface have the same normal component u n , which is called the speed of displacement of the surface. The particular choice u = un n makes u normal to the surface. This velocity is then called the normal velocity of the surface. An observer moving with the normal velocity u n n will encounter points on the surface having surface coordinates, γ1 , γ2 say, which vary in time in a proper way. Now if x = x(γ1 , γ2 , t), and γ1 , γ2 vary in time, then the velocity is given by v = ∂t x + ∂t γi ∂γi x. If the observer moves with the normal velocity, then the derivatives ∂t γi are subject to (1.42) ∂t x + ∂t γi ∂γi x = u n n. If f (γ1 , γ2 , t) is a function defined on the moving surface we denote by δ f /δt the displacement derivative that is the rate of change of f as apparent to an observer moving with the normal velocity. Hence δf = ∂t f + ∂t γi ∂γi f, δt where ∂t γ1 and ∂γ2 are such that (1.42) holds. Now let f (γ1 , γ2 , t) = fˆ(x(γ1 , γ2 , t), t). Then

∂t f = ∂t fˆ + ∂t x · ∇ fˆ,

∂γi f = ∂γi x · ∇ fˆ.

42

1 Kinematics

Consequently δ fˆ δf ≡ = ∂t fˆ + (∂t x + ∂t γi ∂γi x) · ∇ fˆ = ∂t fˆ + u n n · ∇ fˆ. δt δt Let φ+ (φ− ) be a function defined on the whole region + (− ). The displacement derivative of φ+ reads δφ+ = ∂t φ+ + u n n · ∇φ+ δt and the like holds for φ− . Of course ∂t φ± = (∂t φ)± ,

∇φ± = (∇φ)± .

Subtracting the relation for φ+ from that for φ− , we obtain δ [[φ]] = [[∂t φ]] + u n [[n · ∇φ]], δt

(1.43)

which is named kinematical condition of compatibility. In the special case where φ is continuous across σ Eq. (1.43) reduces to [[∂t φ]] = −u n [[n · ∇φ]].

(1.44)

In particular, across a stationary surface (u n = 0) that is singular with respect to ∇φ but not with respect to φ we have [[∂t φ]] = 0. If [[φ]] = 0, then the geometrical condition of compatibility and (1.44) yield [[∇φ]] = bn,

[[∂t φ]] = −u n b.

For practical purposes, it is useful to determine a relation for the jump of the Lagrangian derivative φ˙ within the Eulerian description. Since φ˙ = ∂t φ + v · ∇φ then ˙ = [[∂t φ]] + [[v · ∇φ]]. [[φ]] If [[φ]] = 0 then by (1.44) it follows ˙ = −[[(u n n − v) · ∇φ]]. [[φ]] If, in particular, [[v]] = 0, then ˙ = −(u n n − v) · [[∇φ]] = −(u n n − v) · n [[n · ∇φ]] [[φ]] whence

˙ = −U [[n · ∇φ]], [[φ]]

U := u n − v · n,

(1.45)

1.7 Kinematics of Singular Surfaces

43

U being called the local speed of propagation. Further, kinematical conditions of compatibility hold for higher-order derivatives. For example, applying (1.43) to ∂t φ and ∇φ, we have δ [[∂t φ]] = [[∂t2 φ]] + u n [[n · ∇∂t φ]], δt δ [[∇φ]] = [[∇∂t φ]] + u n [[n · ∇∇φ]]. δt We say that σ(t) is a propagating singular surface of order n, with respect to φ, if φ and the derivatives up to order n − 1 are continuous functions everywhere whereas the derivatives of order n and higher have jump discontinuities across σ(t) but are continuous functions everywhere else. Singular surfaces of order 0 and 1 are said to be strong singularities; if φ is the position vector such is the case of contact discontinuities and shock waves. Singular surfaces of order 2 or higher are said to be weak singularities. If σ is a singular surface of order 2 with respect to φ, then, by Maxwell’s theorem, we have [[∇∇φ]] = a n ⊗ n. By the kinematical condition of compatibility [[∂t ∇φ]] = −u n [[n · ∇∇φ]] = −u n a n, and [[∂t2 φ]] = −u n n · [[∂t ∇φ]] = u 2n a. The dual properties hold for singularities across (t) in the reference configuration. Let ˆ F(1 , 2 , t) = F(X( 1 , 2 , t), t) Then

ˆ ∂t F = ∂t Fˆ + ∂t X · ∇ R F,

ˆ ∂i F = ∂i X · ∇ R F.

If F(1 , 2 , t) is a function defined on the moving surface we denote by δ F/δt the displacement derivative that is the rate of change of F as apparent to an observer moving with the normal velocity U R . Hence δF = ∂t F + ∂t i ∂i F, δt and δ Fˆ δF ˆ ≡ = ∂t Fˆ + (∂t X + ∂t i ∂i X) · ∇R Fˆ = ∂t Fˆ + U R n R · ∇ R F, δt δt

44

1 Kinematics

where U R n R = ∂t X + ∂t i ∂i X by definition of the displacement derivative. Accordingly, for any function (X(1 , 2 , t), t) we consider the limit functions + , − as the current point approaches (t) from R+ , R− . Subtraction of the corresponding relation for − , + gives δ[[]] = [[∂t ]] + U R [[n R · ∇ R ]], δt

(1.46)

that is the kinematical condition of compatibility in the reference configuration. In the case where  is continuous across , Eq. (1.46) reduces to [[∂t ]] = −U R [[n R · ∇R ]].

(1.47)

Moreover, by the geometrical condition of compatibility, we have [[∇R ]] = B n R ,

[[∂t ]] = −U R B.

Singular surfaces (t) of order n are defined as with the corresponding surface σ(t) in the Eulerian description. If  is a singular surface of order 2, with respect to , then by Maxwell’s theorem and the kinematical condition of compatibility (1.47), we have [[∇R ∇R ]] = A n R ⊗ n R ,

[[∂t ∇R ]] = −U R An R ,

[[∂t2 ]] = U R2 A.

If the function  is the motion χ, then we have the following results [[∇R ∇R χ]] = [[∇R F]] = A n R ⊗ n R ,

˙ = −U R An R , [[∇R χ]] ˙ = [[F]]

[[χ]] ¨ = U R2 A,

which are referred to as Hugoniot’s compatibility conditions [425], Sect. 11.4. These compatibility relations can be given a form appropriate for the Eulerian description, involving the normal n and the speed U of the surface σ(t). Since nR =

FT n , |FT n|

UR =

1 U, |FT n|

letting a=

1 |FT n|2

A

we obtain [[∇R F]] = a FT n ⊗ FT n,

˙ = −U a FT n, [[F]]

[[χ]] ¨ = U 2 a.

For the sake of generality, we now let  be a singular surface of order p ≥ 1. For definiteness, we let the function  be the motion χ. Hence, order p means that

1.7 Kinematics of Singular Surfaces

45

the ( p − 1)-th order derivatives, of χ(X, t), are continuous whereas the p-th order derivatives suffer jump discontinuities across . Consequently p−1

[[∂t

χ]] = 0,

[[∇ R · · · ∇ R χ]] = 0.   p−1

By Maxwell’s theorem13 [[∇ R · · · ∇ R χ]] = [[∂n R ∇ R · · · ∇ R χ]]n R = [[∇ R · · · ∇ R ∂n R χ]]n R .       p

p−1

p−1

Iterating the procedure, we find [[∇ R · · · ∇ R χ]] = [[∂npR χ]] n R ⊗ · · · ⊗ n R .  

  p

p

By the kinematical condition of compatibility, we obtain p

p−1

[[∂t χ]] = −U R [[∂n R ∂t

p−1

χ]] = −U R [[∂t

∂n R χ]].

Applying the kinematical condition of compatibility p − 1 more times, we find p

[[∂t χ]] = (−U R ) p [[∂npR χ]]. These relations are traced back to Hadamard [425]. We can then write the following Higher-order compatibility condition. At a singular surface of order p ≥ 1 for the motion, the jumps of space and time derivatives are related by p

(−U R ) p [[ ∇ R · · · ∇ R χ]] = [[∂t χ]] n R ⊗ · · · ⊗ n R .  

  p

p

A further relation follows from the equation of motion, in the material description, p−2 to (2.28) so that as p ≥ 2. Apply ∂t p−1

ρ R ∂t p−2

and, subject to [[∂t

p−2

p−2

v = ∂t

∇ R · T R + ρ R ∂t

b]] = 0, take the jumps to have p−1

ρ R [[∂t

p−2

v]] = [[∂t

∇ R · T R ]].

Now p−2

[[∂t 13

b,

p−2

∇ R · T R ]] = [[∇ R · ∂t

T R ]] = −

1 p−1 [[∂t T R ]]n R UR

For ease in writing we use ∂n and ∂n R in place of d/dn = n · ∇ and d/dn R = n R · ∇R .

46

1 Kinematics

and

p−1

[[∂t Hence, we have

p−1

[[∂t

p

v]] = [[∂t χ]] = (−U R ) p [[∂npR χ]]. T R ]]n R = ρ R (−U R ) p+1 [[∂npR χ]].

1.7.3 Jumps at Acoustic Waves In applications, it may happen that we need the geometrical description in the current configuration, at the surface σ(t), whereas because of the balance equations, we may use the total time derivative. To fix ideas, let σ be a singular surface of order 1 with respect to φ, [[φ]] = 0, [[∇φ]] = 0. By Maxwell’s theorem [[∇φ]] = b n,

b = [[∂n φ]],

∂n φ := n · ∇φ.

An acoustic wave (or acceleration wave) is a moving singular surface σ(t) such that [[v]] = 0, [[∇v]] = 0, across σ(t). Hence Maxwell’s theorem implies that [[∇v]] = a ⊗ n,

a := [[∂n v]].

Hence, it follows that 2[[D]] = a ⊗ n + n ⊗ a, [[∇ · v]] = a · n, [[∇ × v]] = n × a, 2[[W]] = a ⊗ n − n ⊗ a.

By the kinematical condition of compatibility, we can write [[∂t v]] = −u n [[n · ∇v]] = −u n a. Now [[˙v]] = [[∂t v + v · ∇v]] = [[∂t v]] + v · [[∇v]] = (−u n n + v) · [[∇v]] = (−u n + v · n)a.

Consequently, we have [[˙v]] = −U [[∂n v]]. In terms of [[∂n2 χ]], we have

(1.48)

1.8 Transport Theorems for Surface Integrals

47

[[∇v]] = n ⊗ [[∂n v]] = −U n ⊗ [[∂n2 χ]] = −U n ⊗ c,

c := [[∂n2 χ]],

in suffix notation [[∂xi v j ]] = n i [[∂n v j ]]. Hence, we find 2[[D]] = −U (c ⊗ n + n ⊗ c),

2[[W]] = −U (c ⊗ n − n ⊗ a),

[[∇ · v]] = −U c · n,

[[∇ × v]] = −U n × c,

[[χ]] ¨ = −U [[∂n χ]] ˙ = U 2 [[∂n2 χ]], the relations for ∇ · v and W being the content of the Weingarten-Hadamard theorem [425].

1.8 Transport Theorems for Surface Integrals Let S be a surface in the three-dimensional space E , possibly dependent on the time t ∈ R. The dependence on t may be induced by the motion of the points of a body or by a given velocity field of S as a purely geometrical surface. We first compute the time derivative of a surface integral, over a time-dependent surface S, when the time dependence is induced by a continuous velocity field of the underlying body. We then say that S convects with the body. Let S consist of points ¯ The surface S is the image of a body undergoing a velocity field v ∈ C 1 (S) ∩ C(S). of S through the motion of the body, S  x = χ(X, t), X = χ−1 (x, t) ∈ S,

S = χ(S, t),

for any time t. In light of the Nanson formula, we can write ∫ φ n da = ∫ φ J F−T n R da R , S

S

where n is the unit normal to S and n R is the unit normal to S. The surface S is time independent. Hence d ˙ ∫ φ n da = ∫ φ J F−T n R da R . dt S S Since

J˙ = J ∇ · v,

and F−T F˙ T = LT we find

˙ F−T = −F−T F˙ T F−T

48

1 Kinematics

d ∫ φ n da = ∫(φ˙ J F−T + φ J ∇ · v F−T − φ J F−T F˙ T F−T ) n R da R dt S S ˙ + φ[(∇ · v)1 − LT ]n}da. = ∫{φn S

Now we let the boundary ∂S of S depend on time via a velocity field ν, on ∂S. Then the boundary is not convecting and moves with velocity ν − v, possibly discontinuous, relative to the material points and we say that S is not convecting with the body. The area produced by the relative velocity ν − v, per unit length of the boundary, is |(ν − v) × t| and (ν − v) × t is orthogonal to the surface. Hence, the additional contribution to the time derivative is given by ∫ φ (ν − v) × t dl.

∂S

Consequently, for a generic motion of the boundary ∂S, we have d ˙ + φ[(∇ · v)1 − LT ]n}da + ∫ φ (ν − v) × t dl. ∫ φ n da = ∫{φn dt S S ∂S Of course the integral on ∂S vanishes if the boundary consists of material points, in which case ν = v. With a view to applications, we now let φ be a vector, say φ = w, and hence φ n = w · n or a tensor A so that φn = An. Since ˙ + (∇ · v)w − (w · ∇)v] · n ˙ · n + w · n ∇ · v − w · L T n = [w w then

d  ∫ w · n da = ∫ w ·n da + ∫ w · (ν − v) × t dl, dt S S ∂S

where

(1.49)



˙ + (∇ · v)w − (w · ∇)v = w ˙ − Lw + (∇ · v)w; w := w 

w is referred to as the convected time derivative of the vector w. If φn = An, then ˙ + φ[(∇ · v)1 − LT ]n with [A ˙ − ALT + (∇ · v)A]n to we can formally replace φn obtain  d ∫ A n da = ∫ A n da + ∫ A (ν − v) × t dl, dt S S ∂S where



˙ − ALT + (∇ · v)A. A := A 

The symbol A is referred to as the convected time derivative of the tensor A. If S is convecting with the body, then

1.8 Transport Theorems for Surface Integrals

49

 d ∫ A n da = ∫ A n da. dt S S

Vectors and tensors are subject to the same convected time derivative; the common expression of the derivative is due to the evolution of the surface. We add a property of vectors. In view of the identity 

w= ∂t w + ∇ × (w × v) + v∇ · w,

(1.50)

by applying Stokes’ theorem, we have d ∫ w · n da = ∫(∂t w + v∇ · w) · n da + ∫ w × v · t dl + ∫ w · (ν − v) × t dl, dt S S ∂S ∂S whence

d ∫ w · n da = ∫(∂t w + v∇ · w) · n da + ∫ w × ν · t dl. dt S S ∂S

(1.51)

If w is divergence-free, i.e. ∇ · w = 0, then d ∫ w · n da = ∫ ∂t w · n da + ∫ w × ν · t dl. dt S S ∂S

1.8.1 Transport Theorems for Discontinuous Fields We now allow for the occurrence of a line of discontinuity, say γ, which divides S in two surfaces S+ , S− , and take the unit tangent tγ to γ such that tγ × n is along the direction from S− to S+ . The surface S consists of points of the body, moving with the velocity field v. The line γ, instead, is moving with a velocity ν possibly different from the velocity v of the points of S. The vector field w suffer a jump discontinuity across γ. We let [[w]] = w− − w+ , w− , w+ at a point x ∈ γ being the limit values of w(y, t) as y approaches x within S− , S+ , respectively (Fig. 1.4). We have d ∫ w · n da = ∫ (∂t w + v ∇ · w) · n da + ∫ w · v × t dl − ∫ w+ · ν × tγ dl, dt S+ γ S+ ∂S+ \γ d ∫ w · n da = ∫ (∂t w + v ∇ · w) · n da + ∫ w · v × t dl + ∫ w− · ν × tγ dl. dt S− γ S− ∂S− \γ

50

1 Kinematics

Fig. 1.4 The line γ divides the surface S in two surfaces S− , S+ and is the common boundary between them

Summation gives d ∫ w · n da = ∫ (∂t w + v ∇ · w) · n da + ∫ w × v · t dl + ∫[[w × ν]] · tγ dl dt S\γ γ S\γ ∂S where the identity w · v × t = w × v · t and the analogue with ν have been used. Likewise, we apply Stokes’ theorem to S+ and S− to obtain ∫ ∇ × u · n da = ∫ u+ · t dl − ∫ u+ · tγ dl,

S+

γ

∂ S+ \γ

∫ ∇ × u · n da = ∫ u− · t dl + ∫ u− · tγ dl, γ

∂ S− \γ

S−

for any vector field u of class C 1 in S+ and S− . The summation results in ∫ ∇ × u · n da = ∫ u · t dl + ∫[[u]] · tγ dl,

S\γ

∂S

γ

(1.52)

that is the generalized Stokes theorem for a surface S including a discontinuity line γ. Identify u with w × v and observe ∫ w × v · t dl = ∫ ∇ × (w × v) · n da − ∫[[w × v]] · tγ dl.

∂S

S\γ

γ

Replacing the integral over ∂S, we obtain d ∫ w · n da = ∫ (∂t w + v ∇ · w) · n da + ∫ ∇ × (w × v) · n da + ∫[[w × (ν − v)]] · tγ dl dt S \γ γ S \γ S \γ

whence

d  ∫ w · n da = ∫ w ·n da + ∫[[w × (ν − v)]] · tγ dl. dt S\γ γ S\γ

(1.53)

Equation (1.53) gives the time derivative of the flux of w across a time-dependent surface S.

1.9 Objectivity

51

1.9 Objectivity Consider two frames of reference F , F ∗ , possibly with different origins O, O ∗ . An event {x, t} and its image {x∗ , t ∗ } under a change of frame are related by rigid transformations and a time shift so that x∗ = c(t) + Q(t)x,

t ∗ = t − a,

(1.54)

where a ∈ R, c(t) = O − O ∗ , and Q(t) is a rotation tensor, det Q(t) = 1. In connection with classical kinematics, F might be the relative observer and F ∗ the absolute observer. Under a change of frame, objective (or frame-indifferent) scalars remain unchanged. Objective vectors change according to the transformation v∗ = Qv,

(1.55)

the dependence of Q on time being understood. Let S be a second-order objective tensor, that is a linear transformation of objective vectors. Let w = Sv and let v∗ = Qv, w∗ = Qw the transforms under the change of frame whence v = QT v∗ . Then w∗ = QSv = QSQT v∗ , whence

S∗ = QSQT .

(1.56)

Objective scalars are invariant whereas objective vectors and objective tensors transform according to (1.55) and (1.56) under a change of frame. Consider a motion of a body B and assume that, relative to a frame of reference F , the motion is given by x = χ(X, t). In another frame of reference F ∗ , the same motion is described by x∗ = χ∗ (X, t ∗ ). If the change from F to F ∗ is described by (1.54), then we have x∗ = χ∗ (X, t ∗ ) = c(t) + Q(t)χ(X, t), Understanding the time dependence, we can write x∗ = c + Qx. Time differentiation, at fixed X, yields

t ∗ = t − a.

52

1 Kinematics

x˙∗ = Q˙x + c˙ + (x∗ − c), ˙ − 2 )(x∗ − c) + 2(x˙∗ − c˙ ), x¨∗ = Q¨x + c¨ + ( where

˙ T  = QQ

(1.57)

is a skew tensor. Only for constant c and Q are the velocity x˙ and the acceleration x¨ objective, i.e. x˙∗ = Q˙x, x¨∗ = Q¨x. Otherwise they are not and hence velocity and acceleration are regarded as frame-dependent vectors. In addition, these formulae provide the classical composition law for velocities and accelerations. By the skew character of , we can consider the axial vector a such that (x∗ − c) = a × (x∗ − c) ˙ − 2 )(x∗ − c) + 2(x˙∗ − c˙ ). Now, to identify the kinematical and the like for ( meaning of a we let x˙ = 0 and c = 0 so that x˙∗ = a × x∗ . Hence, since x is co-rotating (˙x = 0), then x∗ plays the role of any of the co-rotating vectors of (1.19) so that k˙ i = ω × ki . This allows us to identify a with the angular velocity of F with respect to F ∗ . We then let x˙∗ = Q˙x + c˙ + ω × (x∗ − c), x¨∗ = Q¨x + c¨ + ω(x ˙ ∗ − c) − ω × [ω × (x∗ − c)] + 2ω × (x˙∗ − c˙ ), and regard c˙ + ω × (x∗ − c) as the drag velocity, c¨ + ω ˙ × (x∗ − c) − ω × [ω × (x∗ − ˙ ∗ c)] as the drag acceleration, and 2ω × (x − c˙ ) as the Coriolis acceleration. To complete the kinematic properties under changes of frame, we now investigate the transformation of the angular velocity. Let Q be the orthogonal tensor associated with the change of reference F → F ∗ ; the vectors {ki } in F appear as {Qki } in F ∗ . Let QQT = 1, ki → Qki , and denote by ω ∗ the angular velocity of {Qki } relative to F ∗ . Hence, by definition (1.18)  ˙ = 1  Qk × Qk˙ + 1  Qk × Qk ˙ i. ω ∗ = 21 i Qki × Qk i i i i i i 2 2 Observe that, for any pair of vectors u, v, the i-component relative to {e j } of Qu × Qv is given by

1.9 Objectivity

53

[Qu × Qv]i = i pq Q ph u h Q qk vk . Since i pq Q ph Q qk =  j hk Q i j . then [Qu × Qv]i = Q i j  j hk u h vk = [Q(u × v)]i . Consequently

1 i Qki 2

× Qk˙ i = Q 21



i ki

× k˙ i = Qω.

Now observe that 1 i Qki 2

˙ i= × Qk

1 ˙ i QRi p e p × QRiq eq 2

=

1 ˙ p Qe p × Qe p 2

and (Qe p )h = Q hp . Hence, we have 1 i Qki 2

˙ i = el 1 lh j Q hp Q˙ j p = el 1 lh j  j h = a , × Qk 2 2

˙ T . Thus, under the change of frame F → F ∗ a being the axial vector of  = QQ associated with the orthogonal tensor Q the angular velocity obeys the transformation law (1.58) ω ∗ = Qω + a . As with the velocity, by (1.58) we can say that the absolute angular velocity ω ∗ is the sum of (Q applied to) the relative angular velocity, ω, and the drag angular velocity, a .

1.9.1 Transformation Rules for Kinematic Fields For formal simplicity, we let t ∗ = t and hence the motion of X in F ∗ is related to that in F by (1.59) χ∗ (X, t) = c(t) + Q(t)χ(X, t). The gradient with respect to X yields the transformation law F∗ (X, t) = Q(t)F(X, t) for the deformation gradient. Time differentiating these relations, we find

54

1 Kinematics

˙ χ˙∗ (X, t) = c˙ (t) + Q(t)χ(X, ˙ t) + Q(t)χ(X, t),

(1.60)

˙ ˙ F˙∗ (X, t) = Q(t)F(X, t) + Q(t)F(X, t). Hence, v = χ ˙ and F˙ are not objective, whereas F transforms as a (objective) vector, not as a tensor. Time differentiation of fields φ(x, t) in the Eulerian, or spatial, description is more involved because x is no longer invariant while X is. Consider the velocity field with v(x, t), v∗ (x∗ , t), in the two frames F , F ∗ , with x, x∗ corresponding to the same material point X, namely x = χ(X, t), x∗ = χ∗ (X, t). Hence, by (1.59), we have ∂x x∗ = Q and, by (1.60), we can write ˙ v∗ (x∗ , t) = Q(t)v(x, t) + c˙ (t) + Q(t)x.

(1.61)

We compute the velocity gradient in F ∗ , i.e. L∗ = ∂x∗ v∗ (x∗ , t). By the chain rule, we have ˙ ∂x v∗ (x∗ , t) = ∂x∗ v∗ (x∗ , t)∂x x∗ = Q∂x v(x, t) + Q, whence

L∗ Q = [(QLQT ) + ]Q.

It follows that the velocity gradient transforms according to L∗ (x∗ , t) = Q(t)L(x, t)QT (t) + (t).

(1.62)

Replace L with D + W, D = symL, W = skwL to obtain QLQT = QDQT + QWQT , and (QDQT )T = QDQT ,

(QWQT )T = −QWQT .

Hence, the symmetric and the skew parts of (1.62), along with the obvious definitions D∗ = symL∗ , W∗ = skwL∗ , result in D∗ (x∗ , t) = Q(t)D(x, t)QT (t), W∗ (x∗ , t) = Q(t)W(x, t)QT (t) + (t). Thus, D is objective, whereas L and W are not.

1.9 Objectivity

55

By the transformation law F∗ = QF, we derive easily the transformation laws for the stretch fields U and C and the strain field E = (C − 1)/2. Since C = U2 = FT F we have C∗ = (QF)T QF = FT F = C. Consequently, C an hence also U and E are invariant. Instead the left Cauchy–Green tensor B = FFT satisfies B∗ = (QF)(QF)T = QFFT QT = QBQT and hence B is a (objective) tensor. In view of the polar decomposition F = RU = VR and of the transformation law F∗ = QF, we have V∗ R∗ = QVR R∗ U = QRU, whence

R∗ = QR,

V∗ = QVQT .

To sum up, we say that U∗ = U, C∗ = C, E∗ = E, V∗ = QVQT , B∗ = QBQT , D∗ = QDQT , F∗ = QF, R∗ = QR, L∗ = QLQT + , W∗ = QWQT + ,

(1.63)

Hence, U, C, and E are invariant, V, B, and D are objective (tensors), F and R are objective as vectors whereas L and W are not objective.

Velocity Differences In view of (1.61), we have v∗ (x∗ , t) = Q(t)v(x, t) + c˙ (t) + (t)(x∗ − c). If x = χ(X, t) and y = χ(Y, t), we obtain v∗ (X, t) = Q(t)v(X, t) + c˙ (t) + (t)[x∗ (X, t) − c], v∗ (Y, t) = Q(t)v(Y, t) + c˙ (t) + (t)[x∗ (Y, t) − c(t)]. Subtracting the two equations, we find v∗ (X, t) − v∗ (Y, t) = Q(t)[v(X, t) − v(Y, t)] + (t)[χ∗ (X, t) − χ∗ (Y, t)].

56

1 Kinematics

Hence, velocity differences are not objective unless X = Y in which case this relation degenerates. Yet, in mixtures, different particles (of different constituents) may occupy the same position (see Chap. 11). In that case, we may have xα = χα (X, t),

xβ = χβ (Y, t), subject to xα (t) = xβ (t).

Since xα = xβ =⇒ xα∗ = xβ∗ then vα∗ (X, t) − vβ∗ (Y, t) = Q(t)[vα (X, t) − vβ (Y, t)]. Velocity differences of particles in the same position are objective.

1.9.2 Time Derivatives Relative to Generic Frames of Reference For any function φ(x, t) = φ(χ(X, t), t) =: (X, t) the derivatives

∂t φ and φ˙ = ∂t 

denote the Eulerian (or local or spatial) time derivative and the Lagrangian (or material) time derivative. Hence ∂t φ represents the derivative of φ at a given point in space whereas φ˙ represents the derivative of φ holding the material point fixed; φ˙ is the derivative relative to the observer moving with the velocity of the material point. We now examine further time derivatives of interest in continuum physics. As in Sect. 1.4, consider a conventionally fixed frame of reference associated with orthonormal base vectors (e1 , e2 , e3 ) and a rotating frame of reference associated with orthonormal base vectors (k1 , k2 , k3 ). Consistent with (1.54), we let ki = Qei . ˙ i = QQ ˙ T ki , whence Hence, it follows k˙ i = Qe k˙ i = ω × ki , ˙ T . Hence for where ω is defined by (1.18) and equals the axial vector a of  = QQ any vector function u(X, t) we have u˙ = u i˙ki = u˙ i ki + u i k˙ i = u˙ i ki + u i ω × ki = u˙ i ki + ω × u Consequently, u˙ i ki , namely the derivative of u with respect to the frame at rest with {k j }, is given by u˙ i ki = u˙ − ω × u = u˙ − u.

1.9 Objectivity

57

Likewise, we determine the derivative of a tensor field. Let Ai j := ki · Ak j be the i j-component of the tensor A relative to the basis {ki }. A direct differentiation, subject to k˙ i = ki , implies that ˙ j + ki · Ak j + ki · Ak j . A˙ i j = ki · Ak Now, ki · Ak j = ki T · Ak j = ki · T Ak j = (T A)i j and ki · Ak j = ki · (A)k j = (A)i j . Hence

˙ − A − AT A˙ i j ki ⊗ k j = A

is the rate of change of A with respect to the frame which is spinning along with the basis {ki }. A particular derivative relative to the set of axes spinning with a suitable tensor ˙ T arises by choosing Q = R, R being the rotation tensor given by the polar  = QQ decomposition. Letting ˙ T  = Z := RR we denote the derivative by a superposed u= u˙ − Zu,

and write

˙ − Z A − AZT . A= A

Owing to [207], this derivative is named after Green and Naghdi. In a conceptually analogous way, we now look for the time derivative relative to non-orthonormal bases. Let X be referred to coordinates {X i } and let gi = ∂ X i χ(X, t). Hence g˙ i = ∂t ∂ X i χ = ∂ X i ∂t χ = ∂ X i v = ∂xh v∂ X i χh = (∂ X i χh ∂xh )v = (∂ X i χ · ∇)v = L gi . To compute g˙ i , we observe that by gk · gi = δik it follows

58

1 Kinematics

g˙ k · gi = −gk · g˙ i . Multiplication by gi and summation over i results in g˙ k = −(gk · g˙ i )gi = −(gk · L)gi )gi = −(gi · LT gk )gi = −LT gk . Accordingly, covariant and contravariant vectors satisfy g˙ i = Lgi ,

g˙ i = −LT gi .

As an application, we now look for the time derivative of covariant and contravariant components of a tensor. By Ai j = gi · Ag j ,

g˙ i = Lgi

we have ˙ j + gi · A˙g j = gi · (LT A + A ˙ + AL)g j . A˙ i j = g˙ i · Ag j + gi · Ag We define the covariant or lower convected derivative of A as !

A:= gi A˙ i j g j and hence we have

!

˙ + LT A + AL. A= A !

The covariant derivative A is also referred to as Cotter–Rivlin rate [108]. Likewise we consider the contravariant components Ai j = gi · Ag j ,

g˙ i = −LT gi .

Hence, we have ˙ j + gi · A˙g j = gi · (−LA + A ˙ − ALT )g j . A˙ i j = g˙ i · Ag j + gi · Ag Upon defining the contravariant or upper convected derivative14 of A as "

A:= A˙ i j gi ⊗ g j we have

"

˙ − LA − ALT . A= A 14

Or Lie derivative.

1.9 Objectivity

59

It is of interest that the mean of the two convected derivatives is just the Jaumann– ◦

Zaremba derivative A ([428], p. 36), !

1 (A 2

Moreover, it follows that



"

˙ + WT A + AW =: A . + A) = A !

"

A − A= 2 DA − 2A D. Within the models of fluids, the Oldroyd-B model is taken to describe the flow of viscoelastic fluids and involves the upper convected derivative. The model traces back to Oldroyd [348] and that is why the convected derivatives are named after Oldroyd. Further, a variant of the Oldroyd derivative is that of the (upper convected) Oldroyd derivative of a vector density and of a tensor density. They are denoted and defined as   ˙ − LA − ALT + (∇ · v)A. u = u˙ − Lu + (∇ · v)u, A=A This derivative is referred to as the Truesdell rate of vectors and tensors [424]. In a more direct way, we can obtain time derivatives by considering the following problem. Let (e1 , e2 , e3 ) be a conventionally fixed system of orthonormal base vectors and denote by (k1 , k2 , k3 ) a system of base vectors. The two bases are taken to be related by15 ki (t) = K(t)ei , i = 1, 2, 3, the dependence of K on the point being understood and not written. Two cases are of interest, namely K = F, K = J −1 F. If K = F then the vectors {ki } evolve as the space vectors in a deformation and they are said to convect as a tangent. If K = J −1 F then det K = 1 and the vectors {ki } are a system of unitary base vectors. In both cases, the basis {ki } need not be orthonormal. Since F˙ = LF and J˙ = J ∇ · v, if ki = Fei then ˙ i = LFei = Lki k˙ i = Fe while, if K = J −1 F, ˙ i = (LJ −1 F − ∇ · v J −1 F)ei = (L − (∇ · v)1)ki . k˙ i = (J −1 F˙ − J˙ J −2 F)e Let M=L 15

or

M = L − (∇ · v)1

Here we apply again the active transformation in that the base-transformation tensor K maps the initial vector into the corresponding final vector.

60

1 Kinematics

as appropriate. Hence

k˙ i = M ki .

Let {ki } be the corresponding dual basis so that ki · k j = δi j . Hence ˙ 0 = ki · k j = ki · k˙ j + k˙ i · k j = ki · k˙ j + M k˙ i · k j = ki · (k˙ j + MT k j ). Since {ki } is a basis, then it follows k˙ j = −MT k j . Consider the covariant components {u i } of a vector u, so that u = u i ki . Represent the vector u in the dual basis, u = u i ki . Hence u˙ = u˙ i ki + u i k˙ i = u˙ i ki − u i MT ki = u˙ i ki − MT u. Now u˙ i ki is the derivative of u relative to the dual basis {ki } and we have u˙ i ki = u˙ + MT u. Likewise, we compute the derivative of a tensor field, A = Ai j ki ⊗ k j ,

Ai j = ki · Ak j .

Time differentiation of Ai j gives ˙ j + ki · Ak˙ j = Mki · Ak j + ki · MAk j + ki · AMk j A˙ i j = k˙ i · Ak j + ki · Ak ˙ + MT A + AM)k j = ki · (A whence

˙ + MT A + AM. A˙ i j ki ⊗ k j = A

If instead we consider the contravariant components, u i = u · ki ,

Ai j = ki · Ak j

then we have u˙ i = u˙ · ki + u · k˙ i = u˙ · ki − u · MT ki = (u˙ − Mu) · ki ,

1.9 Objectivity

61

˙ j + ki · Ak˙ j = −MT ki · Ak j + ki · Ak ˙ j − ki · AMT k j A˙ i j = k˙ i · Ak j + ki · Ak ˙ − MA − AMT )k j , = ki · (A whence

˙ − MA − AMT . A˙ i j ki ⊗ k j = A

˙ − Mw, w˙ i ki = w

Letting M be replaced by L or L − (∇ · v)1 we obtain the rate of vectors and tensors relative to the bases {ki } and {ki }, ˙ + LT A + AL, A˙ i j ki ⊗ k j = A

u˙ i ki = u˙ + LT u,

˙ − LA − ALT A˙ i j ki ⊗ k j = A

u˙ i ki = u˙ − Lu, or u˙ i ki = u˙ + LT u − (∇ · v)u,

˙ + LT A + AL − 2(∇ · v)A, A˙ i j ki ⊗ k j = A

u˙ i ki = u˙ − Lu + (∇ · v)u,

˙ − LA − ALT + 2(∇ · v)A. A˙ i j ki ⊗ k j = A

To sum up, we now give a list of rates of vectors (u) and tensors (A). Rate of vectors

Rate of tensors ˙ − ZA − AZT A= A

Green–Naghdi rate u = u˙ − Zu

!

!

˙ + LT A + AL A= A

u = u˙ − Lu

"

˙ − LA − ALT A= A





Cotter–Rivlin rate u = u˙ + LT u Oldroyd rate

"

˙ − LA − ALT + (∇ · v)A u = u˙ − Lu + (∇ · v)u A = A

Truesdell rate

$

F-rate

$ ˙ + LT A + AL − 2(∇ · v)A u = u˙ + LT u − (∇ · v)u A = A

J −1 F-rate

u = u˙ − Lu + (∇ · v)u

%

%

˙ − LA − ALT + 2(∇ · v)A A= A

In the Green–Naghdi rate, Z is the spin tensor associated with the rotation R of !

!

the polar decomposition F = RU. The Cotter–Rivlin rate u, A is also referred to as covariant rate and is the rate induced by the evolution of the covariant components. "

"

Likewise, the Oldroyd rate u, A is also referred to as contravariant rate. The Truesdell 

rate A is motivated by a property of the second Piola stress which is shown in the next chapter. 



The convected rate u, A arises in connection with the surface integral of vectors and tensors, 

u= u˙ − uLT + (∇ · v)u,



˙ − ALT + (∇ · v)A. A= A

62

1 Kinematics 



The convected time derivative u coincides with the Truesdell rate u, in the case of vectors. In rigid motions, D = 0 and then L = W. Moreover, U = 1 and then ˙ T = LFFT = L = W. ˙ T = FF Z = RR Consequently, all of the derivatives in the table coincide with the Jaumann–Zaremba derivative,16 Rate of vectors

Rate of tensors



˙ − WA − AWT A= A

Jaumann–Zaremba rate u = u˙ − Wu



This means that differences between the derivatives arise in non-rigid motions of the continuum. Both the Jaumann–Zaremba rate and the Green–Naghdi rate are the rate, of vectors and tensors, relative to an observer rotating with the angular velocity associated with the spin tensors W and Z, respectively. To avoid ambiguities, though, here17 the corotational derivative is identified with the Jaumann–Zaremba rate. In the next section, we define and investigate the form of objective time derivatives in connection with the requirements of (rate-type) constitutive equations. We find that the derivatives so established are particular cases of a wide class of objective time derivatives.

1.10 Objective Time Derivatives Let u be any vector field. The values u∗ in F ∗ and u in F , at the same point X ∈ R and time t ∈ R, are related by u∗ = Qu. We are interested in the material time derivative in which we follow the evolution of a chosen material point. The natural view is to fix the point via the position vector X ∈ R, and hence we look at the vectors u within the Lagrangian description ˜ ˜ t) and u = u(X, t). Hence u∗ (X, t) = Qu(X, ˙ = Qu ˙ + Qu˙ u˙∗ = Qu

16

Throughout the Jaumann–Zaremba derivative (or rate) is also denoted as Jaumann derivative or corotational derivative. 17 As, e.g., in [216].

1.10 Objective Time Derivatives

whence

63

˙ + Qu. ˙ u˙∗ = Qu

Any other time derivative might be considered, provided it is understood that the derivative of u∗ and that of u are considered at the same point of the body. Throughout this section ˙ T,  := QQ Q being related to the change of frame. For later convenience we observe that ˙ = QQ ˙ T Q = Q Q and hence

so that

˙ = Qu = u∗ Qu ˙ u˙∗ = u∗ + Qu.

1.10.1 General Form of Objective Derivatives A derivation [26, 224] ∂ of a vector algebra V , over R, is a rule ∂ : V → V such that, for every u, w ∈ V and α, β ∈ R, ∂(αu + βw) = α∂u + β∂w, ∂(u ⊗ w) = (∂u) ⊗ w + u ⊗ (∂w). These conditions are referred to as the linearity and the Leibniz rule. Owing to linearity, a derivation ∂ such that ∂u has the meaning of a time derivative is considered in the form ∂u = u˙ − u (1.64) where  : V → V is a function on R × R. Accordingly ˙ − w). ∂(u ⊗ w) = (u˙ − u) ⊗ w + u ⊗ (w The function  may depend on properties of the continuum and hence may depend on the chosen reference. Accordingly a possible frame dependence is allowed so that ∂(u∗ ) = u∗ − ∗ u∗ ,

∂u = u − u.

A derivative ∂u of a vector u is objective if, under the Euclidean transformation (1.54), it is

64

1 Kinematics

Q∂u = (∂u)∗ = ∂(u∗ ) = ∂(Qu), u ∈ V . The following statement gives a characteristic condition for the set of operators  to make (1.64) an objective derivative. Proposition 1.1 The derivative (1.64) of a vector u is objective if and only if ∗ = QQT + .

(1.65)

Proof If ∂u = u˙ − u is objective then necessarily ˙ − ∗ Qu = ∂u∗ = (∂u)∗ = Q(u˙ − u). Qu Hence, we have

˙ − ∗ Qu = −Qu. Qu

The arbitrariness of u implies that ˙ ∗ Q = Q + Q. Right multiplication by QT leads to the conclusion (1.65). Conversely, if ∗ = QQT +  then ˙ − Qu − u = Q(u˙ − u), u˙∗ − ∗ u∗ = Qu that is ∂u∗ = (∂u)∗ .



To determine the transformation law of the derivative of a tensor we now consider the derivative of a dyadic product u ⊗ w. Proposition 1.2 The derivative ∂(u ⊗ w) is objective if and only if (1.65) holds. Proof The objectivity requirement [∂(u ⊗ w)]∗ = ∂(u∗ ⊗ w∗ ) results in ˙ − ∗ Qu) ⊗ Qw + Qu ⊗ (Qw ˙ − ∗ Qw). ˙ − w) = (Qu Q(u˙ − u) ⊗ Qw + Qu ⊗ Q(w

Upon some rearrangements, by the arbitrariness of u and w we find ˙ − ∗ Qu + Qu = 0, Qu whence the transformation law (1.65) follows. Conversely, if (1.65) holds, then the objectivity requirement is satisfied. 

1.10 Objective Time Derivatives

65

The derivative of a dyadic product yields ˙ − w) = u ⊗˙ w − (u ⊗ w) − (u ⊗ w)T . ∂(u ⊗ w) = (u˙ − u) ⊗ w + u ⊗ (w

Hence, we define the derivative of a tensor K in the form ˙ − K − KT . ∂K := K With this derivation ∂ we may proceed and determine the corresponding properties. Proposition 1.3 If  satisfies (1.65), then the derivative ∂K is objective. Proof By direct substitutions, we find ∂(K∗ ) = K˙∗ − ∗ K∗ − K∗ ∗T ˙ = QKQT − (QQT + )QKQT − QKQT (QQT + )T ˙ − K − KT )QT = (∂K)∗ = Q(K and the conclusion follows.



Based on the transformation property (1.65), we observe that the operator  is non-unique as is shown by the following ˜ satisfies (1.65) and B is any tensor, namely Proposition 1.4 If  ˜ T + , ˜ ∗ = QQ 

B∗ = QBQT ,

˜ + B satisfies (1.65). then  =  Proof A direct substitution leads to ∗

˜ + B∗ = QQ ˜ T +  + QBQT = Q( ˜ + B)QT +  = QQT + , ∗ =  which proves the assertion.



As a consequence, given an operator satisfying (1.65), we obtain infinitely many operators satisfying (1.65) by merely adding any tensor B. The derivative of vectors and tensors need not be the same in that ∂u and ∂K may involve different operators . What is more, the derivative of a tensor ∂K may involve two different operators  at the same time. To see this, let 1 and 2 satisfy (1.65) and consider the derivative ˙ − 1 K − K2T . ∂K = K

(1.66)

66

1 Kinematics

Now ˙ (∂K∗ ) = QKQT − (Q1 QT + )QKQT − QKQT (Q2 QT + )T ˙ − 1 K − K2T )QT = (∂K)∗ = Q(K and hence the derivative (1.66) is objective. It is worth remarking that if 1 and 2 satisfy (1.65), then  = 1 + 2 does not satisfy (1.65). This is so because (1 + 2 )∗ = ∗1 + ∗2 = Q1 QT +  + Q1 QT +  = Q(1 + 2 )QT + 2. Objective time derivatives are now established by showing that continuum physics provides infinitely many operators  satisfying (1.65). By L∗ = QLQT + , whence

W∗ = QWQT + ,

D∗ = QDQT ,

it follows that the operators  = W + λD + ν(∇ · v)1

(1.67)

constitute a two-parameters family of operators satisfying (1.65). On the other hand, letting ˙ T ∈ Skw, Z = RR R = FU−1 , we have

˙ T ˙ T + QRR ˙ T QT = QZQT + . = QQ Z∗ = QR(QR)

Hence Z satisfies (1.65). As a consequence, also  = Z + λD + ν(∇ · v)1

(1.68)

is a two-parameters family of operators satisfying (1.65). In the Jaumann–Zaremba derivative  = W; in the Cotter–Rivlin derivative  = W − D, (λ = −1, ν = 0); in the Oldroyd derivative  = W + D, (λ = 1, ν = 0); in the Truesdell derivative  = W + D − (∇ · v)1, (λ = 1, ν = −1) for vectors and  = W + D − 21 (∇ · v)1, (λ = 1, ν = − 21 ) for tensors; in the F-rate  = W − D + (∇ · v)1, (λ = −1, ν = 1); in the J −1 F-rate  = W + D − (∇ · v)1, (λ = 1, ν = −1). Instead, in the Green–Naghdi derivative,  = Z. There are models of material behaviour where the derivative is a superposition of objective derivatives in the form

1.10 Objective Time Derivatives ♦

67

A=



k νk



∂k A,

k νk

= 1.

Consider the objective derivatives ˙ − WA + AW + ξk (DA + AD), ∂k A := A

(1.69)

{ξk } being real numbers; this is the case for the derivatives examined in the previous section. Proposition 1.5 A superposition of the objective derivatives (1.69) is objective. Proof The superposition results in ♦

A= (





˙ −(

k νk )A

whence



k νk )WA

+(



k νk )AW

+(



˙ − WA + AW + ( A= A



k νk ξk )(DA

k νk ξk )(DA

+ AD),

+ AD).

˙ − WA + AW and DA + AD are objectives, then the conclusion Since A  follows.

Objective Derivatives and Trace Conservation It is of interest to examine the operator (1.67) (and (1.68)) in connection with the trace of tensors, say A, under the derivative ˙ − A − AT . ∂A = A In view of (1.67), we have the generic (objective) derivative 

˙ − WA − AWT − λ(DA + AD) − ν(∇ · v)A. A := A For any tensor A tr (AW − WA) = Ai j W ji − Wi j A ji = 0. Instead, tr (DA + AD) = Di j A ji + Ai j D ji = 2tr DA, and tr DA = 0 if A ∈ Skw. Hence it follows that  tr A= tr˙A − 2λtr (DA) − ν(∇ · v)tr A.

68

1 Kinematics

Consequently tr A = 0

=⇒



tr A= 0,

if λ = 0 or A ∈ Skw. The trace-free condition is preserved by the derivative 

˙ − WA − AWT − ν(∇ · v)A, A= A for any scalar function ν. Hence, this happens for the corotational derivative modified by the addition of the term ν(∇ · v)A for any function ν and any velocity field v.

Derivatives of the Deformation Gradient Particular results follow about the derivatives of the deformation gradient, as a consequence of the relation F˙ = LF and of the transformation law F∗ = QF. If we apply the Oldroyd rate, it follows "

F= F˙ − LF = 0. The deformation gradient is constant relative to the Oldroyd rate. Consequently, the left Cauchy–Green tensor B too is constant relative to the Oldroyd rate in that "

˙ − LB − BLT = FF ˙ T + FF˙ T − LB − BLT = LFFT + FFT LT − LFFT − FFT LT = 0. B= B

By applying the Jaumann–Zaremba rate, it follows ◦

F= F˙ − WF = F˙ − 21 (L − LT )F = D F. ◦

In rigid motions F = R, D = 0, and hence F= 0. Thus, in rigid motions, F is constant relative to the corotational derivative. Moreover, the Finger tensor B−1 is constant relative to Cotter–Rivlin rate in that (B−1 )˙ = −LT B−1 − B−1 L !

and hence B−1 = 0. By applying the Truesdell rate, it follows 

F= F˙ − LF + (∇ · v)F = (∇ · v)F. 

The Truesdell derivative of F, F, is zero for isochoric deformations.

1.10 Objective Time Derivatives

69

1.10.2 Rivlin–Eriksen Tensors Rivlin–Ericksen tensors [380] are sometimes applied in the modelling of materials with memory properties via suitable dependences on higher-order derivatives. We start with the observation that the right Cauchy–Green tensor C = FT F is invariant under a Euclidean transformation (1.54). Hence, a time derivative of any order of C is invariant too (and hence objective). The first-order derivative results in ˙ = FT 2 D F. C Let A1 = 2 D and hence

˙ = FT A1 F. C

A further differentiation yields ˙ 1 + LT A1 + A1 L)F, ¨ = FT (A C whence

˙ 1 + LT A1 + A1 L. A2 := A

¨ = FT A2 F, C

Denote the n-th order time derivative as (n)

C = FT An F.

A further differentiation yields (n+1)

˙ n + LT An + An L. An+1 := A

C = FT An+1 F,

We have thus proved by induction that (n)

C = FT An F,

˙ n−1 + LT An−1 + An−1 L, An := A

and A1 = 2 D. The sequence of tensors {An } so determined constitutes the set of Rivlin–Ericksen tensors. The tensors {An } are symmetric. This statement is proved by observing that A1 = 2 D ∈ Sym,

An ∈ Sym =⇒ An+1 ∈ Sym.

This property in turn follows by direct check, T ˙ n + (LT An )T + (An L)T = A ˙ n + LT An + An L = An+1 . An+1 =A

It is apparent that Rivlin–Ericksen tensors might be defined by

70

1 Kinematics

A1 = 2 D, where the superposed

!

!

An+1 =An ,

n = 1, 2, ...,

denotes the Cotter–Rivlin derivative. Also in this way we

!

conclude that {An } is a sequence of objective tensors. With this view we might define other sequences of Rivlin–Ericksen-like tensors by merely assuming another time derivative, A1 = 2 D,

An+1 = Dt An ,

Dt denoting an objective time derivative. This would be the case with the sequences "

{An },



{An },

with A1 = 2 D for both sequences, where the Oldroyd and the Jaumann–Zaremba rates are involved.

1.10.3 Spins and Angular Velocities Under Euclidean Transformations We now examine the properties of spins, angular velocities, and axial vectors under Euclidean transformations such that x → x∗ in the the form x∗ = c(t) + Q(t)x, where Q is a rotation. As it happens with micropolar media, Sect. 11.1, at each point of the body an angular velocity ω is defined as that of the particle at the point. The balance of energy then involves a power proportional to the difference ω − w, where w is the axial vector of the spin W. It is then natural here to investigate the possible objectivity of ω, w, and ω − w. By (1.63), the spin W obeys the transformation law W∗ = QWQT + . We then compute the axial vector w of W to have T w∗ = el 21 lh j Q j p W pk Q kh + a .

Since lh j Q j p Q hk = ikp Q li

1.10 Objective Time Derivatives

71

then T = el Q li wi = Qw. el 21 lh j Q j p W pk Q kh

Hence, we obtain

w∗ = Qw + a .

(1.70)

Now, the angular velocity obeys the transformation law (1.58), ω ∗ = Qω + a . Thus, both the angular velocity ω and the axial vector w are not objective. Yet the difference ω − w satisfies (1.71) ω ∗ − w∗ = Q(ω − w) and then it is objective. A further question is in order about the objectivity of the velocity gradients. First look at the material gradient ∇R w. Since ∇R  = 0 then (1.70) implies that (∂ X K wi )∗ = Q ir ∂ X K wr . Thus, ∇R w is an objective vector. As to the spatial gradient we observe ∗ (∂xk wi )∗ = ∂xk∗ wi∗ = ∂x j wi∗ ∂xk∗ x j = Q −1 jk ∂x j wi .

Hence, we have (∂xk wi )∗ = Q k j ∂x j (Q ir wr + i jk k j ) = Q k j Q ir ∂x j wr . We then conclude that ∇w is an objective tensor.

Chapter 2

Balance Equations

Borrowing from the mechanics of systems of particles, the balance equations for mass, linear momentum, and angular momentum are established. Next, the balance of energy is obtained after having realized that energy in general requires a more general framework including quantities of non-mechanical character. This is done first by considering fields in the spatial (Eulerian) description for regions that convect with the body. As is customary, the starting point is the pertinent axiom about an arbitrary region (global law), and hence the corresponding local balance equation is derived. The key step of the mathematical development is Reynolds’ transport theorem, mainly for regions convecting with the body. The balance of entropy is made formal by assuming that the entropy rate, deprived of the divergence of a pertinent flux and of an entropy supply, is non-negative, this quantity being, by definition, the entropy production. Next, after a brief introduction to electromagnetic fields, balance equations are established for electromagnetism in deformable bodies. The approach stems from the physical insights about particle behaviour. Hence, balances are obtained for linear and angular momentum, energy, and entropy. Moreover, jump relations are derived for balance properties across singular surfaces.

2.1 Balance of Mass The (assumed) conservation of mass, irrespective of the motion and the size of a body, is expressed by the following Axiom. The mass of any convecting subregion of a body is constant in time. To represent mathematically the content of the axiom, we let ρ(x, t) be the (positivevalued) mass density in the spatial description so that the mass M(Pt ), within the region Pt = χt (P), is given by M(Pt ) = ∫ ρ dv. Pt

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_2

73

74

2 Balance Equations

The axiom then reads: if Pt = χt (P), then ∫ ρ dv is constant in time.

Pt

Denote by ρ R (X) the mass density in the reference configuration so that M(P) = ∫ ρ R dv R . P

Hence, if Pt = χt (P), the constancy of mass means ∫ ρ dv = ∫ ρ R dv R .

Pt

(2.1)

P

Otherwise, the constancy of mass is expressed by d ∫ ρ dv = 0. dt Pt

(2.2)

By a change of variables, Pt  x → X ∈ P, we have ∫ ρ dv = ∫ ρ J dv R .

Pt

P

Hence, (2.1) results in ∫ (ρ R − ρ J )dv R = 0. P

The arbitrariness of P and the continuity of the integrand imply that ρ R − ρ J = 0,

∀X ∈ P, t ∈ R.

Consequently, the product ρ J is constant in time and ρ=

ρR . J

(2.3)

The mass density ρ in the current configuration Pt is 1/J times the mass density ρ R in the reference configuration. Since ρ R depends only on X then (2.3) shows that ρ depends on time as J −1 . If instead we start with (2.2), we assume ρJ is a C 1 function on P and, by means of Reynolds’ transport theorem, we have 0=

d ∫ ρ dv = ∫ (ρ˙ + ρ∇ · v)dv. dt Pt Pt

2.1 Balance of Mass

75

The arbitrariness of P implies that ∂t ρ + ∇ · (ρv) = 0.

ρ˙ + ρ∇ · v = 0,

(2.4)

The differential equations (2.4) are referred to as the continuity equations for the mass. By (2.4), the specific volume (volume per unit mass) υ :=

1 ρ

satisfies υ˙ = υ ∇ · v. If ∇ · v = 0 both ρ and υ are constant along the path of any point (isochoric motions). In incompressible bodies, only isochoric motions are possible and the velocity field is subject to the constraint ∇ · v = 0. The result (2.4) about the mass density allows us to restate Reynolds’ transport theorem in a more convenient form. Let φ = ρϕ so that φ is a density, per unit volume, whereas ϕ is the corresponding density, per unit mass. Upon direct substitution and use of (2.4), it follows φ˙ + φ ∇ · v = (ρ˙ + ρ ∇ · v)ϕ + ρϕ˙ = ρϕ. ˙ Hence, by (1.23), we obtain Reynolds’ transport theorem in the form d ∫ ρ ϕ dv = ∫ ρ ϕ˙ dv. dt Pt Pt

(2.5)

Equation (2.5) holds for any C 1 density ϕ irrespective of the scalar, vector, or tensor character of ϕ. As a comment, we observe that, owing to (2.4), there is a direct connection between the Oldroyd rate and the Truesdell rate, for vectors and tensors. Since ˙ = 1 [u˙ + u∇ · v], u/ρ ρ then we have 



u = ρ u/ρ, Instead 



u = ρ u/ρ,

˙ = 1 [A ˙ + A∇ · v] A/ρ ρ 







A = ρ A/ρ .

A = ρ A/ρ .

76

2 Balance Equations

In isochoric motions, ∇ · v = 0, the F-rate coincides with the Cotter–Rivlin rate, while the J −1 F-rate coincides with the Truesdell rate and the Oldroyd rate. Referential densities For any density field ϕ, per unit mass, ρϕ is the corresponding field per unit volume and ϕ R := ρ R ϕ = J ρϕ is the density field per unit volume in the reference configuration. We then call ϕ R the referential density field.

2.2 Balance of Linear Momentum Throughout the dynamic equations of mechanics, i.e. evolution equations of linear momentum and angular momentum are assumed to hold in inertial frames. For a system of particles, dynamics is based on the assumption that the linear momentum P changes in time according to ˙ =F P where F is the total force acting on the system. For a continuous body, we let P (Pt ) := ∫ ρ v dv. Pt

The force F (Pt ) acting on Pt consists of two terms: body forces exerted on the interior points of Pt , contact forces through the boundary ∂Pt of Pt . Body forces are modelled by assuming that there is a force density b (per unit mass) so that the whole body force on Pt is ∫ ρ b dv,

Pt

where the dependence of ρ and b on (x, t) is understood To model the contact forces, we consider a surface S in the current configuration, passing through x, at time t, with unit normal n (Fig. 2.1). Denote by positive side of S that on the portion of space into which n points. By Cauchy’s hypothesis, there is a surface-traction field t such that t(x, n, t) is the force exerted on the positive side of S, at x, per unit area, at time t. Hence we let ∫ t(x, n, t)da S

be the force on S at time t. The whole force on Pt is then given by

2.2 Balance of Linear Momentum

77

Fig. 2.1 The vector t is the force per unit area of S at the point x

F (Pt ) = ∫ ρb dv + ∫ t da. Pt

∂Pt

Incidentally, two tangent surfaces at x at time t, are subject to the same traction t(x, n, t). With this assumption on P and F in a continuous body, we state Axiom. The motion of a continuous body is governed by the equation ˙ (Pt ) = F (Pt ), P

(2.6)

for any convecting region Pt . By the transport theorem (2.5), we can write (2.6) in the form ∫ ρ(˙v − b)dv = ∫ t da. (2.7) Pt

∂Pt

As with any equality between volume and surface integrals, this condition cannot hold unless the integrand t is appropriate. To show that a restriction on t is required, we argue as follows. Let ρ(˙v − b) be bounded, say |ρ(˙v − b)| < κ. We then find that    ∫ ρ(˙v − b)dv  ≤ ∫ |ρ(˙v − b)|dv < κ vol Pt . Pt

Hence

Pt

   ∫ t da  < κ vol Pt ∂Pt

and this implies the requirement 1 volPt ∫ t da → 0 as → 0. area ∂Pt ∂Pt area ∂Pt

(2.8)

For formal simplicity, we omit writing the dependence on time t. The explicit requirement on t is determined by means of the following

78

2 Balance Equations

Fig. 2.2 The tetrahedron T employed in the proof of Cauchy’s theorem

Theorem 2.1 (Cauchy) If t(x, n) is a continuous function and ρ(˙v − b) is bounded then the condition (2.8) implies the existence of a tensor field T such that t(x, n) = T(x) n.

(2.9)

Proof Given any x ∈ Rt \ ∂Rt and an orthonormal basis (i1 , i2 , i3 ), we choose a tetrahedron T such that ∂T = S1 ∪ S2 ∪ S3 ∪ Sδ where S1 , S2 , S3 are the faces in the coordinate planes through x while Sδ is the oblique face, opposite to the vertex x. The distance δ of Sδ from x is so small that T ⊂ Rt \ ∂Rt . The unit outward normals to S1 , S2 , S3 , Sδ are −i1 , −i2 , −i3 , k. It is k · i j > 0, j = 1, 2, 3 (Fig. 2.2). Let A1 , A2 , A3 , Aδ denote the areas of S1 , S2 , S3 , Sδ . Since A j /Aδ < 1, j = 1, 2, 3, then   Aj  area ∂T = Aδ 1 + j = λAδ , 1 < λ < 4, Aδ and hence the requirement (2.8) yields 1 ∫ t da → 0 as δ → 0. Aδ ∂T Let y ∈ ∂T . Upon substitution of t(y, n) = t(x, n) + [t(y, n) − t(x, n)],

n = −i1 , −i2 , −i3 , k,

it follows that ∫ t(y, n)da =

∂T

where

tˆ =

 j



j t(x, −i j )A j

+ t(x, k)Aδ + tˆ,

∫ [t(y, −i j ) − t(x, −i j )]da + ∫ [t(y, k) − t(x, k)]da.

Sj



(2.10)

2.2 Balance of Linear Momentum

79

We now estimate tˆ. Let d be the maximum distance of a point of ∂T from x. Then, by the continuity of t(·, n), for any > 0 there is d˜ such that d < d˜ implies |t(y, −i j ) − t(x, −i j )| < and |t(y, k) − t(x, k)| < . Hence d < d˜ implies that |tˆ| < (

 j

A j + Aδ ) = λ Aδ .

Since δ → 0 implies d → 0 then a suitably small δ makes as small as we please. Consequently, 1 tˆ = O(δ), Aδ whence

 Aj 1 ∫ t da = j t(x, −i j ) + t(x, k) + O(δ). Aδ ∂T Aδ

The requirement (2.10) implies that  Aj t(x, −i j ) + t(x, k) = 0. j Aδ

(2.11)

By the continuity of t(x, n), we can evaluate the limit of (2.11) as k → i1 . It follows t(x, −i1 ) + t(x, i1 ) = 0. The arbitrariness of i1 implies that t(x, −n) = −t(x, n) for any unit normal n. Now observe that

(2.12)

Aj = k · ij. Aδ

To prove this relation, we consider, e.g. Sδ and S3 (Fig. 2.3). The ratio A3 /Aδ equals the ratio of the heights, h 3 / h δ relative to the common segment of the two triangles.

Fig. 2.3 A section of the tetrahedron T to prove that A3 /A = k · i3

80

2 Balance Equations

If θ is the angle between the two heights, then h 3 / h δ = cos θ = i3 · k. The analogue holds for A j /Aδ , j = 1, 2. Hence, by use of (2.12), Eq. (2.11) results in 

t(x, k) =

j t(x, i j ) k

· ij.

(2.13)

Equation (2.13) is derived upon the restriction k · i j > 0, j = 1, 2, 3. We show that (2.13) holds without any restriction on k. Let {f j } be a new basis such that i j = (sgn k · f j )f j . By definition, k · i j = (sgn k · f j )k · f j > 0, as required, while no restriction is placed on k · f j . By (2.13) it follows   t(x, k) = j t(x, i j ) k · i j = j t(x, sgn (k · f j )f j ) sgn (k · f j )k · f j   = j sgn (k · f j )t(x, f j ) sgn (k · f j )k · f j = j t(x, f j ) k · f j . Hence Eq. (2.13) holds for any basis. In particular it holds relative to the basis {ei } chosen for the representation of vectors and tensors so that 

i t(x, ei ) n

t(x, n) =

· ei ,

for any unit vector n. This means that t(x, n) = [



i t(x, ei )

⊗ ei ]n

i t(x, ei )

⊗ ei .

and the conclusion follows with T(x) =





The intermediate result (2.12) has a remarkable mechanical meaning. Since t(n) and t(−n) are the tractions on opposite sides (at a point) of a surface, then (2.12) means that t(n) and t(−n) are opposite to each other. This property may be regarded as Newton’s law of action and reaction. This result is consistent with the fact that letting k approach any of the vectors {i j } makes the tetrahedron to approach a membrane with vanishing thickness and hence the volume integral approaches zero. This in turn means that (2.12) expresses the equilibrium of surface forces. Conceptually, we started with the observation that equalities between surface and volume integrals cannot hold for generic integrands. By Cauchy’s theorem, the traction t(n) is required to depend on n in the form (2.9). A natural question now arises as to whether Eq. (2.9) makes in fact the whole contact force be equal to a volume integral (sufficient condition) for any boundary surface ∂Pt . Let a be any constant vector and consider

2.2 Balance of Linear Momentum

81

a · ∫ T n da = ∫ (TT a) · n da. ∂Pt

∂Pt

By the divergence theorem ∫ (TT a) · n da = ∫ ∇ · (TT a) dv = ∫ ∂x j Th j ah dv = a · ∫ ∇ · T dv

∂Pt

Pt

Pt

Pt

provided we define ∇ · T := ∂x j Th j eh . Hence, the balance of linear momentum (2.7) can be written in the form ∫ ρ(˙v − b)dv = ∫ ∇ · T dv

Pt

Pt

or ∫ [ρ(˙v − b) − ∇ · T]dv = 0;

Pt

thanks to the existence of T now the balance of linear momentum is expressed by two volume integrals. The arbitrariness of the convecting region Pt and the assumed continuity of the integrand imply the local balance equation ρ v˙ = ∇ · T + ρ b,

(2.14)

that is the local equation of motion. In suffix notation Eq. (2.14) reads ρ v˙i = ∂x j Ti j + ρ bi . The tensor T is called the stress tensor. Since Tn is the force per unit area, on a positive side of a surface, then T n can be decomposed into the sum of two terms. One is the normal traction (n · Tn)n = (n ⊗ n)Tn. The other one is the shear traction (1 − n ⊗ n)Tn. Consequently, Te j is the traction on the surface orthogonal to e j and ei · Te j is the i-th component. Hence Ti j := ei · Te j is the i-th component of the traction on the positive side of the surface orthogonal to ej. By Eq. (1.33), we can write the equation of motion in the form

82

2 Balance Equations

ρ(∂t v + ∇ 21 v2 +  × v) = ∇ · T + ρ b, where  = ∇ × v is the vorticity.

2.3 Balance of Angular Momentum Let O¯ be any point in space and let r be the position vector of a point of the body ¯ We say that O¯ is a base point (or reference point). Let o be the with respect to O. position vector of O¯ so that x = o + r. We let O¯ be fixed in space and hence o˙ = 0. For a system of particles, the angular momentum (or moment of momentum) L changes in time by the action of the moment of forces, or torque, T according to L˙ O¯ = T O¯ . Based on this equation, we state the following Axiom. The motion of a continuous body is governed by the equation L˙ O¯ (Pt ) = T O¯ (Pt ),

(2.15)

for any convecting region Pt . For a continuous body, we let L O¯ (Pt ) := ∫ r × ρ v dv. Pt

The moment (or torque) T O¯ (Pt ) acting on Pt consists of the terms due to the body force b and the traction T n, T O¯ (Pt ) = ∫ r × T n da + ∫ r × ρ b dv. ∂Pt

Pt

Equation (2.15) can then be written in the form d ∫ r × ρ v dv = ∫ r × T n da + ∫ r × ρ b dv. dt Pt Pt ∂Pt First observe that ˙ v = r˙ × v + r × v˙ = v × v + r × v˙ = r × v˙ . r× Thus, by Reynolds’ transport theorem d ∫ r × ρ v dv = ∫ r × ρ v˙ dv. dt Pt Pt

(2.16)

2.3 Balance of Angular Momentum

83

The surface integral can be rearranged by observing that r × T n = i jk r j Tkh n h ei and hence ∫ r × T n da = i jk ei ∫ r j Tkh n h da.

∂Pt

∂Pt

By the divergence theorem, it follows ∫ r j Tkh n h da = ∫ ∂xh (r j Tkh )dv = ∫ (δh j Tkh + r j ∂xh Tkh )dv. Pt

∂Pt

Pt

Consequently, we have

i jk ei ∫ r j Tkh n h da = ∫ (ϒ + r × ∇ · T)dv, Pt

∂Pt

ϒ := i jk Tk j ei ,

and Eq. (2.16) yields ∫ [r × (ρ˙v − ∇ · T − ρb) − ϒ]dv = 0.

Pt

In view of the equation of motion (2.14), it follows ∫ ϒ dv = 0.

Pt

The arbitrariness of Pt and the assumed continuity of ϒ imply1 ϒ = 0.

(2.17)

Conversely, if (2.14) and (2.17) hold, then (2.7) and (2.16) hold too. Equation (2.17) means that

i jk Tk j = 0,

i = 1, 2, 3.

Hence T12 = T21 , T13 = T31 , T23 = T32 or T = TT . To sum up, we can write Proposition 2.1 For smooth fields ρ, v, b, T, the global balance Eqs. (2.7) and (2.16) imply that the local balance Eqs. (2.14) and (2.17) hold. 1

Observe ϒ is twice the axial vector of skwT.

84

2 Balance Equations

For completeness, we now determine the balance of angular momentum relative to a moving base point. Let B be the moving base point and let r B be the position ¯ so that vector of a point relative to B. Let d be the position vector of B, d = B − O, ¯ supposed to be fixed. First observe r = d + r B is the position vector relative to O, T O¯ = ∫ r × ρb dv + ∫ r × T n da = ∫ r B × ρb dv + ∫ r B × T n da + d × F (Pt ), Pt

∂ Pt

Pt

∂ Pt

namely T O¯ = T B + d × F (Pt ). Now, by L O¯ (Pt ) = ∫ r × ρv dv = ∫ r B × ρv dv + d × ∫ ρv dv Pt

Pt

Pt

we obtain d d ∫ r B × ρv dv = L O¯ (Pt ) − v B × ∫ ρv dv − d × ∫ ρ v˙ dv. dt Pt dt Pt Pt Consequently d L B (Pt ) = T O¯ − d × F (Pt ) − v B × P , dt whence

d L B (Pt ) = T B − v B × P , dt

as it happens in the classical mechanics of systems of particles. As is the case in electromagnetism, there are continua subject to couple fields. This means that there is a couple density field,2 per unit mass, l such that ∫ ρ l dv

Pt

is the couple acting on Pt . In this circumstance, the balance of angular momentum involves the new term ρl so that d ∫ r × ρ v dv = ∫ r × T n da + ∫ ρ[l + r × b]dv. dt Pt Pt ∂Pt By repeating the previous procedure, we obtain ∫ (ρ l + ϒ)dv = 0,

Pt

2

Or couple per unit volume t = ρl.

2.4 Balance of Energy

85

whence ρ l + ϒ = 0. Hence, T is (no longer symmetric and is) required to satisfy the condition

i jk Tk j + ρli = 0,

i = 1, 2, 3.

As an example, if ρ l = M × B then

i jk (Tk j + M j Bk ) = 0. Consequently skw T = skw M ⊗ B or T + B ⊗ M ∈ Sym.

2.4 Balance of Energy Consider a system of N particles subject to the equations of motion m i v˙ i = fi ,

i = 1, 2, ..., N ,

fi denoting the force acting on the i-th particle. Inner multiply by vi , m i vi · v˙ i = vi · fi , and sum over i to obtain

 d 1 m i vi2 = i vi · fi ; dt 2 i

the derivative of the kinetic energy is equal to the power of the forces applied to the system. This might suggest that an analogous balance law occurs in continuum mechanics provided appropriate definitions are considered for the kinetic energy and the power. The kinetic energy K (Pt ) of the convecting subregion Pt is defined by K (Pt ) = ∫

Pt

1 ρ v2 2

dv.

The power (Pt ) on Pt of the body force and traction fields is defined as (Pt ) = ∫ ρ b · v dv + ∫ v · Tn da. Pt

∂Pt

86

2 Balance Equations

We might expect that any subregion obeys the balance equation d K (Pt ) = (Pt ). dt

(2.18)

Now, by Reynolds’ transport theorem d K (Pt ) = ∫ ρ v · v˙ dv. dt Pt We employ the identity v · (∇ · T) = vi ∂x j Ti j = ∂x j (vi Ti j ) − Ti j ∂x j vi = ∇ · (vT) − T · L and observe that since T is symmetric then T · L = T · D. Consequently, upon substitution of ρ˙v = ∇ · T + ρ b, it follows d K (Pt ) = ∫ ρ b · v dv + ∫ v · Tnda − ∫ T · D dv. dt Pt Pt ∂Pt Hence, we deduce

d K (Pt ) = (Pt ) − ∫ T · D dv. dt Pt

This relation is the content of the kinetic energy theorem (or work-energy theorem or living forces method according to Euler). This in turn implies that Eq. (2.18) holds only if T · D = 0. Alternatively, if we start from (2.18) d ∫ 1 ρ v2 dv = ∫ ρ b · v dv + ∫ v · Tn da dt Pt 2 Pt ∂Pt and employ the divergence theorem so that ∫ v · Tnda = ∫ [v · (∇ · T) + T · D]dv

∂Pt

Pt

then we obtain ∫ [v · (ρ˙v − ρb − ∇ · T) + T · D]dv = 0.

Pt

Owing to the equation of motion (2.14), it follows

2.4 Balance of Energy

87

∫ T · D dv = 0,

Pt

T · D = 0.

Both views show that a seemingly redundant term T · D occurs in the balance of energy. Incidentally, for an inviscid incompressible fluid T · D = 0 but this is merely a particular case of continuous body. Conceptually, a simple and natural way to overcome this difficulty is to regard (2.18) as an overly poor scheme in that a correct balance of energy should involve also fields of non-mechanical character. In this sense, we proceed by keeping the mathematical structure of the time derivative of a volume integral equal to the sum of a volume integral and a surface integral d ∫ ρ u dv = ∫ ρ β dv + ∫ s da. dt Pt Pt ∂Pt We let u = 21 v2 + ε,

β = b · v + r,

s = v · Tn + h

and so ε is the energy density (per unit mass), r is the power supply, and h is the power per unit area, all of them in addition to the corresponding mechanical term ( 21 v2 , b · v, v · Tn). Based on these arguments, we assume the existence of fields ε, r, h and state the following Axiom. The time evolution of energy of a continuous body is governed by the equation d ∫ ρ( 1 v2 + ε)dv = ∫ ρ(b · v + r )dv + ∫ (v · Tn + h)da, dt Pt 2 Pt ∂Pt

(2.19)

for any convecting region Pt . This axiom can be viewed as the first principle of thermodynamics; see, e.g. [111]. As we show shortly, the first law of thermodynamics is a consequence of the axiom (2.19). We first observe that (2.19) involves both volume and surface integrals. On the basis of Cauchy’s theorem, we expect that the function h be subject to an appropriate requirement. Yet it is convenient to simplify the form of (2.19) by means of Reynolds’ theorem and the equation of motion. From d ∫ ρ( 1 v2 + ε)dv − ∫ ρ(b · v + r )dv − ∫ v · Tn da = ∫ h da dt Pt 2 Pt ∂Pt ∂Pt it follows ∫ (ρε˙ − ρr − T · D)dv = ∫ h da.

Pt

∂Pt

(2.20)

If ρ(ε˙ − r ) − T · D is bounded and h(x, n, t) is continuous, then by arguing as in Sect. 2.2, we can state the necessary condition

88

2 Balance Equations

1 volPt ∫ h da → 0 as → 0. area ∂Pt ∂Pt area ∂Pt We can then develop the analogue of Cauchy’s theorem, step by step, to obtain h(x, n) = [



i h(x, ei )ei ]

· n.

This proves that there exists a vector function, say  q(x) = − i h(x, ei )ei , such that h = −q · n. Substitution of h in (2.20) and use of the divergence theorem yield ∫ (ρε˙ − ρr − T · D + ∇ · q)dv = 0,

Pt

whence ρ ε˙ = ρr + T · D − ∇ · q.

(2.21)

As we will see in the next chapters, the balance of energy comprises further terms (powers) if e.g. electromagnetic fields occur or the body has a non-simple character (internal structure).

2.4.1 First Law of Thermodynamics After the mathematical derivation of (2.21), it is natural to inquiry about the physical relevance. For any subregion Pt , integration of (2.21) and use of Reynolds’ theorem yield d ∫ ρ ε dv = ∫ (ρ r + T · D)dv − ∫ q · n da. (2.22) dt Pt Pt ∂Pt Again we have a balance law where the time derivative of a set function is given by a surface integral and a volume integral. Now let U (Pt ) = ∫ ρ ε dv, Pt

W(Pt ) = ∫ T · D dv, Pt

Q(Pt ) = ∫ ρ r dv − ∫ q · n da. Pt

∂ Pt

The same equation, in the form d U (Pt ) = W(Pt ) + Q(Pt ), dt

(2.23)

2.4 Balance of Energy

89

allows us to view W as a power of mechanical character and Q as a power of nonmechanical character. Hence, T · D is the power density of mechanical character that contributes to the rate of energy dU (Pt )/dt. In the purely-mechanical balance (2.18), the power density T · D could not contribute to any balance law. The power Q(Pt ) is the heat flow ; the model involves a heat flux vector q and a scalar heat supply ρr . The occurrence of the − sign in front of the surface integral arises merely by the definition of q. By (2.23), with r = 0 and D = 0 (rigid motions), we have d U (Pt ) = − ∫ q · n da. dt ∂Pt If q · n < 0 then dU (Pt )/dt > 0 and vice versa. Since n is the (unit) outward normal to ∂Pt then q · n < 0 when q points into Pt . This allows q to be viewed as a flux of energy (heat flux); if q points into Pt then dU (Pt )/dt > 0 and the energy U (Pt ) increases. As to the mechanical power W, it is worth considering the simplest case when T = − p 1,

p > 0,

which happens in inviscid fluids. Then, in view of (2.4), ρ˙ T · D = −p ∇ · v = p . ρ If, further, ρ is uniform (independent of the position) then ρ = M/V , M being the mass and V being the volume of Pt . Since M is constant, then ρ˙ V˙ =− , ρ V

T · D = − p V˙ .

If p is uniform too, then ∫ T · D dv = V T · D = − p V˙ .

Pt

Hence, (2.23) becomes d U (Pt ) = − p V˙ + Q(Pt ). dt In the well-known terminology of classical physics, Eq. (2.23) is the first law of thermodynamics . In the continuum physics framework, Eqs. (2.23) and (2.21) are the global and local forms of the first law. In the physical literature, the formal statement of the first law is a bit more involved. For instance, in [449], Chap. 4, for “a process involving only infinitesimal changes” the first law is written in the form

90

2 Balance Equations

dU = d¯Q − dW, ¯ (or d E = δ Q − δW in [316], Sect. 2.5), the use of d¯ instead of d being due to the fact that the infinitesimal amount is an inexact differential [449] or because Q and W are not properties [316]. If “the infinitesimal process is quasi-static”, then dW ¯ = p dV , which is consistent with − p V˙ as the mechanical power if p is uniform. In both cases, the − sign in front of dW ¯ is due to the view that dW ¯ is energy transferred out of the system.

External and Internal Powers The literature shows balances of energy in terms of external and internal powers. To illustrate the concept, we go back to the balance of kinetic energy and observe that, in view of the equation of motion d K (Pt ) = ∫ v · Tn da − ∫ T · D dv + ∫ ρ b · v dv. dt Pt Pt Pt

(2.24)

The field T · D is said to be the power expended within Pt and then the integral is referred to as the internal power. What we denoted by (Pt ) instead is regarded as external power. Hence, upon the form3 d ∫ v · Tn da + ∫ ρb · v dv = ∫ T · D dv + ∫ 1 ρv2 dv dt Pt 2 Pt Pt ∂Pt       internal power

external power

kinetic−energy rate

of the balance of (kinetic) energy, the view is stated that the external power expended on Pt is balanced by the sum of the internal power expended within Pt and the time derivative of the kinetic energy of Pt .

2.4.2 Balance of Virtual Power The literature exhibits a remarkable attention to the use of principles of virtual power in various contexts.4 Here we show a motivation of the principle based on the balance of kinetic energy. The balance of kinetic energy (2.24) can be written in the form ∫ v · Tn da + ∫ ρv · (b − v˙ )dv = ∫ T · D dv.

∂Pt

3 4

Pt

Pt

See, e.g., [216], p. 143. See, e.g. the book [189] and the review paper [302]; see also [367].

2.4 Balance of Energy

91

The left-hand side is viewed as the external power, while the right-hand side as the internal power. Now drop the suffix t to Pt in that the assumption of convecting region is here inessential. Moreover we write the (internal) power T · D as T · L, which corresponds to ignoring the symmetry condition T ∈ Sym. Further, we ignore Cauchy’s relation t(n) = Tn. Hence, we consider the power balance ∫ v · t(n) da + ∫ ρv · (b − v˙ )dv = ∫ T · L dv. P

∂P

P

Now, instead of the effective fields v and L, we insert the virtual velocity v˜ , regarded as an arbitrary vector field; v˜ need not coincide with the velocity field, though it may, and need not have the dimension of a velocity. Hence, let W(P, v˜ ) := ∫ v˜ · t(n) da + ∫ ρ˜v · (b − v˙ )dv, P

∂P

I(P, v˜ ) := ∫ T · ∇ v˜ dv. P

be the virtual expenditures of external and internal power. Principle of virtual power. For any time t and region P of the body the virtual power balance W(P, v˜ ) = I(P, v˜ ), holds for any arbitrary function v˜ (x), x ∈ P. We now show the consequences of the principle. Observe that ∫ T · ∇ v˜ dv = ∫ v˜ · Tn da − ∫ (∇ · T) · v˜ dv.

P

P

∂P

Hence W(P, v˜ ) = I(P, v˜ ) results in ∫ (∇ · T + ρb − ρ˙v) · v˜ dv + ∫ (t(n) − Tn) · v˜ da = 0.

P

∂P

By the arbitrariness of v˜ , we first let v˜ = 0 on the boundary ∂P. Consequently ∫ (∇ · T + ρb − ρ˙v) · v˜ dv = 0.

P

Now, for any x ∈ P \ ∂P, we let v˜ = α(y)[∇ · T + ρb − ρ˙v], α being positive in an open neighbourhood N of x, N ⊂ P, and equal to zero outside. It follows ∇ · T + ρb − ρ˙v = 0 as y ∈ N . The arbitrariness of x implies that ∇ · T + ρb − ρ˙v = 0 in P. Thus

∫ (t(n) − Tn) · v˜ da = 0.

∂P

92

2 Balance Equations

The arbitrariness of v˜ implies that t(n) = Tn on ∂P. So, the principle implies the equation of motion and Cauchy’s relation ∇ · T + ρb − ρ˙v = 0,

t(n) = Tn.

Let x˜ be any motion, x˙˜ = v˜ . By a Euclidean transformation, x˜ ∗ and v˜ ∗ become x˜ ∗ = c + Q˜x,

˙ x + Q˜v. v˜ ∗ = c˙ + Q˜

The corresponding velocity gradients are related by ˜ T + , L˜ ∗ = QLQ ˙ T ∈ Skw. Since where  = QQ det ∂x∗ x = det QT = 1, the invariance of I(P, v˜ ) under Euclidean transformations results in ˜ T + ) = QT T∗ Q · L˜ + T∗ · . T · L˜ = T∗ · (QLQ First let Q be constant so that  = 0. Hence, the invariance of I(P, v˜ ) results in (T − QT T∗ Q) · L˜ = 0. The arbitrariness of L˜ implies that T∗ = QTQT for all rotations Q; consequently T is a (objective) tensor. The remaining condition T∗ ·  = 0 for any skew tensor  implies that T∗ , and hence T, is symmetric. To sum up, the principle of virtual power, that is W(P, v˜ ) = I(P, v˜ ) for any subregion P and any field v˜ , implies Cauchy’s relation t(n) = Tn, the (local) equation of motion ρ˙v = ∇ · T + ρb, the symmetry of T, and the (objective) tensor character of the stress, T∗ = QTQT , for any rotation Q. The fact that the principle of virtual power yields eventually the boundary condition (t(n) = Tn), the local balance equation, the symmetry and the objective character of T might suggest the recourse to this principle, especially in connection with more involved or non-classical settings. Yet a question arises about the consistent statement of a principle of virtual power. It seems then that, for a correct modelling of material behaviour, a thorough analysis of the balance equations is unavoidable.

2.5 Material Forms of the Balance Equations

93

2.5 Material Forms of the Balance Equations The balance laws considered so far are expressed in the form d ∫ ρϕ dv = ∫ β dv + ∫ s da, dt Pt Pt ∂Pt where ϕ is a density (per unit mass) while β and s represent the density fields, per unit volume and unit area, that determine the evolution of ϕ. As in Cauchy’s theorem, we can prove that s has to be linear in the unit normal n to the boundary; let h be such that s = h · n. Hence d ∫ ρϕ dv = ∫ β dv + ∫ h · n da. dt Pt Pt ∂Pt

(2.25)

If ϕ is a scalar (vector), then β is a scalar (vector) and h is a vector (tensor). This is the generic form of a balance law in the spatial (or Eulerian) description in that it involves functions ϕ(x, t), where x ∈ Pt ⊆ Rt ⊂ E . Here we establish the dual form of balance equations in the material (or Lagrangian or referential) description that is the corresponding law relative to P = χ−1 (Pt ) ⊆ R. The key point is the transformation of integrals induced by the change of variable x → X = χ−1 (x, t). As shown in Sect. 1.2.1 ∫ g n da = ∫ g J F−T n R da R ,

St

S

where St = χ(S, t). Hence, by the change of variable x → X = χ−1 (x, t) the generic balance equation (2.25) can be written in the form d ∫ ρ R ϕ dv R = ∫ J β dv R + ∫ h · J F−T n R da R , dt P P ∂P where ρ R = J ρ. It is advantageous in calculations that ρ R = ρ R (X). Observe that h · F−T n R = h F−T · n R and let

h R := J hF−T .

(2.26)

Hence the divergence theorem, the arbitrariness of P and the assumed continuity of the integrands imply ρ R ϕ˙ = J β + ∇R · h R . Thus, there is the following 1–1 correspondence between balance equations in the spatial and the material descriptions

94

2 Balance Equations

ρϕ˙ = β + ∇ · h

⇐⇒

ρ R ϕ˙ = J β + ∇R · h R ,

(2.27)

h, h R being related by (2.26). The correspondence (2.27) follows also directly multiplying ρϕ˙ = β + ∇ · h by J and observing that ρJ = ρ R and −1 −1 ∂ X K h KR = h i ∂ X K (J FK−1 i ) + J FK i (∂x p h i )F p K = h i ∂ X K (J FK i ) + J ∂xi h i .

By (1.11), ∂ X K (J FK−1 i ) = 0 and hence (2.27) follows. The equation of motion is obtained by letting ϕ = v,

β = ρb,

h = T.

By (2.26) and (2.27), in the material formulation, we have ρ R v˙ = ρ R b + ∇R · T R ,

(2.28)

where, consistently with (1.8), T R := J T F−T . The tensor T R denotes the contact force, per unit area, in the reference configuration and is referred to as the first Piola–Kirchhoff stress tensor ([428], Sect. 43A) or merely Piola stress ([216], Sect. 24.1). The suffix R emphasizes the meaning associated with the reference configuration.5 The Piola stress T R is not in general a symmetric tensor. Indeed, since T = J −1 T R FT , the symmetry of the Cauchy stress T implies that T R FT = F TTR . Look now at the balance of energy in the form (2.22) d ∫ ρ ε dv = ∫ (ρr + T · D)dv − ∫ q · n da. dt Pt Pt Pt so that ϕ = ε, Letting

5

β = ρr + T · D,

h = −q.

q R = J q F−T

The symbols S and P are also used in the literature for the Piola stress. The Piola stress is also called nominal stress or engineering stress.

2.5 Material Forms of the Balance Equations

95

we obtain the balance of energy in the material form ρ R ε˙ = ρ R r + J T · D − ∇ R · q R . The vector q R denotes the heat flux, per unit area, in the reference configuration. The quantity J T · D is the mechanical power, per unit volume in the reference configuration. It is then natural to look for a representation involving the Piola stress T R rather than the Cauchy stress T. Now, by obvious identities, we can write ˙ −1 + F−T F˙ T ) J T · D = J 21 T · (L + LT ) = J 21 T · (FF = J 1 T · F−T (FT F˙ + F˙ T F)F−1 = 1 (J F−1 TF−T ) · (FT F˙ + F˙ T F) 2

2

This suggests that we define the stress tensor T R R := J F−1 TF−T . Since

˙ FT F˙ + F˙ T F = C

we can write

˙ J T · D = 21 T R R · C.

Hence, the local form of the balance of energy becomes ˙ − ∇R · qR . ρ R ε˙ = ρ R r + 21 T R R · C

(2.29)

The symmetry of T implies that of T R R , TTR R = (J F−1 TF−T )T = J F−1 TT F−T = J F−1 TF−T = T R R . The tensor T R R is called second Piola stress [216] or second Piola-Kirchhoff stress [428]. The Kirchhoff stress tensor T K is defined as T K = J T. One reason for its use is that the stress power in the reference configuration is merely T K · D. A further form of the mechanical power follows by observing that 1 2

Hence

˙ −1 + F−T F˙ T ) = 1 J (TF−T + TT F−T ) · F˙ = J TF−T · F. ˙ J T · (FF 2 J T · D = T R · F˙

and T R · F˙ is the sought representation of the mechanical power, which involves the first Piola stress T R . Therefore, we can also write the balance of energy in the form ρ R ε˙ = ρ R r + T R · F˙ − ∇ R · q R .

(2.30)

96

2 Balance Equations

It is worth remarking that if T is non-symmetric, then ˙ ˙ −1 ) = (TF−T ) · F˙ = J −1 T R · F. T · L = T · (FF Piola stresses enjoy interesting properties in connection with the objective time derivatives. Consider the Piola stress T R = J TF−T and observe that F−T FT = 1, Hence, we find that

˙ F−T = −F−T F˙ T F−T ,

F˙ T = FT LT .

T˙ R = J (T˙ − TLT + (∇ · v)T)F−T .

Likewise, since F−1 F = 1,

˙ ˙ −1 , F−1 = −F−1 FF

F˙ = LF,

then time differentiation of the second Piola stress T R R = J F−1 TF−T results in T˙ R R = J F−1 (T˙ − LT − TLT + (∇ · v)T)F−T . We then observe that



T˙ R R = J F−1 T F−T ; the operator mapping T R R to T maps T˙ R R to the Truesdell derivative of the Cauchy 



stress, T. In addition, we remark that T= ∂t T + Lv J T, Lv being the Lie derivative of the contravariant components of J T relative to the velocity v. The interpretation of T˙ R is less direct. Observe that, if T is symmetric (TLT )i j = L jk Tki . If we replace T with a vector u, then uLT = Lu. Consequently, T˙ − TLT + (∇ · v)T is the Truesdell rate of T if T is a vector. This is not surprising in that T∗R = QT R under a Euclidean transformation. It is worth summarizing some properties of the power of the stress. In the Eulerian description, the power π, per unit volume, is π = T · L; if T = TT , then T · L = T · D. In the material description, the power π R , per unit volume in the reference configuration, is

2.5 Material Forms of the Balance Equations

97

˙ π R = J π = T R · F. If T = TT , then π = T · L = T · D,

˙ = T R R · E. ˙ π R = T R · F˙ = 21 T R R · C

Sometimes π occurs in the literature as T · E˙ or T · ε˙ . Both expressions are linear approximations of π.

2.5.1 Equation of Motion in Pre-Stressed Materials Consider a body occupying a region R in the reference configuration. Under the ˜ Let T˜ R and ρ0 b˜ be the first action of a body force b˜ the body occupies a region R. Piola stress and the body force density such that the body is at equilibrium ∇X · T˜ R + ρ0 b˜ = 0. Because of a different body force b or because of a superimposed motion, the body occupies the time-dependent region Rt ; as in Fig. 2.4, the point X occupies the position x˜ (X) in R˜ and x(X, t) in Rt . The equation of motion takes the form ρ0 v˙ = ∇X · T R + ρ0 b. At the same point X, the difference yields ˜ ρ0 v˙ = ∇X · T R + ρ0 (b − b),

T R := T R − T˜ R ;

(2.31)

Fig. 2.4 The reference configuration R is mapped into a pre-stressed configuration R˜ and next into the current configuration Rt

98

2 Balance Equations

the motion is determined by the variation T R of the Piola stress and that of the body ˜ and let ρ0 b˜ be the force per unit force. For simplicity, ρ0 is used both in Rt and R ˜ volume applied in R. ˜ F the We now look for the Eulerian version of the equation of motion. Denote by F, ˜ H the displacement gradients relative to the defordeformation gradients and by H, ˜ J = det F. The corresponding ˜ R → Rt . Moreover, let J˜ = det F, mations R → R, ˜ T in R, ˜ Rt are given by Cauchy stresses T, 1 T˜ = T˜ R F˜ T , J˜

T=

1 T R FT . J

Denote by F and H the deformation gradient and the displacement gradient associated with R˜ → Rt . Also, let J = det F . Hence ˜ F = 1 + H , J = det(1 + H ) = 1 + tr H + o(|H H |). F = F F, It follows that T=

1 ˜ T 1 ˜ F + H |) (T˜ R + T R )F˜ T F T = TF (T R + T R )F˜ T F T + T + o(|H J J J˜ J J˜ 1

T R F˜ T F T . Since where T = (1/J J˜)T 1 ˜ T ˜ F T − T∇ ˜ · H + o(|H F = T˜ + TF H |) TF J then

˜ = TF ˜ F T − T∇ ˜ · H + T + o(|H H |). T−T

The variation T − T˜ of the Cauchy stress T consists of two terms; T is the Eulerian ˜ F T − T∇ ˜ · H is the linear analogue of the variation T R of the Piola stress while TF ˜FT − ˜ approximation of the variation T − T induced by the pre-stress T˜ R . While TF ˜ T∇ · H is due only to deformation, T depends on the constitutive properties because so does T R . We now consider the equation of motion (2.31). First we observe that J −1 ∂ X K T Ri K = J −1 ∂x j T Ri K )∂ X K x j = ∂x j (J −1 T Ri K F j K ) where the identity (1.11)2 has been used. Since J −1T R FT = T and ρ0 /J = ρ, we can write (2.31) in the form ˜ ρ˙v = ∇x · T + ρ(b − b). This shows that, in the Eulerian formulation, the effective stress governing the motion is T , the transform of T R = T R − T˜ R . If, for definiteness, we let T R be linear and

2.5 Material Forms of the Balance Equations

99

T Ri K = Ai K lα ∂x˜α u l = (Ai K lα F jα )∂x j u l Fh K then we have 1 T R FT )i j = Bi hl j ∂x j u l , (T J

Bi hl j = Ai K lα F jα Fh K .

To the linear order in u, we find the equation of motion in the form ρ∂t2 u i = Bi hl j ∂xh ∂x j u l + (∂xh Bi hl j )∂x j u l + u j ∂x j bi .

2.5.2 Lie Derivative of the Cauchy Stress The Lie derivative of the stress T is also defined by ˙ LT = J −1 F J F−1 TF−T FT , J F−1 TF−T being the second Piola stress T R R . Since ˙ ˙ −1 T − TF−T F˙ T + (∇ · v)T)F−T J F−1 TF−T = J F−1 (T˙ − FF then

LT = T˙ − LT − TLT + (∇ · v)T;

accordingly, the Lie derivative of the Cauchy stress coincides with the Truesdell derivative. We now look at deformations subject to the identity stretch, U = 1. With reference to the polar decomposition, we let F = R,

U = 1.

Then J = det F = det R = 1. The derivative is given by ˙ LT = R R T TR R T . Since ˙ ˙ + R T TR ˙ + R T TR ˙ T TR + R T TR ˙ = RT R R ˙ T TR + R T TR ˙ R T R, R T TR = R ˙ T = −ZT we find by means of Z = RR LT = T˙ − ZT − TZT , that is the Green–Naghdi derivative.

100

2 Balance Equations

2.5.3 Eshelby Stress and Linear-Angular Momentum By the equation of motion in the (material) form (2.28), right multiplication by F results in ρ R v˙ F = ρ R bF + (∇R · T R )F. In view of the identities ˙ ρ R v˙ F = (ρ R vF)˙− ρ R vF,

ρ R vF˙ = ρ R ∇R ( 21 v2 ),

(∇R · T R )F = ∇R · (FT T R ) − T R ∇R F, we obtain6 (ρ R vF)˙ = ∇R · (FT T R ) + ρ R ∇R ( 21 v2 ) − T R ∇R F + ρ R bF. If, as in hyperelastic solids, T R is given by an energy function, say ψ, such that T R = ρ R ∂F ψ, then T R ∇R F = ρ R ∇R ψ. The balance of linear momentum then can be written in the form (ρ R vF)˙ = ∇R · (FT T R ) + ρ R ∇R L + ρ R bF,

(2.32)

where L = 21 v2 − ψ. Equation (2.32) can be viewed as an equation of motion where ρ R vF is the linear momentum and FT T R is the stress tensor. The effective body force comprises two terms, ρ R ∇R L arising from the energy per unit mass L, ρ R bF being the body force. The stress FT T R is similar to the second Piola stress (with FT in place of F−1 ). By (2.32), we can view Teff = FT T R + ρ R L 1 as the effective stress. Correspondingly the vector ρ R vF is called the material linear momentum. The definition of Teff as it stands is consistent if ρ R is constant. Sometimes definitions are given with differences of the signs for Teff and, consistently, for the linear momentum. 6

In component form, (T R ∇R F) Q = Th K ∂ X K Fh Q = Th K ∂ X Q Fh K .

2.5 Material Forms of the Balance Equations

101

Often the effective stress is employed in static conditions in which case Teff = FT T R − ρ R ψ 1 and (2.32) becomes an equilibrium condition; if b = 0 then (2.32) reduces to ∇R · (FT T R − ρ R ψ) = 0. This in turn shows why usually the stress T E := ρ R ψ1 − FT T R

(2.33)

is involved; owing to the decisive contributions of Eshelby to the subject [153, 154] the tensor (2.33) is usually referred to as Eshelby (material) stress. The original motivation and main applications of the Eshelby stress are within equilibrium problems. Concerning the motivation [154], let S be the boundary of a region in the current configuration. The vanishing of the couple on the material inside S is expressed by ∫ r × Tn da = 0, S

r being the position vector relative to a fixed base point. In the reference configuration, with S = f −1 (S), we have ∫ r × T R n R da R = 0. S

Since (r × T R n R ) p = pl j rl T jRK n KR it follows

pl j ∫ rl T jRK n KR da R = 0, S

p = 1, 2, 3,

whence ∫(rl T jRK − r j Tl RK )n KR da R = 0, S

l, j = 1, 2, 3.

Let r = d + x = d + X + u, d being any constant vector. By applying the divergence theorem and observing that ∇R · T R = 0 as a condition for equilibrium, it follows from ∫ ∂ X K (rl T jRK − r j Tl RK )dv R = 0, P

l, j = 1, 2, 3,

102

2 Balance Equations

that T R − TTR = HTTR − T R HT or, equivalently, FTTR = T R FT . This conclusion is not surprising. Since T R FT = J −1 TF−T FT = J −1 T, we have found the known result that the vanishing of the couple implies the symmetry of the Cauchy stress. Suppose we look for a function W (u, ∇R u, X) making F = ∫ W (u, ∇R u, X)dv R P

stationary. Then we consider the vector function u + αg on P and require that ∂α ∫ W (u + αg, ∇R (u + αg), X)dv R = 0 P

subject to g = 0 on ∂P. Since ∂α W = ∂u W · g + ∂∇R u W · ∇R g = ∂u W − ∇R · ∂∇R u W ) · g + ∇R · (∂∇R u W g) and g = 0 on ∂P the continuity of the integrand allows us to say that F is stationary at α = 0 if and only if (2.34) ∂u W − ∇R · ∂∇R u W = 0. We now assume the displacement function u(X) satisfies the differential equation (2.34). To compute the gradient ∇R W , we observe that (∇R W )L = ∂u i W Hi L + ∂ Hi K W ∂ X L Hi K + ∂ X L W, where ∂ X L W denotes the partial derivative. Since ∂u i = ∂ X K ∂ Hi K W , then it follows (∇R W )L = ∂ X K (HLTi ∂ Hi K W ) + ∂ X L W. Since ∇ R W = ∇ R · (W 1), letting7 T E = W 1 − H T ∂H W

(2.35)

we find that ∇ R · T E = ∂X W. In homogeneous bodies, ∂X W = 0, and hence the stationary condition is 7

The literature shows the use of both (2.33), with F, and (2.35), with H, for the Eshelby tensor.

2.6 Balance of Entropy

103

∇ R · T E = 0.

(2.36)

The formal structure of T E , with W viewed as a Lagrangian, justifies the name of energy-momentum tensor given to T E by Eshelby [154]. Despite the rather qualitative introduction of T E , applications of T E are essentially based on the property (2.36). The integral F = ∫ T E n R da R S

is zero when taken over a closed surface S within which the material is homogeneous. If defects (or inhomogeneities) occur inside S, then the integral should be nonzero. Moreover, if S1 , S2 are two closed surfaces with, e.g. S2 inside S1 and the region between S2 and S1 is homogeneous, then we conclude that ∫ T E n R da R = ∫ T E n R da R

S1

S2

and say that the integral is path- (surface-) independent. Accordingly, the vector F is said to be the force on the defect, which means a measure of the inhomogeneity.

2.6 Balance of Entropy It is assumed that there exists the absolute temperature field θ(x, t), for any x ∈ Rt ⊂ E and t ∈ R. The values of θ are strictly positive. In the thermodynamics of homogeneous processes, it is assumed that there exists a function S(t) such that the Clausius–Planck inequality θ S˙ ≥ Q holds where Q is the (positive or negative) heat transferred to the body per unit time. Consequently Q = 0 ⇒ S˙ ≥ 0, that is, in isolated bodies the entropy cannot decrease. Since we are dealing with non-homogeneous processes, we let the pertinent variables be functions of the position x and time t. Assume that there exists a specific entropy, per unit mass, η such that the entropy S(Pt ) of any subregion Pt is given by S(Pt ) = ∫ ρ η dv. Pt

Hence, the Clausius–Planck inequality may suggest that

104

2 Balance Equations

d q ρr ∫ ρ η dv ≥ − ∫ · n da + ∫ dv. dt Pt Pt θ ∂Pt θ

(2.37)

Inequality (2.37) is called the Clausius-Duhem inequality. Computation of the time derivative, the continuity of the functions ρη, ˙ ρr/θ, and ∇ · (q/θ), and the arbitrariness of Pt yield the local Clausius–Duhem inequality ρη˙ ≥ −∇ ·

q ρr + . θ θ

(2.38)

A thermodynamic process is described by the set of functions occurring in the balance equations and in the Clausius–Duhem inequality. By way of example, if no other fields occur, a thermodynamic process is given by the functions ρ, v, T, b, ε, q, r, η, θ on Rt × R. Such a set of functions is called a thermodynamic process if it is compatible with the balance equations of mass, momentum, and energy. As to (2.37) and (2.38), we might say that q/θ is the entropy flux and ρr/θ is the entropy supply. A more general statement of the second law is adopted correctly by Müller [337] by allowing the entropy flux, j say, to be an unknown function which thus enters the set of the thermodynamic process. The Clausius–Duhem inequality is then generalized by letting j be an unknown vector function to be determined so that the inequality ρr ≥ 0, (2.39) ρη˙ + ∇ · j − θ holds. Hence, a thermodynamic process is said to be admissible if it is compatible with (2.39), (see Chap. 3). Inequality (2.39) allows the balance of entropy to be stated in a form that we regard as the second law of thermodynamics. Second law of thermodynamics. For every admissible thermodynamic process, inequality (2.39) must hold for all times t and points x. Since the compatibility with the balance equations is satisfied, by definition, by any thermodynamic process, the requirement of the second law places restrictions on the effective evolutions and hence selects those which are physically admissible. There are cases where we can prove that j equals q/θ. Instead, in more involved models, e.g. when higher-order derivatives occur, j need not equal q/θ. It is convenient to let q k =j− , θ k being termed extra-entropy flux. Hence, (2.39) takes the form 1 1 ρη˙ + (∇ · q − ρr ) − 2 q · ∇θ + ∇ · k ≥ 0. θ θ In terms of the Helmholtz free energy ψ = ε − θη, we obtain

2.6 Balance of Entropy

105

1 − ρψ˙ − ρη θ˙ + T · D − q · ∇θ + θ∇ · k ≥ 0. θ

(2.40)

This is the form of the second law inequality8 we apply whenever the balance of energy (2.21) holds. The view that j need not equal q/θ generalizes the local form of the second law for the body under consideration. Yet we may argue that the entropy flux j enters the second law inequality in the divergence form. This allows us to say that the additive flux k is physically admissible provided the net contribution to the whole body, in the region  ∫ ∇ · k dv = ∫ k · n da 

∂

vanishes. That is why it is assumed ∫ k · n da = 0.

∂

By (2.38), we have q r 1 η˙ ≥ − ∇ · + . ρ θ θ Integration, at a fixed point X, on [t0 , t1 ] results in t1

q r 1 η(t1 ) − η(t0 ) ≥ ∫ − ∇ · + dt. ρ θ θ t0

In a cyclic process, the final values equal the initial values. In particular η(t1 ) = η(t0 ). Therefore, in any cyclic process at a point X the inequality t1

q r 1 ∫ − ∇ · + dt ≤ 0 ρ θ θ t0

holds. This inequality is sometimes referred to as Clausius inequality. The conceptual advantage of this statement is that entropy is not used as a primitive concept or is not assumed to exist from the outset [115]. The conceptual role of the second law, as a restriction on the admissible thermodynamic processes, is maintained in the following more general setting. Consider the inequality (2.39), or possibly (2.38), and define the entropy production σ, per unit volume and unit time, as 0 ≤ ρη˙ + ∇ · j −

8

ρr = σ. θ

(2.41)

Also referred to as Clausius–Duhem inequality or entropy inequality or local dissipation inequality.

106

2 Balance Equations

The previous statement of the second law can then be expressed as follows: For every admissible thermodynamic process, the entropy production must be non-negative for all times t and points x. Owing to the definition of the entropy production, the two statements look fully equivalent. The statement in terms of the entropy production might become more general as follows. Generalized second law of thermodynamics. The entropy production σ is a nonnegative valued constitutive function for every admissible thermodynamic process for all times t and points x. The constitutive dependence of the entropy production may happen in two ways. First σ coincides with what follows from the definition. As a remarkable example, in connection with Newtonian fluids, see Sect. 5.6, where the dissipative stress Tdis and the heat flux q are assumed in the form Tdis = 2μD0 + ζ(tr D)1, q = −κ∇θ, with μ, ζ, κ > 0, it follows that 1 θσ = 2μD0 · D0 + ζ(tr D)2 − κ∇θ · ∇θ. θ Secondly, σ is assumed to have a constitutive dependence per se subject to being positive valued and satisfying the equality in (2.41). This view proves profitable in the modelling of hysteresis such as in ferroelectric, ferromagnetic, and plastic materials. In [427], lecture 2, the internal dissipation δ is considered which amounts to letting δ = θσ/ρ. The requirement δ≥0 is referred to as the Planck inequality. Hence, the assumption σ ≥ 0 is equivalent to the Planck inequality. Remark 2.1 The statement of the second law in terms of the (specific) entropy production (γ = σ/ρ) traces back to Coleman and Noll [93], though Planck inequality involves the entropy production via the internal dissipation δ. The definition of the entropy production σ allows us to write ρη˙ + ∇ · j −

ρr − σ = 0. θ

This has motivated the writing of the balance of entropy as an equality (see, e.g. [209]). It is worth emphasizing that ignoring the non-negative character of σ may lead to unphysical constitutive equations. Remark 2.2 The thermodynamic scheme degenerates when the constitutive properties are independent of the temperature. In this case, the classical relation between entropy and Helmholtz free energy does not hold and a consistent scheme takes r

2.6 Balance of Entropy

107

and q to be zero. This view is developed in Sect. 5 in connection with the elastic fluid. We now go back to (2.40). Integration over any convecting region Pt gives −

d 1 ∫ ρψ dv − ∫ (ρη θ˙ + q · ∇θ)dv + ∫ T · D dv + ∫ θ∇ · k ≥ 0. dt Pt θ Pt Pt Pt

By (2.24), we can replace the integral of T · D to obtain ∫ v · Tn da + ∫ ρb · v dv −

∂ Pt

Pt

d 1 ∫ ρ(ψ + 21 v2 )dv − ∫ (ρη θ˙ + q · ∇θ − θ∇ · k)dv ≥ 0. dt Pt θ Pt

An approximation is sometimes used in the literature by assuming that thermal influences be negligible. Apart from the contribution of the extra-entropy flux k, very often disregarded, the last integral is assumed to vanish. Hence, it follows the free-energy imbalance ∫ v · Tn da + ∫ ρb · v dv −

∂Pt

Pt

d ∫ ρ(ψ + 21 v2 )dv ≥ 0, dt Pt

the left-hand side being viewed as the dissipation; in words, the net free energy and kinetic energy d ∫ ρ(ψ + 21 v2 )dv/dt is balanced by the conventional expended Pt power ∂Pt v · Tn da + Pt ρb · v dv plus the dissipation ([216], Chap. 29).

2.6.1 Material Formulation of the Second Law As with any balance equation, we can state the second law of thermodynamics in the material description. Let P be the inverse image of the region Pt , P = χ−1 (Pt , t), convecting with the body. By Nanson’s formula, inequality (2.37) can be given the form q d ∫ ρ R η dv R ≥ − ∫ J · F−T n R da R + ∫ ρ R r dv R , dt P θ P ∂P where ρ R = ρJ . In terms of the heat flux vector in the material description q R = J F−1 q, we can write

qR d ∫ ρ R η dv R ≥ − ∫ · n R da R + ∫ ρ R r dv R . dt P P ∂P θ

108

2 Balance Equations

The divergence theorem and the arbitrariness of P yield the local inequality ρ R η˙ ≥ −∇ R ·

ρR r qR + , θ θ

(2.42)

which can be viewed as the Clausius–Duhem inequality in the material description. Hence, the second law of thermodynamics may be stated by saying that for every admissible thermodynamic process, expressed by functions on R × R, the inequality (2.42) must hold for all times t and point X. This statement of the second law amounts to assuming that q/θ is the entropy flux in the spatial description and hence q R /θ, with q R = J F−1 q, is the entropy flux in the material description. The analogue of Müller’s assumption about the second law would be to let j R , with j R = J F−1 j, be an unknown vector function, of X and t, such that ρ R η˙ ≥ −∇R · j R +

ρR r θ

holds for every thermodynamic process. As with the spatial description, it is convenient to let jR =

qR + kR θ

and to regard k R as the extra-entropy flux. Upon substitution of j R , the entropy inequality can be written in the form 1 ρ R θη˙ ≥ ρ R r − ∇R · q R + q R · ∇R θ − θ∇R · k R . θ Substitution for ρ R r − ∇R · q R from the balance of energy (2.29) and use of the free energy ψ = ε − θη yield the material form of the Clausius–Duhem inequality, ˙ + 1 TR R · C ˙ − 1 q R · ∇R θ + θ∇R · k R ≥ 0. − ρ R (ψ˙ + η θ) 2 θ

(2.43)

For formal convenience, we consider the referential densities ε R = ρ R ε,

η R = ρ R η,

ψ R = ρ R ψ,

so that ε R , η R , and ψ R are the internal energy, the entropy, and the free energy per unit volume in the reference configuration. Moreover, let r R = ρ R r . Since ρ R is constant in time, the balance of energy and the Clausius–Duhem inequality are given the form ˙ − ∇R · qR + r R , ε˙R = 21 T R R · C ˙ + 1 TR R · C ˙ − 1 q R · ∇ R θ + θ∇ R · k R ≥ 0. − (ψ˙ R + η R θ) 2 θ

(2.44)

2.6 Balance of Entropy

109

If T is non-symmetric then the power T · L becomes ˙ ˙ −1 ) = (TF−T ) · F˙ = J −1 T R · F. T · L = T · (FF

2.6.2 Third Law of Thermodynamics The statement of the second law involves the entropy η only via the derivative η. ˙ Hence, we might compute the entropy to within a constant though the undetermined value of the constant might be of no consequence. Yet physical arguments motivate the way of determining the value of the constant in the computation of the entropy. This determination is given by Nernst’s principle or Nernst’s heat theorem or third law of thermodynamics. It is not really a theorem; we take it as an axiom in the form of Planck’s statement. Third law of thermodynamics. The entropy of a system is zero at absolute zero. This means that the entropy of a system depends only on the temperature as absolute zero is approached.

2.6.3 Exergy The balance of energy, or the first law of thermodynamics, is viewed as the conservation of energy; energy is conserved, it cannot be destroyed. Instead, as we see in a while, exergy is a quantity that is not conserved and depends on the process and the environment. ˆ Since ∇ · v = We start from the balance of energy (2.21) and let T = − p1 + T. −ρ/ρ ˙ = υ/υ ˙ we can write the balance of energy in the form ˆ · D. ρε˙ = ρ r − ∇ · q − ρ p υ˙ + T Substitution of ρ r − ∇ · q in (2.38) results in 1 Tˆ · D ≤ −ρ(ε˙ − θη˙ + p υ) ˙ − q · ∇θ. θ Neglect q · ∇θ/θ (no irreversibility). The left-hand side is the mechanical power and the inequality provides a bound for it. Upon integration over [t0 , t] × ,  being the region occupied by the body, we have t

t

t0 

t0 

∫ ∫ Tˆ · D dv dτ ≤ U (t) − U (t0 ) + ∫ ∫ ρ(θ η˙ − p υ)dv ˙ dτ ,

110

2 Balance Equations

where U =  ρ ε dv is the internal energy of the body. If, further, the pressure p and the temperature θ are kept constant, p = p0 , θ = θ0 ,then t

∫ ∫ Tˆ · D dv dτ ≤ −{U (t) − U (t0 ) − θ0 [S(t) − S(t0 )] + p0 [V (t) − V (t0 )]}, t0 

where S = ∫ ρ η dv and V = ∫ 1 dv are the entropy and the volume of the body. The 



work available is then bounded by the right-hand side. That is why the exergy E of the system is defined by [316] E = U (t) − U (t0 ) − θ0 [S(t) − S(t0 )] + p0 [V (t) − V (t0 )].

2.6.4 Entropy Production in Stochastic Kinetics In statistical mechanics, the time evolution of the probability density function, of the velocity of a particle, is governed by the Fokker–Planck equation, based on the view that particles are subject to force fields (drag forces) and random forces. The Fokker–Planck equation is only an approximation to the Boltzmann model [33]: the dynamics of a particle, subject to a (external) force field F(x) and a fluctuating force ξ(t), with amplitude a, is governed by the equation of motion m x¨ = −γ x˙ + F(x) + aξ(t), where γ is the frictional coefficient. Near equilibrium, the force γ x˙ is assumed to be predominant on the inertial force m x¨ and then the equation is considered in the (approximate) form γ x˙ = F(x) + aξ(t). This equation of motion is associated with the Fokker–Planck equation a2 F ∂t p = −∇ · ( p) + ( 2 p) γ 2γ for the probability density p. Letting D = a 2 /2γ 2 we can write the Smoluchowski equation 1 ∂t p = −∇ · J, J = −D∇ p + F p, γ where J plays the role of probability current density. If J is viewed as a diffusion flux then −D∇ p is the analogue of Fick’s model while F p/γ is an additional term due to the force field. If F is conservative, F = −∇U then J = −(D∇ p + p∇U/γ).

2.7 Second Law and Phase-Field Models

111

2.7 Second Law and Phase-Field Models Phase transitions describe the transformation of a substance from one phase to another, a phase denoting thermodynamic system with common physical states of matter. It is then necessary to identify at least one physical quantity which distinguishes the two phases. In the Landau theory, such a quantity is called an order parameter while quite often it is called a phase field. Denote the phase field by ϕ and assume it is a scalar. Phase transformations are modelled in detail in Chap. 16. Here, we outline the presumably far-reaching model of phase fields. Different approaches occur in the literature. Here we point out a procedure often referred to as Ginzburg theory. Now, by (2.40), it follows that at constant temperature, with no deformation and no heat conduction, ρψ˙ ≤ 0. Consequently d ∫ ρψdv ≤ 0. dt  The free energy is then a decreasing function of time. Suppose that a fixed constant temperature θ is maintained in the region  ∈ E and consider the free energy in the form of the Landau–Ginzburg functional (ϕ) = ∫[F(ϕ, θ) + 21 γ(ϕ, θ)|∇ϕ|2 ]dv, 

γ being a positive-valued function. The gradient term accounts for the interfacial energy; in multiphase systems the phase boundaries consist of small transition layers of finite nonzero thickness across which energy changes significantly. The functional  is assumed to have a minimum at equilibrium and hence ϕ is required to satisfy the Euler–Lagrange equation of ψ = F(ϕ, θ) + 21 γ(ϕ, θ)|∇ϕ|2 , δϕ ψ = 0, at any point x ∈ . Hence, we have ∂ϕ F + 21 ∂ϕ γ |∇ϕ|2 − ∇ · (γ∇ϕ) = 0 at the sought equilibrium configuration. An additional equation is needed to determine the evolution of ϕ and a rate-type equation seems to be the natural scheme. If ϕ represents a conserved quantity, like mass density of a phase, then the evolution has to satisfy an appropriate conservation law. That is why the literature distinguishes models for conserved or non-conserved phase fields. The (opposite of the) variational derivative δϕ ψ is interpreted as a generalized thermodynamic force that tends to decrease the total free energy. Hence, for nonconserving dynamics, a natural evolution equation is assumed by letting ∂t ϕ be proportional to −δϕ ψ so that

112

2 Balance Equations

∂t ϕ = k[∇ · (γ∇ϕ) − ∂ϕ F − 21 ∂ϕ γ |∇ϕ|2 ] k being a positive-valued function of ϕ and θ. This equation is referred to as Cahn– Allen equation [6]. Consequently, subject to the boundary condition n · ∇ϕ = 0 at x ∈ ∂, we find d = ∫ δϕ ψ∂t ϕdv = − ∫ k(δϕ ψ)2 dv ≤ 0 dt   and hence the free energy is proved to decrease in time. If instead ϕ is a conserved field, then we let d ∫ ϕ dv = 0. dt  Borrowing from the conservation of mass (or charge), we then assume that d ∫ ϕ dv = − ∫ J · n da dt P ∂P for any subregion P ⊆  and an appropriate flux J. Hence at any point x ∈  the continuity equation ∂t ϕ = −∇ · J is required to hold. If

˜ ϕ ψ, J = −k∇δ

k˜ being a function of ϕ and θ, then the evolution of ϕ is governed by ˜ ϕ ψ) = ∇ · {k∇[−∇(γ∇ϕ) ˜ ∂t ϕ = ∇ · (k∇δ + ∂ϕ F + 21 ∂ϕ γ|∇ϕ|2 ]}; if k˜ and γ are constants then this equation simplifies to the Cahn-Hilliard equation [74] 2 ˜ ∂t ϕ = k(∂ ϕ − γ ϕ). As the boundary condition, we assume that n · ∇ϕ = 0,

n · ∇δϕ ψ = 0

∀x ∈ ∂.

Hence, we obtain the conservation property d ˜ ϕ ψ)dv = ∫ kn ˜ · ∇δϕ ψ da = 0 ∫ ϕ dv = ∫ ∂t ϕ dv = ∫ ∇ · (k∇δ dt    ∂ and the decay property, d 2 ˜ ϕ ψ)dv = − ∫ k|∇δ ˜ ∫ ψ dv = ∫ δϕ ψ∂t ϕ dv = ∫ δϕ ψ∇ · (k∇δ ϕ ψ| dv ≤ 0. dt    

2.8 Bernoulli’s Law and Balance Equations for Fluids

113

2.8 Bernoulli’s Law and Balance Equations for Fluids Bernoulli’s law is a property of the equation of motion of fluids. Yet often arguments are given on the basis of particular properties of energy. To emphasize the effective properties of fluid motions, we now review some forms of the balance of energy and next we establish the general form of Bernoulli’s law. By the balance of energy in the form (2.19), with h = −q · n, Reynolds’ transport theorem and the divergence theorem result in ˙ ρ(ε + 21 v2 ) = ρb · v + ρr + ∇ · (Tv − q). Now for any function, f say, we employ the continuity equation (2.4) to obtain ρ f˙ = ρ∂t f + ρv · ∇ f = ∂t (ρ f ) + ∇ · (ρ f v) − f (∂t ρ + ∇ · (ρv)) = ∂t (ρ f ) + ∇ · (ρ f v).

Moreover let p = − 13 tr T so that T = T0 − p1 and p is the pressure. Consequently the balance of energy can be written in the form ∂t [ρ(ε + 21 v2 )] = −∇ · [ρ(ε + 21 v2 )v] + ρr − ∇ · q + ∇ · [(T0 − p1)v] + ρb · v (2.45) As is customary, let b = g, the acceleration gravity. Let z be the vertical coordinate being taken as positive upward. Hence g = −g∇z = −∇(gz) and ρg · v = −ρv · ∇(gz) = −∇ · (ρgzv) + gz∇ · (ρv). By the continuity equation (2.4), we can replace ∇ · (ρv) with −∂t ρ; hence letting ∂t z = 0, we conclude ρg · v = −∇ · (ρgzv) − ∂t (ρgz). Upon this condition, the balance of energy can be written in the form ∂t [ρ(ε + 21 v2 + gz)] = −∇ · [ρ(ε + 21 v2 + gz)v] + ρr − ∇ · q + ∇ · [(T0 − p1)v].

(2.46) On the basis of Eq. (2.46), ρ(ε + 21 v2 + gz) is often referred to as the total energy density. Another interpretation follows from (2.45) by observing that the power ∇ · (− pv) can be viewed as arising from a flux so that we can write the equivalent equation p 1 2 + 2 v )v] + ρr − ∇ · q + ∇ · (T0 v) + ρg · v. ρ (2.47) and ε + p/ρ is regarded as the enthalpy density, per unit mass. That is why sometimes ε + 21 v2 is said to be the energy density of the fluid, whereas ρ(ε + p/ρ + 21 v2 )v is viewed as the energy flux (see, e.g. [273], Chap. 1). ∂t [ρ(ε + 21 v2 )] = −∇ · [ρ(ε +

114

2 Balance Equations

2.8.1 Bernoulli’s Law Perhaps regarding the internal energy density ε as a function of the mass density ρ, and letting ρ be constant (incompressible fluids), often the view that ρ( 21 v2 + gz) is the effective energy is at the basis of the (heuristic) derivations of Bernoulli’s law. Yet Bernoulli’s law holds for compressible inviscid fluids, where T = − p 1, and the derivation has to be based on the equation of motion. Upon the observation that g = −g∇z, z being positive upward, we can write the equation of motion as ρ˙v = −∇ p − ρ∇(gz). Divide by ρ to have

1 ∂t v + (v · ∇)v = − ∇ p − ∇(gz). ρ

(2.48)

Denote by e the unit vector of v, that is v = ve, v = |v|. Now, e · ∇ is the derivative along e; we let v · ∇ = v∂ξ , ∂ξ = e · ∇, so that ξ is the arclength along the pertinent streamline, ∂ξ x = e.

x = x(ξ, t) :

At any point x, and time t, ∂ξ is the derivative along the tangent to the streamline at the point x, and time t. Hence, we have e · ∂ξ v = ∂ξ v.

(v · ∇)v = v∂ξ v,

Assume the fluid undergoes a stationary motion, ∂t ρ = 0,

∂t v = 0.

Inner multiplication of (2.48) by e results in 1 v∂ξ v = − ∂ξ p − ∂ξ gz. ρ

(2.49)

Now, assume the fluid is barotropic, that is p and ρ are related by an invertible function

2.8 Bernoulli’s Law and Balance Equations for Fluids

115

p = p(ρ). Hence, we define

p

h( p) := ∫

1 d p. ˜ ρ( p) ˜

1 ∇ p, ρ

∂ξ h =

p0

We have ∇h =

1 ∂ξ p. ρ

Consequently, (2.49) amounts to ∂ξ ( 21 v 2 + h + gz) = 0. If, further, the fluid is incompressible, then h( p) = and hence 1 2 v 2

+

p + constant ρ

p + gz = constant. ρ

We then state Bernoulli’s law by means of the following Theorem 2.2 (Bernoulli’s law) Along a streamline of an inviscid, barotropic fluid, undergoing a stationary motion 1 2 v 2

+ h + gz = constant.

Dividing by g, we have 1 2 v /g 2

+ h/g + z = constant.

Each term has the dimensions of a length; hence h/g is referred to as pressure head.

2.8.2 Variational Derivation of the Equation of Motion We show how the equation of motion can be derived in a variational setting. For definiteness, we let the fluid be incompressible,9 so that ∇ · v = 0 and ρ is a constant. Consider the Lagrangian density 9

More involved variational formulations for (compressible) fluids are given in [27].

116

2 Balance Equations

L = 21 ρv2 + ρb · x. We let the motion χ(X, t) be the unknown function, and hence we write the Lagrangian L in the form T

L = ∫ ∫ L (χ, p)(X, t) dv R dt, 0

L = 21 ρ(∂t χ)2 + ρb · χ + p(J − 1),

where J = det ∇χ and p(X, t) is a Lagrange multiplier to account for the incompressibility constraint. The extremality condition for χ and p and appropriate conditions on the boundary ∂ and at t = 0, T imply the Euler–Lagrange equations ∂χ L − ∂t ∂∂t χ L − ∇ · ∂F L = 0,

∂ p L = 0.

Observe that, in suffix notation ∂ Fi K J = J FK−1 i ,

−1 ∂ X K ( p J FK−1 i ) = J FK i ∂ X K p = J ∂xi p.

Hence, the Euler–Lagrange equations become ρb − ρ˙v − J ∇ p = 0,

J − 1 = 0,

whence ρ˙v = −∇ p + ρb,

J = 1;

the multiplier p takes the meaning of pressure.

2.9 Balance in a Control Volume Mathematically, a control volume is a fixed region in space. We then look for the formulation of balance laws in control volumes associated with balance laws in convecting regions. Let φ be any density field, per unit volume. The balance law for φ is written by letting two contributions occur, namely a volume term β and a surface term s such that d ∫ φ dv = ∫ β dv + ∫ s da. dt Pt Pt ∂Pt For technical convenience, we let ϕ = φ/ρ. As in Cauchy’s theorem, we can argue that s has to be linear in the unit normal n to the boundary; let h be such that s = h · n. Hence d ∫ ρϕ dv = ∫ β dv + ∫ h · n da. (2.50) dt Pt Pt ∂Pt

2.9 Balance in a Control Volume

117

If ϕ is a scalar (vector), then β is a scalar (vector) and h is a vector (tensor). Upon substituting and employing the transport theorem, we find ∫ ρ ϕ˙ dv = ∫ β dv + ∫ h · n da.

Pt

Pt

∂Pt

Since ρ ϕ˙ = ∂t (ρ ϕ) + ∇ · (ρ ϕ v), by means of the divergence theorem, we obtain ∫ ∂t (ρ ϕ)dv = ∫ β dv + ∫ (h − ρ ϕ v) · n da.

Pt

Pt

∂Pt

Since Pt is an arbitrary region in E , we let Pt =  and keep  fixed in space. Hence d ∫ ∂t (ρ ϕ)dv = ∫ ρ ϕ dv. dt   Consequently, the balance law can be written in the form d ∫ ρ ϕ dv = ∫ β dv + ∫ (h − ρ ϕ v) · n da. dt   ∂

(2.51)

It is apparent from (2.50) and (2.51) that, while the balance law of ρ ϕ in a convecting region involves the source terms β and h · n, the corresponding balance law in a fixed region (control volume) involves also a vector (or tensor) flux ρ ϕ v. Let ∂+ and ∂− be the subsets of ∂ where v · n < 0 (mass enters) and v · n > 0 (mass exits), respectively. Hence, we let cv := ∫ ρ ϕ dv, 

ˇ i := − ∫ ρ ϕ v · n da,  ∂+

ˇ e := ∫ ρ ϕ v · n da.  ∂−

Consequently dcv ˇi − ˇ e. = ∫ β dv + ∫ h · n da +  dt  ∂ This means that the time derivative of the content cv , within the control volume, depends not only on the physical sources β and s = h · n, but also on the flow through ˇ e. ˇi − the boundary of ρ ϕ at the rate  A customary approximation used in the technical literature amounts to assuming that ϕ is quite constant so that ˇ i  −ϕi ∫ ρ v · n da,  ∂+

ˇ e  ϕe ∫ ρ v · n da.  ∂−

The simplest balance law is that for mass. We let ϕ = 1 and β = 0, h = 0. Equation (2.51) reduces to

118

2 Balance Equations

dm cv = − ∫ ρ v · n da, dt ∂

m cv := ∫ ρ dv. 

This equation indicates that the rate at which mass changes within  is equal to the net inflow (or outflow) of mass through the boundary ∂. Indeed, let mˇ i := − ∫ ρv · n da, ∂+

mˇ e := ∫ ρv · n da; ∂−

mˇ i and mˇ e are the rates at which mass flows into, or outside, . Hence10 dm cv = mˇ i − mˇ e . dt Moreover, the generic balance equation (2.51) can be written in the (approximate) form dcv = ∫ β dv + ∫ h · n da + ϕi mˇ i − ϕe mˇ e . dt  ∂ The balance of linear momentum follows via the identifications ϕ = v,

β = ρ b,

h = T.

The balance equation Pcv dP = ∫ ρ b dv + ∫ T n da − ∫ ρ v v · n da dt  ∂ ∂ is approximated by Pcv dP = ∫ ρ b dv + ∫ T n da + vi mˇ i − ve mˇ e . dt  ∂ The balance of energy is expressed in various ways depending on the definition of the energy E cv within the control volume. Following the balance law (2.45), we make the identifications E cv = ∫ ρ(ε + 21 v2 )dv, 

β = ρ(b · v + r ),

s = (T v − q) · n.

The correct balance equation d E cv = ∫ ρ(b · v + r )dv + ∫ (T v − q) · n da − ∫ ρ(ε + 21 v2 )v · n da dt  ∂ ∂ The literature often indicates the two contributions by m˙ i and m˙ e . We avoid using m˙ i and m˙ e which would seem the derivatives of m i (t) and m e (t); formally we might determine m i (t) by t m i (t) − m i (t0 ) = t0 mˇ i (τ )dτ . 10

2.10 Basic Principles in Electromagnetism

119

simplifies to the approximate equation d E cv = ∫ ρ(b · v + r )dv + ∫ (T v − q) · n da + mˇ i (ε + 21 v2 )i − mˇ e (ε + 21 v2 )e . dt  ∂ Otherwise, letting T = Tˆ − p1, p > 0 being the pressure, we can write d E cv p = ∫ ρ(b · v + r )dv + ∫ (Tˆ v − q) · n da − ∫ ρ(ε + + 21 v2 )v · n da. dt ρ  ∂ ∂ Hence, the approximation results in d E cv p p = ∫ ρ(b · v + r )dv + ∫ (Tˆ v − q) · n da + mˇ i (ε + + 21 v2 )i − mˇ e (ε + + 21 v2 )e . dt ρ ρ  ∂

We can then say that the balance of E cv involves the heat and work powers ∫ ρ r dv − ∫ q · n da, 

∂

∫ ρb · v dv + ∫ (Tˆ v) · n da, 

∂

and the enthalpy inflow, mˇ i (ε + p/ρ + 21 v2 )i , and outflow, mˇ e (ε + p/ρ + 21 v2 )e . Really, the enthalpy (per unit mass) is ε + p/ρ and hence the inflow and outflow involve the enthalpy along with the kinetic term 21 v2 . If instead we consider the balance of energy (2.46), then we let Ucv = ∫ ρ(ε + 21 v2 + gz)dv, 

β = ρr,

s = (T v − q) · n

and state the balance law in the form dUcv = ∫ ρ r dv + ∫ (T v − q) · n da − ∫ ρ(ε + 21 v2 + gz)v da. dt  ∂ ∂ Hence, the approximate form reads (see, e.g. (4.9) of [316]) dUcv = ∫ ρ r dv + ∫ (T v − q) · n da + mˇ i (ε + 21 v2 + gz)i − mˇ e (ε + 21 v2 + gz)e . dt  ∂

2.10 Basic Principles in Electromagnetism Throughout, we use the rationalized MKSA system of units or SI units (see, e.g., [230, 401]). The electric field E and the magnetic induction B are defined operatively by means of the force on a charge. In fact, a particle with charge q and velocity v experiences a force F = q(E + v × B). Then measuring the force F on a test particle

120

2 Balance Equations

with different velocities determines the fields E and B at the location of the particle. The magnetic induction B can also be measured through the torque on a dipole. We can express the basic principles of electromagnetism in terms of charge (and current), electric field, and magnetic induction. These principles are essentially based on Coulomb’s inverse square law of force between charges, Ampère’s findings about the interaction of current elements, and Faraday’s experiments on variable magnetic fields. Following are the basic conceptual steps and axioms of electromagnetism (in free space).

2.10.1 Balance Equations in Free Space As with conservation of mass, conservation of charge can be stated by saying that if q is the charge density then d ∫ q dv = 0, dt Pt Pt being any time-dependent region convecting with the set of charges. Hence we find the continuity equation ∂t q + ∇ · (qv) = 0, v being the velocity of the charges. It is understood that q is the volume charge density of positive (ions) or negative (ions, electrons) free charges. Alternatively, we may let J = qv denote the current density vector and say that I S = ∫ J · n da S

is the current flowing through S in the direction given by n. While the continuity equation reads (2.52) ∂t q + ∇ · J = 0, the law of conservation of charge can be given the global form d ∫ q dv = − ∫ J · n da, dt  ∂  being any region fixed in space. Coulomb’s and Gauss’ laws. In 1785, Coulomb measured the forces that charges exert on each other. It resulted that the force between the charges varies directly as the product of their magnitudes and inversely as the square of the distance between

2.10 Basic Principles in Electromagnetism

121

them.11 If x1 , x2 are the position vectors of the charges q1 , q2 , then the force exerted by q2 on q1 is q1 q2 x1 − x2 F q2 = 4π 0 |x1 − x2 |3 and hence the electric field produced by q2 at x1 is E(x1 ) =

q2 x1 − x2 . 4π 0 |x1 − x2 |3

This indicates that the field produced by a continuous distribution of charges in the region , with charge density q, is given by 1 E(x) = 4π 0

q(r) 

x−r dvr |x − r|3

where vr denotes that r is the variable of integration. Let ∇x and ∇r be the gradients with respect to x and r. To evaluate ∇x · E(x), we observe ∇x ·

x−r x−r = −∇r · , |x − r|3 |x − r|3

∇r ·

x−r = 0, |x − r| > 0. |x − r|3

Hence, letting  R ⊂  be a sphere of radius R centered at x we have

4π 0 ∇x · E(x) = −



q(r)∇r ·

x−r dvr = − |x − r|3

R

q(r)∇r ·

x−r dvr . |x − r|3

Replace q(r) by q(x) + [q(r) − q(x)] and observe that, by the divergence theorem,

x−r q(x)∇r · dvr = q(x) |x − r|3 R

∂ R

x−r r−x da = −4πq(x). · |x − r|3 |x − r|

Further

R

[q(r) − q(x)]∇r ·

x−r x−r x−r dvr = ∇r · [q(r) − q(x)] dvr − · ∇r [q(r) − q(x)]dvr 3 3 |x − r| |x − r| R  R |x − r|3

R π 2π

r−x 1 = 2 [q(r) − q(x)]da + · ∇r q(r)dρ dθ dφ. |r − x| R ∂ R 0 0 0

The boundedness of ∇r q(r) and the continuity of q(r) imply that this integral approaches zero as R → 0. Hence, we find that ∇ ·E=

1 q.

0

In SI units the constant of proportionality is equal to 1/4π 0 , 0 being the permittivity of free space, 0 = 8.854 × 10−12 C2 /N · m2 .

11

122

2 Balance Equations

Based on this local equation, if S is the boundary of the region V , by the divergence theorem, we have 1 ∫ E · n da = ∫ q dv.

0 V ∂V Biot-Savart and Ampère’s laws. Denote by B the magnetic induction (also named the magnetic flux density). The operative definition of B is provided by measuring the mechanical torque T exerted on a magnetic dipole. If m is the magnetic moment, then T is given by12 T = m × B. In 1819, Oersted observed that wires carrying electric currents produced deflections of permanent magnetic dipoles. Next, Biot and Savart (1820) and Ampère (1820–1825) established the basic experimental laws relating B to the currents and determined also the law of force between currents. If I is the current flowing along the (closed) curve C, then the magnetic induction, at x, is given by the Biot–Savart law expressed by the line integral μ0 I B(x) = 4π

C

t(r) × (x − r) ds, |x − r|3

where13 t is the unit tangent vector to C. Now, since t is independent of x, ∂xi i jk t j

x k − rk δki (xk − rk )(xi − ri ) = i jk t j ( −3 ) = 0. |x − r|3 |x − r|3 |x − r|5

Hence, it follows that ∇ · B = 0. Now look at a surface S and denote by J the current density vector so that I = ∫ J · n da S

is the current flowing through S. Ampère’s (circuital) law is written in the form ∫ B · t ds = μ0 ∫ J · n da.

∂S

S

The arbitrariness of S and use of Stokes’ theorem yield ∇ × B = μ0 J.

12 13

Often N and μ are used in place of T and m in the physical literature. The constant μ0 is the permeability of free space, μ0 = 4π × 10−7 N/A2 .

2.10 Basic Principles in Electromagnetism

123

Faraday’s law. In 1831, Faraday observed that a time-varying induction B produces a current in a closed loop of wire. According to Lenz’s law, the induced current is in such a direction as to oppose the change of flux through the wire. Let S be a surface and let  = ∫ B · n da S

be the magnetic flux. The electromotive force around the circuit is EC = ∫ E · t ds, C

where E is the electric field at the point of the wire. Faraday’s (and Lenz’s) observations are summed up in the mathematical law ∫ E · t ds = − C

d ∫ B · n da. dt S

(2.53)

If the wire C is moving with velocity field v then, by (1.49) and (1.50), it follows d ∫ B · n da = ∫[∂t B + ∇ × (B × v) + v∇ · B] · n da. dt S S As a consequence, Faraday’s law (2.53) and the divergence-free condition ∇ · B = 0 result in ∫ E · t ds = − ∫ [∂t B + ∇ × (B × v)] · n da. C

S

Hence, in view of Stokes’ theorem, we have E + B × v] · t ds = − ∫ ∂t B · n da. ∫ [E C

S

Now E + B × v = E, that is the electric field in the chosen reference frame. Consequently, by a further use of Stokes’ theorem, we find the equation ∇ × E = −∂t B, where E and B are the fields in the chosen reference frame. As a check of consistency, if the wire C, and hence S, is fixed in space, then we have ∫ E · t ds = − ∫ ∂t B · n da, C

S

E and B being the fields as seen in the reference of the wire. The arbitrariness of S and use of Stokes’ theorem result again in the equation ∇ × E = −∂t B.

124

2 Balance Equations

By way of summary, the mathematical results so obtained, which are a consequence of the experiments performed in the first half of 1800s, can be written in the form of equations ∇ · E = q/ 0 ∂ E · n da = (1/ 0 )  q dv B · n da = 0 ∇ · B =0 ∂ B · t ds = μ J · n da ∇ × B = μ0 J 0 S ∂ S d E · t ds = − B · n da ∇ × E = −∂t B, ∂S dt S which can be viewed as the statements of Coulomb’s law, absence of free magnetic poles, Ampère’s law, and Faraday’s law.

2.10.2 Balance Laws in Matter Electromagnetism in matter involves also the polarization P and the magnetization M. For compactness of presentation, balance laws and constitutive properties are given in a unified scheme. An externally applied electric field may cause microscopic separations of the centres of positive and negative charges which thus behave as dipoles of charges. This phenomenon is referred to as the polarization of the (dielectric) material. The tendency of the dipoles to align with the electric field causes charges to appear at the surfaces of the material. Hence, Gauss’ law in matter is stated by saying that, for any region  1 1 ∫ λ p da, ∫ E · n da = ∫ q dv +



 ∂ 0 0 ∂ where q is the density, per unit volume, of free charges while λ p is the charge density, per unit area, due to polarization. By the generalized Cauchy’s theorem, there exists a vector field, say −P, such that ∫ λ p da = − ∫ P · n da = − ∫ ∇ · P dv.

∂



∂

The negative sign allows us to view P as the polarization vector, namely the dipole moment per unit volume. Letting D = 0 E + P we can write Gauss’ law in the form ∫ D · n da = ∫ q dv.

∂



2.10 Basic Principles in Electromagnetism

125

The vector D is termed electric displacement or electric flux density. This view is the mathematical basis for the occurrence of the polarization vector P. A physical picture of the polarization field is gained by looking at an assembly of dipoles. An electric dipole consists of a positive and a negative charge, q and −q, and d is the displacement from the position of −q to that of q. We denote by p = q d the vector which is then directed from the negative to the positive charge; we say that p = q d is the electric dipole moment and d is the displacement vector. Assume the dipoles have the same length |d| = c. Let f (x, d, t) be the distribution function, on the sphere  of radius c, of d at (x, t). Hence, we define the dipole moment density in the form p(x, t) =  p  = q d = q ∫ d f (x, d, t)da, 

that is the first moment of f . Letting n(x, t) be the number density of dipoles we let P = np = nqd be the polarization vector. Outside possible surface and line discontinuities, p is taken to be a continuous function of x and t. We associate a constant number of dipoles to each atom or molecule. This in turn justifies the assumption that n is a constant times the mass density ρ. As a consequence, n satisfies the balance equation ρ˙ n˙ = = −∇ · v. n ρ An atom has a magnetic moment with orbital and spin contributions. For atoms or ions containing many electrons, the electrons in closed shells pair up with their spins and orbital angular momenta opposite each other, thus producing a zero net magnetic moment. Atoms with an odd number of electrons must have at least one unpaired electron and a spin magnetic moment of at least one Bohr magneton μ B , μB =

e , 2m e

where e and m e are the (positive) charge and the mass of the electron. The total magnetic moment of an atom is the vector sum of the orbital and spin magnetic moments. An unpaired outer electron can contribute both an orbital moment and a spin moment. The orbital moment is about the same order of magnitude as the Bohr magneton. The nucleus of an atom also has a magnetic moment associated with protons and neutrons. Yet, owing to their masses, the magnetic moment of a proton or neutron is small compared with the magnetic moment of an electron, by a factor of about 10−3 . The magnetization M is the magnetic moment of the substance per unit volume. Let B0 be the magnetic induction produced by, e.g. a current-carrying conductor. If the region is filled with a magnetic substance, the total magnetic induction is

126

2 Balance Equations

B = B0 + Bm , where Bm is the field produced by the magnetic substance. We let Bm = μ0 M and hence B = B0 + μ0 M. By letting H = (B/μ0 ) − M, we can write B = μ0 (H + M). The field M is defined as the magnetization14 while H is termed the magnetic intensity15 ; H is then 1/μ0 times the magnetic induction B0 in the absence of matter. In a large class of substances, the magnetization M is proportional to the magnetic intensity H, M = χH, χ being called the magnetic susceptibility. If χ > 0, the substance is said to be paramagnetic, if χ < 0 the substance is said to be diamagnetic. Hence B = μm H,

μm = μ0 (1 + χ).

We can view B = (1 + χ)B0 as the magnetic induction in the material and hence H = B0 /μ0 as the magnetic field. Since usually |χ|  1 then diamagnetism is such a weak property that its effects are not observable in everyday life. All conductors exhibit an effective diamagnetism when they experience a changing magnetic field. The Lorentz force on electrons causes them to circulate around forming eddy currents. Such currents in turn produce an induced magnetic field opposite to the applied field. In the limit case χ = −1, the material is called a perfect diamagnet and all magnetic fields applied to it are expelled. Both in diamagnetic and paramagnetic substances μm ≥ 0. As a consequence, B is in the same direction as H; in paramagnetic substances M · H > 0, whereas in diamagnetic substances M · H < 0. For ferromagnetic substances, μm is several thousand times μ0 but is not a constant. A paramagnetic substance has a positive, but small, susceptibility which is due to the presence of atoms, or ions, with permanent magnetic dipole moments. These dipoles are randomly oriented in the absence of an external magnetic field and tend to align with an applied magnetic field. This alignment is contrasted by thermal motion which tends to randomize the dipole orientations. This competition is made formal by Curie–Weiss law, whereby M =C

14 15

B , μ0 (θ − θC )

Or magnetic polarization or magnetic moment density. Or magnetic field (strength).

2.10 Basic Principles in Electromagnetism

127

C being the Curie constant and θC the Curie temperature; the magnetization increases with increasing the induction B and with decreasing the temperature θ. As θ < θC , the behaviour of M is nonlinear (Sect. 15.3.1). So as to establish balance equations, we observe that an atom as a whole has a small magnetic moment but the random orientation of the atoms in a larger sample produces an average zero magnetic moment. When an external field is applied there is a small torque on each atomic moment and these moments tend to become aligned with the external field. Accordingly, an atom is similar to a current loop and can thus be considered to have magnetic dipole moments. The current, however, reflects the movement of bound charges (orbital electrons, electron spin, and nuclear spin). We then let the current density be the sum of the free current density J and the bound current density Jb . To model Jb we observe that the current loops in the material result in a circulating current around the periphery of the material. We then let Jb = ∇ × M and modify Ampère’s law as ∫ B · t ds = μ0 ∫ J · n da + μ0 ∫ M · t ds,

∂S

∂S

S

where J is the free current density. Since B = μ0 (H + M) we have ∫ H · t ds = ∫ J · n da.

∂S

S

The balance equations in matter are then given the form ∂

D · n da =

∂S

H · t ds =





S

q dv

J · n da

∇ · D = q, ∇ × H = J.

It is naturally claimed that E and B are the fundamental fields while D and H are the derived fields. Incidentally, a physical picture of M arises by considering an assembly of magnetic dipoles with moment |s| = c m = μ0 s, and then letting m = μ0 s,

M = nm.

It was Maxwell [306] who compiled the known results obtained by Ampère, Faraday, Gauss, Coulomb, and others and made an important addition to one of them by overcoming an inconsistency. All these laws, but Faraday’s law, were derived from steady-state observations. Hence, it is natural to expect that such equations requires some improvements for time-dependent fields. The faulty equation is Ampère’s law.

128

2 Balance Equations

The divergence of ∇ × H = J gives ∇ · J = 0, which holds only for steady-state problems. In general, the continuity condition and Gauss’ law imply that ∇ · J = −∂t q = −∇ · ∂t D. Hence, Ampère’s law is made consistent by replacing J with J + ∂t D. The whole set of differential equations become ∇ · D = q, ∇ · B = 0,

∇ × H = J + ∂t D,

(2.54)

∇ × E + ∂t B = 0.

(2.55)

Equations (2.54)–(2.55) are known as Maxwell’s equations. Maxwell called the added term ∂t D the displacement current. The form (2.54)–(2.55) of Maxwell’s equations is usually referred to as Minkowski formulation.16 The Field H Even within the MKS system of units, the literature shows different definitions of M and H. By analogy with the definition of E + P/ 0 as the field D/ 0 then B − μ0 M is ˜ Instead, as the majority does in ˜ so that B = μ0 M + H. defined as the vector, say H, ˜ 0 as the new field so that B = μ0 (M + H). the literature, we have chosen H = H/μ ˆ and H, ˆ so that B = M ˆ + H. ˆ Of course, Others (e.g. [372]) define M and H, say M ˜ and those of M, M ˆ are different. the physical units of H, H

2.11 The Invariant Form of Maxwell’s Equations According to the special theory of relativity: (i) the laws of physics are the same in all systems which move with uniform velocity relative to one another, (ii) the speed of light in free space17 is c whatever the relative uniform motion of source and observer. 16

See, e.g. [357]. An analysis of the connection between various formulations of Maxwell’s equations is given in [381]. 17 The speed of light in free space, c, is a universal physical constant. Its exact value is defined as 299792458 metres per second. It is exact because, by the international agreement, a metre is defined as the length of the path travelled by light in free space during a time interval of 1/299792458 second.

2.11 The Invariant Form of Maxwell’s Equations

129

We now show how these principles affect the transformation laws of coordinates relative to two different inertial frames F , F  . Let O x yz and O  x  y  z  be Cartesian coordinate systems in F and F  . The coordinates are chosen so that F  is moving with uniform speed v relative to F along both O x and O  x  . Moreover let t = 0 and t  = 0 when O and O  coincide. There is no relative motion in the y and z directions and hence these coordinates are not affected by the transformation law y  = y,

z  = z.

Since x  = 0 when x = 0, t = 0, it is reasonable to assume that x  = γ(x − vt) where γ is a dimensionless constant possibly dependent on v. It is expected that the Galilei transformation x  = x − vt is recovered at the limit of small speeds, relative to c, and hence it is expected that γ → 1 as v/c → 0. By (i), the inverse law x = γ(x  + vt  ) has to hold. Let a light pulse be emitted at the origin of F at t = 0. By (ii), we have x = ct, x  = ct  . Hence x = ct ⇐⇒ x  = ct  . As a consequence, we can write 0 = x  − ct  = γ(x − vt) − ct  = γ(ct − vt) − ct  whence

v t  = γ(t − t). c

Also, by x = γ(x  + vt  ), we have ct = γ(ct  + vt  ) = γ(c + v)t  . Upon substitution of t  , it follows γ2 v ct = γ(c + v)γ(t − t) = (c2 − v 2 )t c c whence γ2 =

c2 . c2 − v 2

130

2 Balance Equations

Since γ > 0 we have

γ = (1 − v 2 /c2 )−1/2 .

Now, that we have determined γ, we replace x  to obtain x = γ(x  + vt  ) = γ[γ(x − vt) + vt  ] whence

t  = γ(t − vx/c2 ).

The set of relations x  = γ(x − vt),

y  = y, z  = z, t  = γ(t − vx/c2 )

(2.56)

constitutes the Lorentz transformation. The inverse transformation, that is x, y, z, t in terms of x  , y  , z  , t  , follows by replacing v by −v and by interchanging primed and unprimed coordinates, x = γ(x  + vt  ),

y = y  , z = z  , t = γ(t  + vx  /c2 ).

By (2.56), it follows that x 2 + y 2 + z 2 − c2 t 2 = x 2 + y 2 + z 2 − c2 t 2 . Hence, letting x 1 = x, x 2 = y, x 3 = z, x 4 = ct, we conclude that the invariant distance of events x α is given by gαβ x α x β , where the metric tensor {gαβ } is given by {gαβ } = diag[1, 1, 1, −1]. Consequently {xα } = [x, y, z, −ct],

{g αβ } = diag[1, 1, 1, −1].

If v 2  c2 , then γ approaches unity and the Lorentz transformation approaches the Galilean transformation x  = x − vt,

y  = y, z  = z, t  = t.

By (i), any physical law must be put in a form which is unaltered by a Lorentz transformation. Now, in the four-dimensional setting, we can express the Lorentz transformation as α x  = αβ x β ,

2.11 The Invariant Form of Maxwell’s Equations

131

the matrix  being given by ⎡

γ ⎢ 0 αβ = ⎢ ⎣ 0 −γv/c

0 1 0 0

⎤ 0 −γv/c 0 0 ⎥ ⎥. 1 0 ⎦ 0 γ

We then show that Maxwell’s equations (2.54)–(2.55) can be written in tensor form. By (2.55), there exist a vector potential A and a scalar potential V such that B = ∇ × A,

E = −∂t A − ∇V.

Regard Aα = (A, V /c) as a four-vector; we next check the consistency of this assumption while A1 = A1 ,

A2 = A2 ,

A3 = A3 ,

A4 = −V /c.

Define the skew-symmetric tensor Fαβ = ∂x α Aβ − ∂x β Aα , α, β = 1, 2, 3, 4, that is Fαβ = −Fβα ,

Fhk = hki Bi ,

Fh4 = E h /c, h, k = 1, 2, 3.

In matrix form it is ⎤ −B2 E 1 /c 0 B3 ⎢ −B3 0 B1 E 2 /c ⎥ ⎥. =⎢ ⎣ B2 −B1 0 E 3 /c ⎦ −E 1 /c −E 2 /c −E 3 /c 0 ⎡

Fαβ

The two equations (2.55) may then be given the tensor form ∂x σ Fαβ + ∂x α Fβσ + ∂x β Fσα = 0,

(2.57)

where α, β, σ are any three of 1, 2, 3, 4. Indeed, if (α, β, σ) = (1, 2, 3), then (2.57) gives ∇ · B = 0. If, instead, (α, β, σ) = (1, 2, 4), (3, 1, 4), (2, 3, 4), then we obtain the components of ∇ × E + ∂t B = 0. Let G αβ be the skew-symmetric tensor defined by G hk = hki Hi , G h4 = −cDh , that is

132

2 Balance Equations

⎤ 0 H3 −H2 −cD1 ⎢ −H3 0 H1 −cD2 ⎥ ⎥ =⎢ ⎣ H2 −H1 0 −cD3 ⎦ , cD1 cD2 cD3 0 ⎡

G αβ

and J α the four-vector given by J α = (J1 , J2 , J3 , cρ). Equation (2.55) can be written in the form ∂x β G αβ = J α .

(2.58)

If α = 4, we obtain ∇ · D = ρ. If α = 1, 2, 3, we obtain the components of ∇ × H − ∂t D = J. Moreover, by the skew-symmetry of G, it follows that 0 = ∂x α ∂x β G αβ = ∂x α J α ,

(2.59)

that is the continuity equation (2.52). The tensor form (2.57)–(2.58) of Maxwell’s equations implies that they hold invariant under a Lorentz transformation. The same is true for the continuity equation (2.59). Moreover, the tensor representation of E, B, D, H allows us to determine how these fields change relative to different observers. First, since F involves only E and B, then it follows that E , B , the fields relative to F  , are linear combinations of E and B. Analogously, the form of G implies that H , D , the fields relative to F  , are linear combinations of H and D. By F αβ = g αμ g βν Fμν it follows that −F 4i = F i4 = −F4i = F4i , i = 1, 2, 3, and the like for G. Instead, F i j = Fi j , i, j = 1, 2, 3. Let F  αβ and G  αβ be the values of F αβ and G αβ at F  . By F

αβ

= αμ βν F μν

and the like for G we find the transformation laws E = γE + (1 − γ)(E · v)v/v 2 + γv × B, B = γB + (1 − γ)(B · v)v/v 2 − γv × E/c2 , and

D = γD + (1 − γ)(D · v)v/v 2 + γv × H/c2 , H = γH + (1 − γ)(H · v)v/v 2 − γv × D.

2.11 The Invariant Form of Maxwell’s Equations

133

Moreover, J  α = αβ J β gives J = J − (1 − γ)(J · v)v/v 2 − γqv,

q  = γ(q − J · v/c2 ).

It is of interest to look at the transformation properties under the approximation of small speeds, |v|  c, where γ  1. The 1/c2 terms are often disregarded and hence the approximations J  J − qv,

q   q.

are considered in the literature. This means that q is (roughly) invariant, whereas J and J differ by the convective term qv. The fields E, B, D, H in F , F  are approximately related by E  E + v × B,

B  B,

D  D,

H  H − v × D.

If F  is (momentarily) at rest with the charge, then E is the electric field at rest with the charge. At F  , a charge q at rest experiences a force qE = q(E + v × B), E, B being the fields at F . This shows, and confirms the experimental evidence, that a charge q in motion with velocity v experiences a force q(E + v × B), called the Lorentz force. Instead of the primed quantities, henceforth we use E , B , D , H , J ; the approximations can then be taken in the form E = E + v × B,

B = B,

D = D,

H = H − v × D, J = J − qv.

The approximations for B and D are justified in several ways [354, 409] with purely Galilean arguments. Assume the body is rigid and let ∇  × E = −∂t  B . Hence B, ∂t  B = (∂t + v · ∇)B v being the velocity of F  with respect to F . Since ∇ · B = 0 and ∇v = 0, then we have B. ∇ × (v × B ) = −(v · ∇)B Hence, since ∇ = ∇  , we can write E − v × B ) = −∂t B . ∇ × (E This is Maxwell’s equation with respect to F if B = B and E = E − v × B. A similar argument leads to D = D.

134

2 Balance Equations

2.12 Maxwell’s Equations for the Fields at the Rest Frame We now review Maxwell’s equations by means of the fields at the local rest frame of the body. Let the surface S and the contour C = ∂ S of S be in motion with velocity v, possibly varying along C. By (1.51) we have d ∫ D · n da = ∫[∂t D + ∇ × (D × v) + v∇ · D] · n da = − ∫ v × D · t ds + ∫(∂t D + qv) · n da dt S S S ∂S

Moreover H + v × D = H, is the magnetic intensity in the chosen reference frame whereas H is the value at the contour C = ∂ S. Also J = J − qv is the current density relative to the observer at rest with the body while J = J + qv comprises the convective term qv. Hence, Maxwell’s equations in integral form can be written as ∫ D · n da = ∫ q dv,

∂



∫ H · t ds = ∫ J · n da +

∂S

∫ B · n da = 0,

∂

S

∫ E · t ds = −

∂S

d ∫ D · n da, dt S

d ∫ B · n da. dt S

(2.60) (2.61)

Equations (2.60)–(2.61) imply (2.54)–(2.55). In the next section, we give a full account of the transformation properties of E, B, D, H thus providing a better understanding of the connection between (2.60)–(2.61) and (2.54)–(2.55). Equations (2.54)–(2.55) are relative to a chosen reference frame, while (2.60)–(2.61) involve the fields relative to the observer at rest with the body (S and ∂ S). For any C 1 vector field w d  ∫ w · n da = ∫ w ·n da. dt S S Then the arbitrariness of S and  in (2.60)–(2.61) implies ∇ · D = q, ∇ · B = 0,



∇ × H = J + D, 

∇ ×E = − B .

(2.62) (2.63)

As it must be, substitution of H = H − v × D and E = E + v × B and the definition  ˙ − Lw + (∇ · v)w imply that Maxwell’s equations (2.62)–(2.63) are equivaw= w lent to Maxwell’s equations (2.54)–(2.55).

2.13 Material Description of Electromagnetic Fields

135

2.13 Material Description of Electromagnetic Fields The material description of the electromagnetic fields is obtained by means of the integral forms of Maxwell’s equations. The fields so obtained are referred to as Lagrangian fields; here they are denoted by the subscript R. By Gauss’ law relative to the region Pt = χ(P, t), we have 1 1 ∫ q J dv R = ∫ q dv = ∫ D · n da = ∫ D · J F−T n R da R .

0 P

0 Pt ∂Pt ∂P Hence, we have Gauss’ law in the material description 1 ∫ q R dv = ∫ D R · n R da R

0 P ∂P if q R = J q,

D R = J F−1 D.

For a curve C = ∂ St parameterized by γ ∈ [a, b], x = χ(X(γ)), we have ∂γ χ = F∂γ X and hence b

b

a

a

∫ E · F∂γ X dγ = ∫ E · ∂γ χ dγ = − Hence, we have E R = EF,

d d ∫ B · n da = − ∫ B · J F−T n R da R . dt St dt S B R = J F−1 B.

Likewise, Ampere’s law yields b

b

a

a

∫ H · F∂γ X dγ = ∫ H · ∂γ χ dγ = ∫ J · n da + St

= ∫ J · J F−T n R da R + St

Hence, we find that H R = HF,

d ∫ D · n da dt St

d ∫ D · J F−T n R da R . dt S

J R = J F−1 J

and, again, D R = J F−1 D. Moreover, we let P and M transform like D and B. Hence, we have E R = EF, H R = HF; D R = J F−1 D, P R = J F−1 P, B R = J F−1 B, M R = J F−1 M.

It then follows that

136

2 Balance Equations

D R = 0 J F−1 E R F−1 + P R ,

B R = μ0 (J F−1 H R F−1 + M R ).

The material description of Maxwell’s equations is then formally invariant to the spatial description, in that ∇R · DR = qR ,

∇ R · B R = 0,

∇ R × H R = J R + ∂t D R ,

∇ R × E R = −∂t B R ,

hold along with the continuity equation ∂t q R + ∇ R · J R = 0. To complete the scheme, we have to specify the constitutive relations. The relations D = 0 E + P,

B = μ0 (H + M)

provide two of these fields in terms of the other ones. Moreover, in linear dielectrics P is a (linear) function of E. In view of a possible anisotropy, P and E are allowed to be non-collinear and hence we let P = 0 χe E,

D = 0 (1 + χe )E,

where χe is the positive-definite bulk electric susceptibility18 tensor. Likewise, in linear paramagnets, M is assumed to be a linear function of H so that M = χm H,

B = μ0 (1 + χm )H,

χm being the positive-definite bulk magnetic susceptibility tensor. Equivalently, we can evaluate M and H in terms of B in the form M=

1 χ B, μ0 b

H=

1 (1 − χb )B, μ0

χb := χm (1 + χm )−1 .

Both χe and χm (and hence χb ) are allowed to depend on temperature and deformation. Now we see that different definitions are allowed for P R and M R and here we indicate three possibilities. First we take the view that D(E) and B(H) keep the same form in the reference configuration. Since D R = J F−1 D and E R = EF then D = 0 (1 + χe )E implies D R = 0 (J F−1 F−T + J F−1 χe F−T )E R .

18

Also 0 χe is called electric susceptibility.

2.13 Material Description of Electromagnetic Fields

137

The analogue holds for B and H. Accordingly, if the relation P = 0 χe E has to hold in the reference configuration then we conclude that P R = 0 J F−1 χe F−T E R = 0 J F−1 χe E and hence D R = 0 J C−1 E R + P R ,

P R = 0 χe R E R ,

χe R = J F−1 χe F−T .

Likewise, since B R = J F−1 B, H R = HF, and B = μ0 (H + M), M = χm H then B R = μ0 J F−1 (H + M) = μ0 J C−1 H R + μ0 J F−1 χm F−T H R . Hence, we have B R = μ0 (J C−1 H R + M R ),

M R = χm R H R ,

χm R = J F−1 χm F−T .

Accordingly, P R = χe R E R and M R = χm R H R provide the polarization P R and the magnetization M R in terms of the fields E R , H R via the susceptibilities χe R , χm R . Because of the deformation, χe R and χm R show anisotropic dependences of P R (E R ) and M R (H R ) even though χe and χm are isotropic. Another possibility arises from the view, or the assumption, that D, E, P and B, H, M in the reference configuration are related as they are in the spatial description, so that B R = μ0 R (H R + M R ) D R = 0 R E R + P R , for appropriate tensors 0 R , μ0 R . Since D R = J F−1 ( 0 E + P) = 0 J C−1 E R + J F−1 P and

B R = J F−1 μ0 (H + M) = μ0 J C−1 (H R + CF−1 M)

then it follows that P R = J F−1 P, and

M R = CF−1 M = FT M = MF,

0 R = 0 J C−1 ,

μ0 R = μ0 J C−1 .

This scheme arises without any assumption on the possible linearity of P(E) and M(H). Incidentally, this scheme implies that P and D have the same transformation law as is the case for M and H. A third scheme arises by regarding E and B as the primitive variables so that the constitutive functions are

138

2 Balance Equations

P = P(E, B, ...),

M = M(E, B, ...)

in the spatial description and P R = D R (E R , B R , ...),

M R = M R (E R , B R , ...),

in the material description. We then determine P and M via M=

P = D(E, B, ...) − 0 E,

1 B − H(E, B, ...). μ0

By the transformation laws of D, E and B, H, we obtain P= M=

1 1 FD R − 0 F−T E R = F(D R − 0 J C−1 E R ), J J

1 1 1 1 FB R − F−T H R = F( B R − J C−1 H R ) = F−T ( CB R − H R ). μ0 J J μ0 μ0 J

Accordingly

J F−1 P = D R − 0 J C−1 E R , FT M =

allow us to define

1 CB R − H R μ0 J

P R = J F−1 P,

M R = FT M

and hence to state the constitutive relations in the material description as D R = 0 R E R + P R , H R = μ−1 0R BR − MR ,

0 R = 0 J C−1 , μ0 R = μ0 J C−1 .

As a consequence, we have also the transformation laws

0 → 0 R = 0 J C−1 ,

μ0 → μ0 R = μ0 J C−1 .

Conversely, we might start with the definitions P R = D R (E R , B R , ...) − 0 E R , Hence, we have

MR =

1 B R − H R (E R , B R , ...). μ0

2.14 Formulations of Electromagnetism in Matter

139

P R = J F−1 D − 0 FT E = J F−1 (D − MR =

0 T FF E), J

J −1 J F B − FT H = FT ( F−T F−1 B − H). μ0 μ0

As a consequence, the relations D = 0 E + P,

H = μ−1 0 B−M

holds provided we let again P R = J F−1 P, whereas 0 =

0 T FF , J

M R = FT M μ0 =

(2.64)

μ0 T FF . J

The second and third views lead to the transformation (2.64) as the appropriate ones for P and M in passing from the spatial description to the material description. As to the permittivity and permeability tensors (of free space), it looks consistent to let μ0 R = J F−1 μ0 F−T . 0 R = J F−1 0 F−T , Things are simpler in rigid motions. Since F = R, FFT = FT F = 1, and J = 1, then we have μ0 R = μ0 = μ0 1. 0 R = 0 = 0 1, In all the three schemes, we obtain the invariance properties E R = E, P R = P, D R = D, B R = B, M R = M, H R = H and hence D R = 0 E R + P R , D = 0 E + P,

B R = μ0 (H R + M R ), B = μ0 (H + M).

2.14 Formulations of Electromagnetism in Matter The subject of electromagnetism in matter is quite controversial, the origin of the controversy being related to different views about the selection of the primary fields. This in turn leads to seemingly different balance equations in matter. Seemingly, the literature shows different basic laws about force and torque produced by electromagnetic fields in matter. Most often the Lorentz force F on a charge

140

2 Balance Equations

q and the torque T on a magnetic dipole m are (e.g. [377], p. 212; [230], p. 168) F = q(E + v × B),

T = m × B.

(2.65)

Yet the Lorentz force is also taken in the form ([357], p. 48) F = q(E + v × μ0 H).

(2.66)

The view underlying (2.66) is that in free space B = μ0 H and in matter the reference magnetic field is maintained μ0 H. Of course, due to a nonzero magnetization M, in matter B = μ0 H. We now examine the Chu formulation and the Amperian formulation of Maxwell’s equations.19 Throughout, Maxwell’s equations are considered in the form ∇ × E = −∂t B, ∇ · D = q,

∇ × H = J + ∂t D, ∇ · B = 0,

along with the relations D = 0 E + P, B = μ0 (H + M). This form is referred to as Minkowski formulation. It was the first formulation for moving media, all four field variables E, D, H, B being present both in free space (where D = 0 E, B = μ0 H) and in matter, although in free space two of them are redundant. In matter, six significant fields occur and hence constitutive equations are needed. We now examine two formulations that sometimes are applied in the literature. To see how other formulations can be determined, we observe that, in free space, Maxwell’s equations can be written in the form ∇ × E = −μ0 ∂t H,

0 ∇ · E = q,

∇ × H = 0 ∂t E + J, μ0 ∇ · H = 0.

Often, in the literature, J and q are written as J f and q f to make it evident that they are the current density and the charge density generated by free charges (mainly electrons). Though these equations are generally valid, in any formulation, they allow one to say that the primitive fields are E, H while one is led to viewing E and B as the primitive fields in the Minkowski formulation. Consistently, this might be the origin of the different forms of Lorentz force in (2.65) and (2.66). We now examine two well-known formulations and contrast them with the Minkowski formulation.

19

See [357], Chap. 7, and [165]. The equivalence of these formulations is investigated by [381] in the relativistic context.

2.14 Formulations of Electromagnetism in Matter

141

Chu formulation. It is based on the view that only two field quantities are necessary in free space20 and these quantities are E and H. In matter, the source terms q and J are supplemented by a current density J p and charge density q p produced by the polarization of the medium. By analogy, a magnetic current density Jm and a magnetic charge density qm are allowed. Hence, the equations become ∇ × EC = −μ0 ∂t HC − Jm ,

∇ × HC = 0 ∂t EC + JC + J p ,

0 ∇ · EC = q + q p ,

μ0 ∇ · HC = qm ;

the subscript C labels fields pertaining to the Chu formulation. To determine J p , we consider a surface S and a picture where two types of motion of dipoles occur around S; roughly, a motion makes dipoles flowing perpendicularly to S, the other one makes the dipoles moving along a curve C on S. The polarization current I p is then made formal as I p = ∫ ∂t P · nda + ∫ P × v · tds = ∫(∂t P + ∇ × P × v) · n da. S

C

S

This motivates the definition J p = ∂t P + ∇ × (P × v). By requiring that ∇ · J p + ∂t q p = 0, it follows that q p = −∇ · P. By analogy, Jm and qm are Jm = ∂t (μ0 M) + ∇ × (μ0 M × v),

qm = −∇ · (μ0 M).

Hence, Maxwell’s equations in the Chu formulation for moving polarizable and magnetizable matter are ∇ × EC = −μ0 ∂t (HC + MC ) − ∇ × (μ0 MC × v), ∇ × HC = ∂t ( 0 EC + PC ) + ∇ × (PC × v) + JC , ∇ · ( 0 EC + PC ) = q,

∇ · μ0 (HC + MC ) = 0.

To find the connection between Chu variables and Minkowski variables, we observe that the two versions of Maxwell’s equations are equivalent if E = EC + μ0 MC × v, H = HC − PC × v, B = μ0 (HC + MC ), D = 0 EC + PC , and then 20

This is true also for the Minkowski formulation.

142

2 Balance Equations

M = MC + PC × v,

P = PC − 0 μ0 MC × v.

Conversely, we derive the Chu fields in terms of the Minkowski fields. By direct substitutions, we have EC = E − μ0 MC × v = E − B × v − μ0 HC × v = E − (B − μ0 H) × v − v × μ0 (PC × v).

Since PC × v = D × v − 0 EC × v then EC (1 − v 2 /c2 ) = E − (B − μ0 H) × v − μ0 v × (D × v) − (1/c2 )(v · E)v, where c2 = 1/ 0 μ0 . In addition 1 1 v × (E × v) [E − 2 (v · E)v] = E + . 2 2 1 − v /c c c2 − v 2 Likewise, we determine HC , PC , MC . We then find EC = E +

v × [(E − D/ 0 ) × v] v × (B − μ0 H) + , c2 − v 2 1 − v 2 /c2

HC = H +

v × [H − B/μ0 ) × v] v × (D − 0 E) − , c2 − v 2 1 − v 2 /c2

PC = D − 0 E +

v × [(D − 0 E) × v] 0 v × (B − μ0 H) − , c2 − v 2 1 − v 2 /c2

μ0 MC = B − μ0 H +

v × [(B − μ0 H) × v] μ0 v × (D − 0 E) + . c2 − v 2 1 − v 2 /c2

By the transformation law of electromagnetic fields, the effective fields E and H (i.e. the fields at the reference at rest at the point under consideration) are given by E = E + v × B,

H = H − v × D,

where E and B are the Minkowski fields in the laboratory frame. In terms of the Chu variables, the effective fields are given by E C = E + v × μ0 H,

HC = H − v × 0 E.

Amperian formulation. It is based on the view that, in free space, only two fields are necessary, namely E and B. The effects of polarization and magnetization are modelled by having mind distributions of charges and currents in free space. We denote by the subscript A the fields in the Amperian formulation and then start with Maxwell’s equations in the form

2.15 Poynting’s Theorem

143

∇ × E A = −∂t B A ,

0 ∇ · E A = q + q pm ,

∇ × B A = μ0 (J + J pm ) + μ0 0 ∂t E A ,

∇ · B A = 0,

where q pm , J pm denote terms arising from polarization and magnetization. Based on arguments about the effects of current loops, the contributions to q pm , J pm are determined so that Maxwell’s equations become [165] ∇ × E A = −∂t B A ,

0 ∇ · E A = q − ∇ · P A + ∇ · (M A × v/c2 ),

∇ × B A /μ0 = J + 0 ∂t E A + ∂t P A + ∇ × (P A × v) + ∇ × M A − ∂t (M A × v/c2 ),

∇ · B A = 0.

Incidentally J pm = ∂t P A + ∇ × (P A × v) + ∇ × M A − ∂t (M A × v/c2 ),

q pm = −∇ · P A + ∇ · (M A × v/c2 )

satisfy identically the continuity equation ∇ · J pm + ∂t q pm = 0. The connection between the Chu variables, Amperian variables, and Minkowski variables are established by requiring the equivalence of the formulations. It follows that E = E A , B = B A , H = B A /μ0 − M A − P A × v, D = 0 E A + P A − M A × v/c2 , and then M = M A + P A × v,

P = P A − M A × v/c2 .

Equalling the Minkowski variables in terms of Chu variables and Amperian variables, we find E A = EC + μ0 MC × v, B A = μ0 HC + μ0 MC , P A = PC , M A = MC , and then D A = DC + 0 μ0 MC × v,

H A = HC .

Since E = E A , B = B A , H = B A /μ0 − M A − P A × v, and D = 0 E A + P A + v × M/c2 then the effective fields in terms of Amperian variables are E A = EA + v × BA,

H A = H A − v × ( 0 E A + v × M A /c2 ).

2.15 Poynting’s Theorem A classical relation in electromagnetism, known as Poynting’s theorem, is often framed within a balance of energy. To obtain this relation, inner multiply Ampère’s law by E,

144

2 Balance Equations

E · ∇ × H = J · E + E · ∂t D. By the vector identity ∇ · (E × H) = −E · ∇ × H + H · ∇ × E and Faraday’s law, ∇ × H = −∂t B, we obtain H · ∂t B + E · ∂t D = −J · E − ∇ · (E × H).

(2.67)

Equation (2.67) is the content of Poynting’s theorem; the vector E × H is known as the Poynting vector. If there is a function u(x, t) such that ∂ t u = H · ∂t B + E · ∂ t D

(2.68)

∂t u = −J · E − ∇ · (E × H)

(2.69)

then can be interpreted as a balance of energy where u is the energy density, J · E is the power (dissipated) per unit volume and E × H is the energy flux vector. In integral form, we have d ∫ u dv = − ∫ J · E dv − ∫ E × H · n da. dt   ∂ The interpretation of the Poynting vector as an energy flux is subject to the existence of the function u. Now, if D = E,

B = μH,

where , μ are possibly functions of the position, then u(x, t) is given by u = 21 ( E2 + μH2 ). Equation (2.69) is often regarded as a balance law for electromagnetic energy. Yet Eq. (2.69) is not a balance law; it is an identity satisfied by all fields that are solutions of Maxwell’s equations provided u satisfies (2.68).

2.16 Balance Equations in Electromagnetism Balance equations for electromagnetic materials show different formulations in the literature. A formal apparent reason is due to the different formulations of Maxwell’s equations (Minkowski, Chu, Amperian). Yet even in a selected formulation, like the

2.16 Balance Equations in Electromagnetism

145

Minkowski one adopted here, subjective positions occur about the pertinent roles of fields in matter. Hence, we first motivate the present selection of fields in matter in connection with the basic balance laws. Look at the Lorentz force on electric charges. In free space, the Lorentz force (2.65) is well established; possibly one might write F = q(E + v × μ0 H) thanks to the equality B = μ0 H in free space. Yet in matter B = μ0 H and hence the two formulae F = q(E + v × B) and F = q(E + v × μ0 H) are inequivalent. Following the historical, experimental, and conceptual development of Maxwell’s equations, it emerges that B and E are physically the fundamental fields ([168], Sect. 36.6). Furthermore, we may take the view that, owing to the elementary structure of charged particles (protons or electrons), the particles are subject to the laws valid in the free space and hence (2.65) are maintained in matter. This view is consistent, e.g. with the model of ferromagnetism in connection with the spinning electron; while the torque on the spinning electron is taken to obey (2.65) then the evolutions of the magnetic dipole m is assumed to be governed by (e.g. [235], Sect. 12.3) ˙ = γm m × B. m The question now emerges as to the appropriate or effective magnetic field in magnetic media (M = 0). Maxwell’s equations in the form Electrostatics Static ferromagnetism ∇ ·D=q ∇ ·B=0 ∇×E = 0 ∇×H = 0 show mathematical correspondences between the two sets of equations with the analogy between E and H ([168], p. 36.12) We observe that while E + v × B is the electric field in the rest frame of the body, E = E + v × B, the magnetic intensity in the rest frame is H = H − v × D. Hence, we might pursue the analogy between E and H by deriving the force on a magnetic charge q∗ in the form E. F P = qE F M = μ0 q∗H , Yet electric monopoles exist in the form of particles that have a positive or negative electric charge. The analogy breaks down when we try to find the magnetic counterpart for the electric charge. While we can find electric monopoles in the form of charged particles, we have never observed magnetic monopoles. Magnets exist only in the form of dipoles with a north and a south end. When a bar magnet is split into two pieces, we don’t get a separate north part and a south part. Rather we get two new, smaller magnets, each with a north and south end. Nevertheless, we pursue the analogy in a mathematical way so as to derive forces and torques on electric and magnetic dipoles and then to determine forces and torques associated with the polarization P and the magnetization M. The present approach is then based on this assumption.

146

2 Balance Equations

Assumption. In matter and in free space, an electric charge q experience the Lorentz force in the form F = q(E + v × B), while a magnetic charge q∗ experience the force F = μ0 q∗ (H − v × D).

2.16.1 Forces, Torques, and Energy Supply To determine the force, per unit volume, due to polarization, we start with the force on a single electric dipole. Following the model of the electric dipole, let −q be the negative (point) charge, at x, and q the positive charge, at x + d. We denote the ˙ By evaluating the Lorentz force on the velocities at x and x + d by v and v + d. charges of the dipole, we have F− = −q E(x) − q v × B(x), ˙ × B(x + d). F+ = q E(x + d) + (q v + q d) Evaluate F+ up to linear terms in d. The sum of the two forces F+ + F− , at x, then becomes F+ + F− = (pp · ∇)E + v × (pp · ∇)B + p˙ × B, where p = q d is the electric dipole moment. We let p be the average of p , that is p = pp = qd, and define the dielectric polarization as P = np = nqd. ˙ = d˙and hence Moreover, we let d ˙ = n p˙ = n(P/n)˙ = P˙ − P n/n. nqd ˙ ˙ = ρ/ρ ˙ then we can write Since, by assumption, n/n ˙ = ρπ, nqd ˙

π := P/ρ.

The force per unit volume is denoted by f P ; by definition, we let f P := nF+ + F− . Hence, f P can be given the form21 f P = (P · ∇)E + v × (P · ∇)B + ρπ˙ × B.

(2.70)

Equation (2.70) coincides with the analogous Eq. (5.7a) of [354], except for μ0 H in place of B due to the choice of the Chu formulation.

21

2.16 Balance Equations in Electromagnetism

147

The use of B, instead of μ0 H as is done in [354], allows f P to be written in terms of the fields E , B at the rest frame. Since v × (P · ∇)B = P · ∇(v × B) − [(P · ∇)v] × B and π = P/ρ then f P = (P · ∇)E + P · ∇(v × B) − [(P · ∇)v] × B + ρπ˙ × B 

E + ρ(π˙ − Lπ) × B = (P · ∇)E E + ρ π × B. = (P · ∇)E Also, we observe that 



ρ π= ρ(P/ρ)˙− LP =P and hence



E + P × B. f P = (P · ∇)E Consequently, we can write the force density f P in terms of the Oldroyd and the Truesdell rates 



E + ρ π × B = (P · ∇)E E + P × B. f P = (P · ∇)E As a comment, the vector (P · ∇)E is often referred to as Kelvin polarization force density.22 The representation (2.70) of the polarization force density comprises also the term ρπ˙ × B which is of interest in time-dependent fields. Similar results occur in the literature, e.g. in the form [255] F = (p · ∇)E + p˙ × B. The torque on an electric dipole is T + = d × F+ and hence, to within linear terms in p T + = p × E + p × (v × B). The corresponding torque per unit volume, T+ , τ P := nT is given by τ P = P × (E + v × B) = P × E . Likewise, the vector μ0 (M · ∇)H, we will find shortly, is referred to as Kelvin magnetization force density. 22

148

2 Balance Equations

A magnetic dipole moment is thought to be generated by two possible mechanisms. One way is by a small loop of electric current, called an Ampèrian magnetic dipole. Another way is by a pair of magnetic monopoles of opposite magnetic charge, called a Gilbertian magnetic dipole,23 though elementary magnetic monopoles seem to be so far unobserved. To determine body forces and torques due to magnetization, we consider Gilbertian dipoles and then proceed by analogy with the electric polarization by computing the force and the torque on a magnetic charge q∗ by starting with F = q∗H = q∗ (H − v × D); Hence, on a magnetic dipole, we have the forces F− (x) = −q ∗ H(x) + q∗ v × D(x), ˙ × D(x + d). F+ (x) = q ∗ H(x + d) − (q∗ v + q∗ d) ˙ it follows that Within linear terms in d, d, m · ∇)H − v × (m m · ∇)D − q∗ d˙ × D, F+ + F− = (m where m = q∗ d is the magnetic dipole moment. We then define the magnetic polarization M such that m = nq∗ d. μ0 M = nm ˙ = d˙, we have Hence, letting d ˙ − μ0 M n˙ . nq ∗ d˙ = n(μ0 M/n)˙ = μ0 M n Further, since n/n ˙ = ρ/ρ, ˙ then we have ˙ nq∗ d˙ = ρμ0 m,

m :=

M . ρ

Define the magnetic force per unit volume, f M , in the form f M := F+ + F− . Hence we obtain ˙ × D. f M = μ0 (M · ∇)H − μ0 v × (M · ∇)D − ρμ0 m

23

Due to William Gilbert’s modelling of elementary magnetism in sixteenth century.

(2.71)

2.16 Balance Equations in Electromagnetism

149

Again, we can express the force density in terms of the fields at the rest frame. Since H − μ0 v × D = H , we obtain H − ρμ0 (m ˙ − Lm) × D. f M = μ0 (M · ∇)H Hence, it follows that 



H − ρμ0 m × D = μ0 (M · ∇)H H − μ0 M × D , f M = μ0 (M · ∇)H 



where m is the Oldroyd rate and M is the Truesdell rate. ˙ = ∂t m and As to (2.71), we observe that if v = 0, then m f M = ρμ0 (m · ∇)H − ρμ0 ∂t m × D is the expression of the force on a magnetic dipole at rest in the laboratory frame. Restrict attention to free space, where H = B/μ0 and D = 0 E. Consequently, we can write f M in the form f M = ρμ0 (m · ∇)H − ρ 0 μ0 ∂t m × E. Otherwise, in light of Maxwell’s equation ∇ × H = J + ∂t D, we have f M = ρμ0 (m · ∇)H − ρμ0 ∂t (m × D) + ρμ0 m × (J + ∇ × H). If, further, J = 0 and ∇m = 0, then it follows f M = ρμ0 ∇(m · H) − ρμ0 ∂t (m × D). In free space, this relation can be written as f M = ρ∇(m · B) − ρ 0 μ0 ∂t (m × E). This result establishes a direct connection with a representation of the force often given in the literature. Since magnetic dipoles tend to align in the direction of the magnetic field, the potential energy is assumed to be U = −m · H and then −∇U = ∇(m · H) is viewed as the force on m. The additive term −ρ 0 μ0 ∂t (m × E) is then regarded as hidden momentum of the magnetization24 m. If the body is also an electric conductor, then we may view the charges as subject to the Lorentz force. Let v be the velocity of body, at the pertinent point, and vc the velocity of the charges relative to the body. Hence, we write the Lorentz force in the form E + J × B, q[E + (v + vc ) × B] = q(E + v × B) + qvc × B = qE 24

Usually with respect to the magnetic moment m ; see, e.g. [255, 396, 430].

150

2 Balance Equations

q being the charge density and J the current density relative to the reference at rest. The force FJ associated with the current density field J in the region  is then FJ = ∫ J × B dv. 

Correspondingly, the torque TJ can be written in the form T J = ∫ x × (J × B)dv. 

Very often the literature25 takes the force, on the magnetic dipole with moment m · B) on the basis of the view that the potential energy is m , in the form F = ∇(m m · B and the force then would be −∇U . If m is independent of the position U = −m and B is irrotational, then m · ∇)B. F = ∇(mi Bi ) = mi ∇ Bi = mi ∂xi B = (m If instead we start from f = μ0 ∇(M · H), then it is very restrictive to assume M as independent of the position (and H or B as irrotational). To obtain the expression for the torque Tm on a magnetic dipole, we merely observe that the magnetic moment shows the tendency of aligning with the applied magnetic field. Hence, we assume a magnetic dipole m at rest in matter is subject to the torque Tm = μ0m × H . The torque per unit volume, due to magnetization, is then Tm  = μ0 M × H . τ M = nT Thus, the total torque per unit volume is τ = P × E + μ0 M × H . The computation of power on electric and magnetic dipoles is deeply dependent on the physical modelling. Start with electric dipoles and look at a dipole as two ˙ Hence, we charges, −q and q, displaced by d, moving with velocities v and v + d. write the power, on a dipole, as

25

E.g. [230], p. 185.

2.16 Balance Equations in Electromagnetism

151

˙ F− · v + F+ · (v + d)

˙ × B(x + d)] · (v + d) ˙ = −q[E(x) + v × B] · v + q[E(x + d) + (v + d) ˙ = qE(x) · d˙ + q(d · ∇)E(x) · v + q[(d · ∇)E] · d. ˙ the limit as d, d˙ → 0 with qd and qd˙ fixed shows The last term is nonlinear in d, d; that the third term is negligible. With this view, we then obtain ˙ = E · p˙ + [(p · ∇)E] · v. F− · v + F+ · (v + d) Define the power per unit volume, w P , by ˙ w P = nf − · v + f + · (v + d). Hence, we obtain

 w P = ρ E · P/ρ)˙+ [(P · ∇)E] · v.

The power on the magnetic dipoles is sometimes taken as [354]  w M = ρμ0 H · M/ρ)˙+ [μ0 (M · ∇)H] · v  by appealing to similarity. In addition to similarity with ρ E · P/ρ)˙, the power ρμ0 H · m · H is the energy of the dipole moment m M/ρ)˙is also justified by the view that −m ˙ is the power expended to vary the moment m . Further in the field H and then H · m ˙  = m m˙ = μ0 ρ(M/ρ)˙ m  and then we find the power ρμ0 H · M/ρ)˙. To sum up, we represent the body forces f P , f M , the body couples τ P , τ M , and the powers w P , w M in the form ˙ × D, f P = (P · ∇)E + v × (P · ∇)B + ρπ˙ × B, f M = μ0 (M · ∇)H − μ0 v × (M · ∇)D − ρμ0 m

τ P = P×E, ˙ w P = [(P · ∇)E] · v + ρE · π,

τ M = μ0 M × H , ˙ w M = μ0 [(M · ∇)H] · v + μ0 ρH · m.

To complete the scheme, we determine the force and power associated with the electric current. The free charges, with charge density q, are taken to flow with velocity vc , relative to the body. Hence, we let J = q(v + vc ) be the current density. By the Lorentz force, the free charges are then subject to a force density E + J × B, fc = q[E + (v + vc ) × B] = qE

152

2 Balance Equations

where J = J − qv = qvc . The power density wc is obtained via the observation wc = q[E + (v + vc ) × B] · (v + vc ) = qE · (v + vc ) = qE · v + qE · vc . E · v + E · J , then it follows Since qE · v + qE · vc = qE E · v + E · J − qE E · v − (E + v × B) · J = E · J . wc − fc · v = qE To state the balance of angular momentum, we observe that, since no rotational structure occurs, then we require that the total torque vanishes, namely τ mech + P × E + μ0 M × H = 0 or skwT = skw(P ⊗ E + μ0 M ⊗ H ). We can then state the balance equations of linear momentum, angular momentum, and energy in the form ρ˙v = ∇ · T + ρb + (P · ∇)E + v × (P · ∇)B + ρπ˙ × B E + J × B, ˙ × D + qE +μ0 (M · ∇)H − μ0 v × (M · ∇)D − μ0 ρm skwT = skw(P ⊗ E + μ0 M ⊗ H ),

(2.72) (2.73)

˙ ρ(ε + 21 v2 )˙ = ρb · v + ∇ · (vT) − ∇ · q + ρr + ρE · π˙ + μ0 ρH · m E · v + E · J , (2.74) +[(P · ∇)E] · v + μ0 [(M · ∇)H] · v + qE where b is the mechanical force per unit mass. In light of (2.72), we obtain26 H·m E · π˙ + μ0 ρH ˙ + T · L − ∇ · q + ρr. ρε˙ = E · J + ρE

(2.75)

Equation (2.75) involves the electric field E and the magnetic intensity H at the frame Fˆ locally at rest with the body. Also, J = qvc is the current density relative to Fˆ . By analogy with the first principle (2.19) and the first law (2.22), we may view (2.74) as a principle of conservation of energy (mechanical + electromagnetic). Once the consequences of (2.72) are evaluated, the balance law of electromagnetic energy (2.75) follows. For undeformable media, at rest, Eq. (2.75) simplifies to ˙ · H − ∇ · q + ρr. ρε˙ = E · J + P˙ · E + μ0 M

26

(2.76)

Equation (2.75) coincides with the analogous one in [354] and is in agreement with that assumed in [357], Eq. (7.35).

2.16 Balance Equations in Electromagnetism

153

Quite often, the balance of energy for undeformable media is taken as a direct consequence of Poynting’s theorem whereby E × H is viewed as the energy flux of electromagnetic character. With this view, the balance is expressed by u˙ 0 = −∇ · (q + E × H) + ρr, where u 0 is the energy per unit volume and ρ is a constant; u 0 is the energy density (per unit volume) free from the kinetic energy 21 ρv2 . The identity ∇ · (E × H) = H · ∇ × E − E · ∇ × H and Maxwell’s equations yield ˙ − H · B. ˙ ∇ · (E × H) = −E · J − E · D Hence, the balance of energy becomes ˙ + H · B˙ − ∇ · q + ρr. u˙ 0 = E · J + E · D

(2.77)

Now, substitution of D = 0 E + P,

B = μ0 (H + M)

and some rearrangements give (2.76) provided ρε and u 0 are related by ρε = u 0 − 21 [ 0 E2 + μ0 H2 ].

(2.78)

Hence, (2.76) and (2.77) are equivalent provided ε and u 0 are connected by (2.78). This result holds irrespective of any constitutive equation for the energy density. We may then view Eq. (2.77) as the balance of the internal + electromagnetic energy and (2.76) as the balance of internal energy. An alternative form of Poynting’s theorem is based on the form (2.62)–(2.63) E × H ) = H · ∇ × E − E · ∇ × H , by (2.62)– of Maxwell’s equations. Since ∇ · (E (2.63), we obtain 



E· D . E · J − H · B −E ∇ · (E × H) = −E If the balance of energy is assumed in the form ρε˙ = ρr − ∇ · (q + E × H ) + ..., the dots representing powers of different character, then we have 



E · D +..., ρε˙ = ρr − ∇ · q + E · J + H · B +E

154

2 Balance Equations

which is inequivalent to (2.76) and (2.75). This is a further indication that Poynting’s theorem has not to be viewed as a balance law.

2.16.2 Angular Momentum and Magnetic Moment By (2.73), we infer that a magnetically polarized body experiences a torque μ0 M × H ; a dual behaviour happens for an electrically polarized body under the action of an electric field. The body is then in (rotational) equilibrium if a mechanical torque is applied so that the resulting torque vanishes τ mech + μ0 M × H = 0. If the mechanical torque is produced by a deformation, then we might observe that, at equilibrium, a magnetic field produces a deformation. The Einstein–de Haas effect [143] arises from experimental observations where a change in the applied magnetic field causes the body to rotate. In the classical experiment, a suspended (ferromagnetic) cylindrical specimen is subject to the magnetic field produced by a magnetizing coil. The magnetic field is along the axis of the cylinder as well as the magnetic moment M. Hence, skw(M ⊗ H ) = 0 and no rotation is caused by the applied magnetic field. We then look for an explanation by means of microscopic considerations. The (total) angular momentum L is the sum of contributions from spin, orbit, and crystal lattice motions L = L spin + L orbit + L lattice . The observation shows the changes of L lattice . Likewise, the magnetic moment M consists of three terms M = Mspin + Morbit + Mlattice . It is assumed that the lattice contribution is negligible because of the relatively large mass, and too slow rotation, of the positive ions [256]. Let L and M be parallel and denote by L, M the components in the common direction. First observe that M = Mspin + Morbit + Mlattice  Mspin + Morbit and L = Lspin + Lorbit + Llattice . The magnetization M is measured through a pickup coil around the specimen, the angular momentum is measured through the angular velocity produced. Hence, measures are possible of the ratio

2.16 Balance Equations in Electromagnetism

λ=

155

Mspin + Morbit Mspin + Morbit = . −Llattice Lspin + Lorbit

Elementary arguments suggest that the ratio equals e/2m, the ratio between the charge and twice the mass of electrons. Instead, the measure of λ=

ge 2m

indicates that g = 2; g is called the magneto-mechanical factor (or Landé g-factor) and λ the gyromagnetic ratio. If we let Lorbit /Lspin = 2 , Morbit /Mspin = , then we have λ=

Mspin 1 + Morbit /Mspin ; Lspin 1 + Lorbit /Lspin

As  1, we find

e 1+

ge = . 2m m 1 + 2

g  2(1 − ).

If only the spin term occurs, then g = 2.

2.16.3 Maxwell Stress Tensor The Maxwell stress tensor is a second-order tensor used in classical electromagnetism to describe the electromagnetic forces in terms of a tensor in a way similar to the mechanical contact forces represented by the Cauchy stress tensor. In free space, the (Lorentz) force on the charge q is E = q(E + v × B) qE and hence the (electromagnetic) force fc per unit volume is given by fc = nq(E + v × B) = qE + J × B, where J = qv. By Gauss’ and Ampère’s laws, q and J can be written q = 0 ∇ · E, Hence

J=

1 ∇ × B − 0 ∂t E. μ0

156

2 Balance Equations

fc = 0 (∇ · E)E +

1 (∇ × B) × B − 0 ∂t E × B. μ0

In light of Faraday’s law, we find ∂t (E × B) = ∂t E × B + E × ∂t B = ∂t E × B − E × (∇ × E). Consequently, we have fc = 0 (∇ · E)E +

1 (∇ × B) × B − 0 ∂t (E × B) − 0 E × (∇ × E). μ0

Since ∇ · B = 0, then we can formally add a zero term (∇ · B)B/μ0 to obtain fc = 0 [(∇ · E)E − E × (∇ × E)] +

1 [(∇ · B)B − B × (∇ × B)] − 0 ∂t (E × B). μ0

For any vector w, the identity w × (∇ × w) = ∇ 21 w2 − (w · ∇)w allows us to obtain (∇ · w)w − w × (∇ × w) = −∇ 21 w2 + ∇ · (w ⊗ w), where ∇ · (w ⊗ w)|i = ∂x j (wi w j ). Consequently, we can write fc in the form fc = 0 [−∇ 21 E2 + ∇ · (E ⊗ E)] +

1 [−∇ 21 B2 + ∇ · (B ⊗ B)] − 0 ∂t (E × B). μ0

Letting σ := 0 (E ⊗ E − 21 E2 1) +

1 (B ⊗ B − 21 B2 1) μ0

we find that fc can be written fc = ∇ · σ − 0 ∂t (E × B).

(2.79)

The tensor σ is referred to as the Maxwell stress tensor. The reason for the term “stress tensor” is due to the form of Eq. (2.79). If 0 E × B is viewed as the momentum density of the electromagnetic field, then

0 ∂t (E × B) = −fc + ∇ · σ,

2.16 Balance Equations in Electromagnetism

157

might be interpreted as the balance of linear momentum. Yet it is rather odd that −fc instead of fc occurs in the balance of linear momentum with σ as the stress tensor. It is perhaps wiser to regard (2.79) as an identity. Incidentally

0 E × B = 0 μ0 E × H,

H = B/μ0 ,

is usually regarded as 0 μ0 times the energy flux associated with the electromagnetic field (Poynting’s theorem).

Maxwell Stress Tensor in Matter In matter, we need to account also for the forces f P and f M . Let f = fc + f P + f M . As the simplest approximation, we let f = qE + J × B + (P · ∇)E + μ0 (M · ∇)H be the whole electromagnetic body force (per unit volume). Since ∇ · D = q and D = 0 E + P we obtain qE + (P · ∇)E = ∇ · (E ⊗ D) − 0 (E · ∇)E. By a direct calculation, for any triplet of vector functions u, v, w, it follows from ∇×u = w that ∂xk u i = ∂xi u k − ikp w p ,

vk ∂xk u i = vk ∂xi u k − (v × w)i .

If we let u = E, v = E, and w = −∂t B, then it follows from Faraday’s law ∇ × E = −∂t B that (E · ∇)E = 21 ∇ E2 + E × ∂t B. As a consequence qE + (P · ∇)E = ∇ · [E ⊗ D − 21 0 E2 1] − 0 E × ∂t B. Now examine the magnetic force. By Ampère’s law, we can write J × B = [∇ × (B/μ0 − M) − ∂t D] × B. Since [∇ × (B/μ0 − M) × B]i =

1 Bk (∂xk Bi − ∂xi Bk ) + Bk (∂xi Mk − ∂xk Mi ) μ0

158

2 Balance Equations

it follows [J × B + μ0 (M · ∇)H]i = {∇ · [

1 (B ⊗ B − 21 B2 1) − M ⊗ B + M · B 1]}i μ0 −Mk (∂xi Bk − μ0 ∂xk Hi ) − (∂t D × B)i

Accordingly, we have 1 ˜ f = ∇ · [E ⊗ D − 21 0 E2 1 + (B ⊗ B − 21 B2 1) − M ⊗ B + M · B 1 + 21 μ0 M2 1] + f, μ0

where

(2.80)

f˜ = − 0 E × ∂t B − μ0 Mk (∇ Hk − ∂xk H) − ∂t D × B.

For a polarizable and magnetizable solid, the electromagnetic force fem can be written as the divergence of a (stress) tensor ([260], Sect. 56) in static conditions (∂t D = 0, ∂t B = 0) provided Mk (∇ Hk − ∂xk H) = 0. In non-polarizable and nonmagnetizable media (P = 0, M = 0), Eq. (2.80) reduces to (2.79). As a formal particular case, we now look for a representation of f if (P · ∇)E and μ0 (M · ∇)H are neglected. Consider f = fc = qE + J × B. We replace q = ∇ · D,

J = ∇ × H − ∂t D.

and use Maxwell’s equations to obtain f = (∇ · D)E + (∇ × H) × B − D × (∇ × E) − ∂t (D × B). In view of the identity (∇ × u) × w = (w · ∇)u − wk ∇u k for any pair of vectors u, w, the body force f can be written f = (∇ · D)E + (D · ∇)E − Dk ∇ E k + (B · ∇)H − Bk ∇ Hk − ∂t (D × B). Hence, since ∇ · B = 0, we have f = ∇ · (E ⊗ D + H ⊗ B) − Dk ∇ E k − Bk ∇ Hk − ∂t (D × B). If, further, the linear relations B = μH,

D = E

(2.81)

2.16 Balance Equations in Electromagnetism

159

hold, where μ and are constants, then Dk ∇ E k + Bk ∇ Hk = ∇ 21 (D · E + B · H). Subject to linearity, we can then write f = ∇ · {E ⊗ D + H ⊗ B − 21 (D · E + B · H)1} − ∂t (D × B).

(2.82)

Letting σ = E ⊗ D + H ⊗ B − 21 (D · E + B · H)1 we can write (2.82) in the form f = ∇ · σ − ∂t (D × B). Again, as expected, Eq. (2.82) reduces to (2.79) if D = 0 E, B = μ0 H. The tensors σ shown in (2.80) and (2.82) are different forms of the Maxwell stress tensor in matter, subject to different models of f. It is worth remarking that the result (2.82) coincides with the result obtained within the relativistic four-dimensional setting (see, e.g. [237], Sect. 2.12); in that case the linearity is expressed by G μν being a constant multiple of F μν . The stress tensor is also found if f = qE + J × B + Pk ∇ E k + μ0 Mk ∇ Hk . Since Pk ∇ E k + Mk ∇ Bk = Dk ∇ E k − 0 E k ∇ E k + Bk ∇ Hk − μ0 Hk ∇ Hk then adding Dk ∇ E k − 0 E k ∇ E k + Bk ∇ Hk − μ0 Hk ∇ Hk = fc to the right-hand side of (2.81) we find f = ∇ · {E ⊗ D + H ⊗ B} − ∇ 21 ( 0 E2 + μ0 H2 ) − ∂t (D × B). Hence, the Maxwell stress tensor would be given by σ = E ⊗ D + H ⊗ B − 21 ( 0 E2 + μ0 H2 )1. The stress tensor (2.83) is considered in [144] where it is claimed that f = ∇ · σ = (P · ∇)E + μ0 (M · ∇)H,

(2.83)

160

2 Balance Equations

that is the Kelvin expression for the force. As is known, this is true if there is no free charge (q = 0, J = 0) and B, D are time independent, which implies ∇ × E = 0, ∇ × H = 0. Accordingly ∂x j σi j = −( 0 E k ∂xi E k + μ0 Hk ∂xi Hk ) + D j ∂x j E i + B j ∂x j Hi = −( 0 E k ∂xk E i + μ0 Hk ∂xk Hi ) + D j ∂x j E i + B j ∂x j Hi = Pk ∂xk E i + μ0 Mk ∂xk Hi .

2.16.4 Balance of Entropy Consistent with the general form (2.39), we still state the balance of entropy in the form ρr ≥ 0, ρη˙ + ∇ · j − θ and again, for formal convenience, the entropy flux j is split in the form j = q/θ + k. As the second law of thermodynamics, we assume that, for every admissible thermodynamic process, the balance of entropy must hold for all times t and points x. Since 1 ρθη˙ + ∇ · q − ρr − q · ∇θ + θ∇ · k ≥ 0, θ by compatibility with the balance of energy, we evaluate ∇ · q − ρr from (2.75) to obtain 1 E · π˙ + ρμ0H · m ˙ + E · J + T · L − q · ∇θ + θ∇ · k ≥ 0. ρθη˙ − ρε˙ + ρE θ In terms of the free energy ψ = ε − θη, we can write the (second law) inequality as 1 ˙ + ρE E · π˙ + ρμ0H · m ˙ + E · J + T · L − q · ∇θ + θ∇ · k ≥ 0. − ρ(ψ˙ + η θ) θ (2.84) In rigid bodies, at rest, we can take E = E, H = H, J = J and π˙ =

1˙ P, ρ

˙ = m

1 ˙ M. ρ

Moreover we have L = 0 and hence the second law inequality simplifies to ˙ + E · P˙ + μ0 H · M ˙ + E · J − 1 q · ∇θ + θ∇ · k ≥ 0. − ρ(ψ˙ + η θ) θ

(2.85)

2.17 Conservation Laws Across a Singular Surface

161

2.17 Conservation Laws Across a Singular Surface Look at a conservation law in the general form d ∫ φ dv = ∫ β dv + ∫ h n da, dt Pt Pt ∂Pt

(2.86)

where h is a vector if φ is a scalar and a (second-order) tensor if φ is a vector. Hence, h n stands for h · n or the tensor composition according as φ is a scalar or a vector. The region Pt is allowed to be divided into two subregions Pt+ , Pt− by a surface σ(t) which is singular with respect to φ. Let σ(t) ˆ = σ(t) ∩ Pt . Hence, by (1.38) with  ˆ we obtain and σ replaced by Pt and σ, ∫ {∂t φ + ∇ · (φv)}dv + ∫[[φ(ν − v)]] · m da = ∫ β dv + ∫ h n da. (2.87)

Pt \σˆ

σˆ

Pt \σˆ

∂Pt

Let ∂t φ, ∇ · (φv), and β be bounded in Pt \ σˆ and h bounded at the lateral surface ˆ with of the pillbox. By the arbitrariness of Pt , we can take Pt as a pillbox across σ, one face in Pt+ and the other one in Pt− . At the limit of vanishing thickness of the ˆ Hence equation (2.87) reduces to pillbox, ∂Pt reduces to the + and − faces of σ. ∫[[h m + φ(ν − v) · m]] da = 0. σˆ

The arbitrariness of σˆ and the continuity of the integrand imply that [[h m + φ(ν − v) · m]] = 0. By (1.39), we can write [[h]]m + [[U φ]] = 0,

(2.88)

U = (ν − v) · m being the local speed of propagation of σ. At a shock front [[v · m]] = 0 and hence [[U ]] = 0. As a consequence, (2.88) holds with different values of U according as the + and − sides of the surface are considered. At a weak singularity, instead, we have [[v]] = 0 and hence [[U ]] = 0, which allows (2.88) to be written in the form (2.89) [[h]]m + U [[φ]] = 0. We can then state these results by Theorem 2.3 (Kotchine) Given the conservation law (2.86), if ∂t φ, ∇ · (φv), β and h are bounded in their domains then, at a singular surface, φ and h are required to satisfy (2.88). For weak singularities, [[U ]] = 0 and then (2.88) simplifies to (2.89).

162

2 Balance Equations

2.17.1 Imbalance Laws Across a Singular Surface We now consider balance laws expressed by inequalities, say imbalance laws. The generic form (2.86) is then replaced with d ∫ φ dv ≥ ∫ β dv + ∫ h n da. dt Pt Pt ∂Pt

(2.90)

A remarkable example of (2.90) is the balance (or imbalance) of entropy where φ = ρη, β = ρr/θ, and h n = −j · n while j = q/θ + k. By the theorem on the transport of discontinuous fields, see (1.38), we have the kinematic relation d ∫ φ dv = ∫ {∂t φ + ∇ · (φv)}dv + ∫[[φ(ν − v)]] · m da. dt Pt \σ σ Pt \σ In light of (2.90), we have ∫ {∂t φ + ∇ · (φv)}dv + ∫[[φ(ν − v)]] · m da ≥ ∫ β dv + ∫ h n da. σ

Pt \σ

Pt

∂Pt

Hence, reducing Pt to a pillbox around σ while ∂t φ, ∇ · (φv), and β are bounded, we have ∫[[φ(ν − v)]] · m da ≥ − ∫[[h]] · m da + w(Pt ), σ

σ

where w(Pt ) approaches zero when the thickness of the pillbox approaches zero. Taking the limit, we have ∫{[[φ(ν − v) + h]] · m da ≥ 0 σ

By the arbitrariness of σ, we find the local inequality [[φ(ν − v) + h]] · m ≥ 0. In the remarkable case of the second law (imbalance of entropy), it follows that [[ρη(v − ν) + j · m ≤ 0, where j = q/θ + k.

2.17 Conservation Laws Across a Singular Surface

163

2.17.2 Jump Conditions The balance equations for mass, linear momentum, and energy correspond to φ h Mass ρ 0 Linear momentum ρv T Energy ρ(ε + v2 /2) −q + Tv Hence, we find the consequences of (2.88). The balance of mass gives [[ρU ]] = 0, which means that ρU is constant across a discontinuity surface. By the balance of linear momentum, we have [[T]]m + ρU [[v]] = 0. At a weak singularity, where [[v]]= 0, it follows that [[T]]m = 0, which is usually referred to as Poisson’s condition. At a shock wave in inviscid fluids, we have ρU [[v]] − [[ p]]m = 0. By the balance of energy, it follows [[−q + Tv]] · m + ρU [[ε + 21 v2 ]] = 0. At a weak singularity, [[v]] = 0, [[T]]m = 0, and hence −[[q]] · m + ρU [[ε]] = 0. For definiteness, let j = q/θ be the entropy flux. The jump relation induced by the second law inequality is ρU [[η]] − [[q/θ]] · m ≥ 0.

164

2 Balance Equations

2.17.3 Rankine–Hugoniot Condition There are systems of conservation laws in the form ∂t φ + ∇ · F (φ) = f, where φ, f :  × R → Rn , F : Rn → Rn×3 in the unknown vector function φ. In component notation ∂t φ j + ∂xk F jk (φ) = f j ,

j = 1, ..., n.

They arise from the global balance laws d F + φ ⊗ v)n da. ∫ φ dv = ∫ f dv + ∫ (−F dt Pt Pt ∂Pt F + φ ⊗ v with h of (2.86) to obtain We then identify −F F ]]m + [[φ]]ν · m = 0. −[[F Since ν · m is the speed of displacement u n of σ, we can write F ]]m = u n [[φ]]. [[F

(2.91)

The set of jump relations (2.91) is usually referred to as the Rankine–Hugoniot jump condition.27 Equation (2.91) involves the speed of displacement u n which is the normal velocity of the singular surface relative to the chosen observer F and hence is dependent on F . Equation (2.89), instead, involves the local speed of propagation U that is invariant under change of the observer. In this regard, it is worth checking the effect of a change of the observer on Eq. (2.91). Let (2.91) be the equation associated with F and denote by x the position of a point relative to the origin of F . Let Fˆ be the observer such that the position vector is y = x − Vt. Accordingly, V is the velocity of Fˆ relative to F . The space-time coordinates are related by (x, t) ⇐⇒ (y, τ ),

t = τ , x = y + Vτ .

Hence ∂τ φ = ∂t φ + V · ∇y φ,

27

See, e.g., [273], Sect. 87.

∇y · F = ∇x · F .

2.17 Conservation Laws Across a Singular Surface

165

Relative to (y, τ ) the system of equations then becomes F − φ ⊗ V) = 0. ∂τ φ + ∇y · (F The corresponding jump condition is F − φ ⊗ V]]m = uˆ n [[φ]], [[F uˆ n being the speed of displacement relative to Fˆ . As a consequence, letting Vn = V · m, we have F ]]m = (uˆ n + Vn )[[φ]]. [[F We then conclude that the flux function F is dependent on the observer, F → F − φ ⊗ V, and that the speed of displacement is just the speed relative to the chosen observer. This is confirmed by the result that, as expected, u n = uˆ n + Vn .

2.17.4 Interaction Between Discontinuities of Different Order Balance equations of the form ρϕ˙ = β + ∇ · h, in the spatial description, have 1–1 correspondence with balance equations ρ R ϕ˙ = J β + ∇R · h R , in the material description, via the definition (2.26), h R := J hF−T . Accordingly, in the material description, balance laws can be written in the form ∂t φ + ∇R · F = f

(2.92)

where φ, f :  × R → Rn , F :  × R → Rn×3 and  ⊆ R; φ = ρ R ϕ, F = −J hF−T , f = J β. The arguments of the previous section apply again with the advantage that the body is at rest in , formally v = 0. Let m R be the unit normal to the singular surface in . Hence, the Rankine–Hugoniot condition F ]]m R = U R [[φ]] [[F

166

2 Balance Equations

holds with U R being the same relative to the two subregions. Letting F = F m R , we can write [[F]] = U R [[φ]] or F − − U R φ− = F + − U R φ+ .

(2.93)

To fix ideas, we say that [[φ]] = 0 represents a strong discontinuity. We have in mind the case where [[v]] = 0; hence, for brevity, we say that [[φ]] = 0 characterizes a shock. Denote by a superposed dot the time derivative relative to the reference at rest with the shock and let X be the coordinate in the direction of m R . Since U R is the speed of the shock, we have φ˙ = ∂t φ + U R ∂ X φ. Time (dot) differentiation of (2.93) results in F˙ − − U R φ˙ − = F˙ + − U R φ˙ + + U˙ R [[φ]], whence ∂t F− + U R ∂ X F− − U R (∂t φ− + U R ∂ X φ− ) = ∂t F+ + U R ∂ X F+ − U R (∂t φ+ + U R ∂ X φ+ ) + U˙ R [[φ]].

(2.94)

Weak discontinuities are characterized as those where φ is continuous, whereas ∂t φ, ∂ X φ suffer jump discontinuities. This is the case of acceleration (or acoustic) waves where v is continuous, while ∂t v, ∂ X v are not. To avoid ambiguities, here we denote by [ · ] the jump of weak discontinuities and hence [ φ]] = 0,

[ ∂t φ]] = 0.

Accordingly, if U is the speed of the surface then the kinematical condition of compatibility implies [ ∂t φ]] = −U [ ∂ X φ]]. The occurrence of a weak discontinuity perturbs the values ∂t F± , ∂ X F± ∂t φ± , ∂ X φ± , and U˙ R . If, e.g. a weak discontinuity occurs in the region ahead, then the associated contributions satisfy [∂t F+] = −U[[∂ X F+] = U[[∂t φ+] = −U 2 [∂ X φ+] The overall contribution of, e.g., the jump of ∂t F+ + U R ∂ X F+ − U R (∂t φ+ + U R ∂ X φ+ ) is given by [ ∂t F+ + U R ∂ X F+ − U R (∂t φ+ + U R ∂ X φ+ )]] = −(U − U R )2[ ∂ X φ+] .

2.17 Conservation Laws Across a Singular Surface

167

2.17.5 Eigenvector Form of Weak Discontinuities Go back to the system (2.92) of balance equations. In suffix notation, we have ∂t φi = ∂ X K Fi K + f i . Letting f be continuous, we can write [ ∂t φi ] + [ ∂ X K Fi K ] = 0. If F is a function of φ, then ∂ X K Fi K = ∂φ j Fi K ∂ X K φ j and hence [ ∂ X K Fi K ] = ∂φ j Fi K [ ∂ X K φ j ] . By the kinematical condition of compatibility and Maxwell’s theorem, we obtain −U[[∂ X φi ] = ∂φ j Fi K n K [ ∂ X φ j ] . Hence, any admissible weak discontinuity [ ∂ X φ]] is an eigenvector of ∂φF m R with eigenvalue −U . Denote by D the eigenvectors, namely + Dinc , D− k − , Dk +

are the incident eigenvector and the outgoing eigenvectors in the states behind (k− ) and ahead (k+ ). Hence, Eq. (2.94) becomes 

 2 2 2 ˙ k− ck− (Uk− − U R ) Dk− + k+ ck+ (Uk+ − U R ) Dk+ + [ U R ] [[φ]] = cinc (Uinc − U R ) Dinc ,

where [U˙ R ] denotes the jump of U˙ R induced by the weak discontinuities28 Upon the interaction of the shock U R , [[φ]] with the incident weak discontinuity cinc Dinc a number of emergent outgoing weak discontinuities ck−− Dk− , ck++ Dk+ arise. The solution + to this problem exists and is unique if the eigenvectors D− k− , Dk+ and the jump [[φ]] are n linearly independent vectors. As a consequence, for the existence and uniqueness, + there must be n − 1 (linearly independent) vectors D− k− , Dk+ . Moreover, the n − 1 ± speeds Uk± are required to satisfy Un−− ≤ ... ≤ U1− ≤ U R ≤ U1+ ≤ ... ≤ Un++ . These conditions are referred to as Lax inequalities. See [71], Sect. 15; applications are developed in [320, 321]. The jump of U˙ R at time t is denoted as U˙ R in [71]. If the interaction occurs at (X, t), we assume that φ(X, t+ ) − φ(X, t− ) = φ(X − , t) − φ(X + , t) [322].

28

168

2 Balance Equations

2.17.6 Jump Conditions of Integrals Let w be a scalar, vector or tensor function on R. Consider the time-dependent integral ∞

g(t) = ∫ A(s)w(t − s)ds, 0

where A(s) is a tensor-valued differentiable function. Hence, we may regard g as a functional of w, g(t) = G(wt ), wt (s) = w(t − s). The dependence of g on the position x ∈ Rt or x ∈ R, is understood. We let w suffer a jump at time t and evaluate the jumps [[g]](t), [[∂t g]](t). We replace the jump [[w]], at time t, with a smooth bounded function h with compact support in (t − δ, t]. We let h(τ ) → [[w]] as τ → t and look for the jumps [[g]](t), [[∂t g]](t) as δ → 0. Let w(τ ) = w0 (τ ) + h(τ ). Then the jump of g in passing from w0 to w is given by δ

G(w) − G(w0 ) = ∫ A(s)h(x, t − s)ds. 0

Furthermore

δ

∂t [G(w) − G(w0 )] = ∫ A(s)∂t h(t − s)ds. 0

Upon the observation ∂t h(t − s) = −∂s h(t − s) and an integration by parts, we find

δ δ ∂t [G(w) − G(w0 )] = −{ A(s)h(t − s) 0 − ∫ A (s)h(t − s)ds}. 0

Thus, as δ → 0, we find G(w) − G(w0 ) = 0,

∂t [G(w) − G(w0 )] = A(0)[[w]].

(2.95)

The argument can be applied with further integrations by parts to singularities of higher order [322]. We now apply these results to a singular surface for w. Select a point x of the body in the current configuration (or in the reference configuration) and let w− (x, τ ) := w(x− , τ ),

w+ (x, τ ) := w(x+ , τ ),

τ ≤ t,

x∓ being the limit point x as seen from behind or ahead of the discontinuity surface. Accordingly, we write

2.17 Conservation Laws Across a Singular Surface ∞

g− (x, t) = ∫ A(s)w− (x, t − s)ds, 0

169 ∞

g+ (x, t) = ∫ A(s)w+ (x, t − s)ds. 0

Hence, in view of (2.95), we find [[g]] = 0,

[[∂t g]] = A(0)[[w]].

(2.96)

2.17.7 Jump Conditions in the Reference Configuration The referential version of the conservation law (2.86) is obtained by the change of variable, x → X, and use of Nanson formula. It follows that d ∫ φ R dv R = ∫ β R dv R + ∫ h R n R da R , dt P P ∂P where φ R = J φ,

β R = J β,

(2.97)

h R = J h F−T .

By analogy with the spatial description, the region P is allowed to be divided in two subregions P+ , P− by a singular surface (t) which is singular with respect to φ R . ˆ directed from P− to P+ . ˆ = (t) ∩ P and denote by m R the unit normal to  Let  ˆ Hence, we have Also, denote by U R the velocity of . d ∫ φ R dv R = ∫ φ˙ R dv R + ∫ φ R− U R · m R da R dt P− P− ˆ  and

d ∫ φ R dv R = ∫ φ˙ R dv R − ∫ φ R+ U R · m R da R . dt P+ P+ ˆ 

Adding these relations, we find d ∫ φ R dv R = ∫ φ˙ R dv R + ∫[[φ R ]]U R · m R da R , dt P P ˆ  where [[φ R ]] = φ R− − φ R+ . As a consequence of (2.97), we obtain ∫(φ˙ R − b R )dv R + ∫[[φ R ]]U R · m R da R = ∫ h R m R da R . P

ˆ 

∂P

ˆ with one face in P+ and the other one in P− . Now we take P as a pillbox across  At the limit of vanishing thikness of the pillbox

170

2 Balance Equations

∫ h R m R d A → − ∫[[h R ]]m R da R

∂P

ˆ 

while, if φ˙ R − b R is bounded, the volume integral approaches zero. Hence we obtain ∫{[[φ R ]]U R · m R + [[h R ]]m R }da R = 0.

ˆ 

ˆ allows the following statement, that is the dual of Kotchine’s The arbitrariness of  theorem. Theorem 2.4 Given the conservation law (2.97), if φ˙ R and s R are bounded then, at a singular surface, φ R and h R are required to satisfy [[φ R ]]U R · m R + [[h R ]]m R = 0.

(2.98)

We now derive some consequences of (2.98). In the balance of mass φ = ρ and h = 0. Hence, (2.98) becomes [[ρ R ]]U R · m R = 0. Across moving surfaces, U R · m R = 0, the reference mass density ρ R is continuous, [[ρ R ]] = 0. Instead, [[ρ R ]] = 0 is allowed if the surface is stationary in the reference configuration, U R · m R = 0. In the balance of linear momentum φ = ρv and h = T and so φ R = ρ R v,

hR = TR .

Equation (2.98) becomes [[ρ R v]]U R · m R + [[T R ]]m R = 0. Since [[ρ R ]]U R · m R = 0, it follows that ρ R [[v]]U R · m R + [[T R ]]m R = 0. At a weak singularity, [[v]] = 0, we have [[T R ]]m R = 0, that is the dual of Poisson’s condition. Look at the balance of energy, where φ = ρ(ε + 21 v2 ) and h = −q + Tv. Hence φ R = ρ R (ε + 21 v2 ) and h R = −q R + vT R . Substitution in (2.98) gives ρ R [[ε + 21 v2 ]]U R · m R + [[vT R − q R ]] · m R = 0.

2.18 Balance Laws for Discontinuous Electromagnetic Fields

171

At a weak discontinuity [[vT R ]] · m R = 0 and the jump condition becomes ρ R [[ε]]U R · m R − [[q R ]] · m R = 0.

2.18 Balance Laws for Discontinuous Electromagnetic Fields The balance laws of electromagnetic fields are reviewed by letting a discontinuity surface ϒ occurring in a region  and a discontinuity line γ occurring in a surface S. Both  and S convect with the underlying body. We show how the starting global (integral) law results in a local (differential) equation along with a jump condition. Conservation of charge. The charge within a region  convecting with the body is constant d ∫ q dv = 0. dt \ϒ By (1.38), we have 0 = ∫ [∂t q + ∇ · (q v)]dv + ∫ [[q(ν − v)]] · nϒ da, ϒ

\ϒ

where ν is the velocity of ϒ, v is the velocity of material points. By the arbitrariness of  and ϒ, it follows that ∂t q + ∇ · (qv) = 0, in  \ ϒ

[[q(ν − v)]] · nϒ = 0, in ϒ.

Gauss’ law. The flux of D through any closed surface S equals the charge inside S. Let S = ∂ and let a surface ϒ occur with a surface charge density qS . The charge Q  inside  is given by Q  = ∫ q dv + ∫ q S da. \ϒ

ϒ

The divergence theorem in the generalized form (1.37) gives ∫ D · n da = ∫ ∇ · Ddv − ∫ [[D]] · nϒ da. \ϒ

∂

ϒ

Gauss’ law then means that ∫ ∇ · D dv − ∫ [[D]] · nϒ da = ∫ q dv + ∫ q S da.

\ϒ

ϒ

\ϒ

ϒ

Hence we have ∇ · D = q in  \ ϒ,

[[D]] · nϒ = −q S in ϒ.

172

2 Balance Equations

Conservation of magnetic flux. The flux of B through any closed surface is zero. The same procedure, as with Gauss’ law, gives [[B]] · nϒ = 0 in ϒ.

∇ · B = 0 in  \ ϒ,

Faraday’s law. If the surface S and the contour C = ∂ S are convecting with the body then the circulation of E + v × B around C is equal to minus the time derivative of the flux of B through S. Let γ be a discontinuity line, in S, moving with velocity ν. Faraday’s law reads ∫ (E + v × B) · t dl = −

∂S

d ∫ B · n da. dt S\γ

By Stokes’ theorem in the generalized form (1.52), we have ∫ (E + v × B) · t dl = ∫ [∇ × E + ∇ × (v × B)] · n da − ∫ [[E + v × B]] · tγ dl

∂S

γ

S\γ

while (1.53) and the condition ∇ · B = 0 in S \ γ yield d ∫ B · n da = ∫ [∂t B + ∇ × (B × v)] · n da + ∫[[B × (ν − v)]] · tγ dl. dt S\γ γ S\γ Hence, Faraday’s law results in ∫ (∇ × E + ∂t B) · n da − ∫ [[E + ν × B]] · tγ dl = 0, γ

S\γ

whence ∇ × E + ∂t B = 0 in S \ γ,

[[E + ν × B]] · tγ = 0 in γ.

Ampère–Maxwell law. The circulation of H − v × D around the contour C of a convecting surface S is equal to the electric current through S plus the time derivative of the flux of D through S. Let J be the electric current density vector. Because S is convecting, the net electric current through S is the flux of J − q v. Also, for generality, we allow a current density i, per unit length, to flow along a discontinuity line29 γ ⊂ S. Hence, we write the Ampère–Maxwell law in the form ∫ (H − v × D) · t dl = ∫ (J − q v) · n da + ∫ i · n dl +

∂S

S\γ

γ

d ∫ D · n da. dt S\γ

We may view γ as the intersection of S with a surface ϒ through which a surface current density flows. 29

2.18 Balance Laws for Discontinuous Electromagnetic Fields

173

By the generalized Stokes theorem (1.52) and the time derivative of a flux (1.53), we can write ∫ [∇ × (H − v × D)] · n da − ∫ [[H − v × D]] · tγ dl = ∫ (J − q v) · n da + ∫ i · n dl γ

∂S

γ

S\γ

+ ∫ [∂t D + ∇ × (D × v) + v∇ · D] · n da + ∫ [[D × (ν − v)]] · tγ dl. γ

S\γ

Upon use of the local Gauss law ∇ · D = q, it follows that ∇ × H − J − ∂t D = 0 in S \ γ,

[[H + D × ν]] · tγ + i · n = 0 in γ.

The whole set of Maxell’s equations is then obtained. We now examine the content of the jump conditions across surfaces or lines.

2.18.1 Boundary Conditions The jump conditions, derived above, at a discontinuity surface or line provide the appropriate conditions at the boundary surface or line of a body. To fix ideas, consider a surface ϒ, or a line γ, that is the common boundary between two regions R+ , R− . Denote by the subscripts − and + the limit values at ϒ, or γ, of any quantity evaluated from inside R− and R+ . Let n be the normal to ϒ, or γ, directed from R− to R+ . Gauss’ law implies that (D+ − D− ) · n = q S . The normal component of D changes discontinuously across the surface ϒ by an amount equal to the surface charge density. If q S = 0 then the normal component of D is continuous across the surface. The conservation of magnetic flux implies that B+ · n = B− · n. This signifies that the normal component of B is continuous across a surface. Faraday’s law implies that, across a line γ (E + ν × B)+ · t = (E + ν × B)− · t. If γ is fixed, ν = 0, the tangential component of E · t is continuous across γ. If ν = v, that is if γ is convected, then (E + v × B) · t is continuous. The Ampère–Maxwell law implies that (H + D × ν)+ · t − (H + D × ν)− · t = i · n.

174

2 Balance Equations

If, in particular, the line γ is fixed, ν = 0, then H+ · t − H− · t = i · n. We now look at ϒ as the interface between two different media. By (D+ − D− ) · n = q S ,

B+ · n = B− · n

we say that the normal component of B is continuous across the interface, whereas the jump of D · n equals the surface charge density qS . The line γ is any line along the interface and hence t is any unit vector in the (tangent plane to the) interface. If, further, JS = 0, then the conditions E+ · t = E− · t,

H+ · t = H− · t

hold. Owing to the arbitrariness of t, it follows that the tangential components of E and H are continuous across the interface between the two media. The effect of the surface current i can be better understood by letting nϒ be normal to the interface and t = n × nϒ , nϒ being directed from R− to R+ . Hence, the jump condition for H becomes (H+ − H− ) · t = i · n. Hence i · n = (H+ − H− ) · n × nϒ = nϒ × (H+ − H− ) · n. Since t and n are arbitrary, in the tangent plane to the interface ϒ, it follows that nϒ × (H+ − H− ) = i. Moreover, if E is the continuous tangential electric field at ϒ and i = σϒ E, then the boundary condition becomes nϒ × (H+ − H− ) = σϒ E. If the interface is fixed (ν = 0) and no surface charges and currents occur, then the boundary conditions become [[D]] · n = 0,

[[B]] · n = 0,

[[E]] · t = 0,

[[H]] · t = 0,

n being orthogonal to the interface and t being any direction parallel to the interface.

Part II

Constitutive Models of Simple Materials

Chapter 3

Generalities on Constitutive Models

The balance equations of a body form an under-determined differential system, insufficient to yield specific results unless further relations are supplied. The balance equations for a continuum (free from internal structures) are the continuity equation, the equation of motion, and the balance of energy, namely five equations, for the fourteen unknowns (mass density, velocity, symmetric stress, energy density, heat flux) in the pertinent space-time domain. The insufficiency of the balance equations to solve a dynamic problem is conceptual and is consistent with the fact that different material properties are expected to provide different responses. Mathematically the material properties of the body are expressed by constitutive equations, or constitutive assumptions, which provide a model of the material behaviour. Constitutive equations are not a mere mathematical model. They have to be physically admissible, and this is ascertained through the compatibility with the objectivity principle and the second law of thermodynamics. This chapter investigates both the contents of the objectivity and the second law of thermodynamics and also their application to the characterization of constitutive equations. The second law plays the role of selecting physically admissible processes and hence admissible constitutive properties. As a new approach, the entropy production is taken to be an unknown function to be determined as any other constitutive function.

3.1 Constitutive Equations A constitutive equation defines an ideal material which may represent a real material, within a reasonable approximation, in a region of validity of the pertinent variables (constitutive variables). In addition to insights from experimental data, constitutive equations are required to be admissible on physical grounds. Admissibility means consistency with objectivity and thermodynamic requirements. In addition, © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_3

177

178

3 Generalities on Constitutive Models

material invariance of a body requires that the response functions are form-invariant with respect to the group of transformations of the material frame of reference. Objectivity, or frame-indifference, means that the constitutive functions (or functionals) must be form-invariant under arbitrary rigid motions of the spatial frame of reference. Compatibility with thermodynamics means that the functions entering the constitutive equations must satisfy the second law of thermodynamics, subject to the constraints placed by the balance equations. Moreover a rule of equipresence is adopted in that at the outset all response functions (or functionals) are taken to depend on the same list of variables. Differences in the dependences are to be derived as a consequence of thermodynamics or motivated on the basis of the structure of the material. For materials which have memory from the past it is assumed that the constitutive variables at distant past do not affect appreciably the values of the constitutive functionals at present. This natural assumption is made formal by specifying the mathematical properties of the functionals characterizing the memory of the material. Simple materials were originally characterized by saying that the response (stress) functions at a point X, at the time t, is determined by the motion in a neighbourhood of X, up to the time t [426]. For generality and definiteness, we distinguish between materials with long memory and those with short memory. For long memory, simple materials are those with response functions, at X and at time t, that depend on the values of F, θ, ∇θ at X up to time t. Instead, for short memory simple materials are those with response functions, at X and at time t, that depend on the values of ˙ ∇ θ˙ at X at time t. Roughly, in materials with short ˙ θ, F, θ, ∇θ and the derivatives F, memory, the information of F, θ, ∇θ up to time t is reduced to the values of F, θ, ∇θ ˙ ∇ θ, ˙ at the time t− . ˙ θ, and of the time derivatives F, Objectivity and compatibility with thermodynamics are made precise and commented upon in the next sections.

3.2 Objectivity All the equations that are used in continuum physics must satisfy some properties of invariance. This means that they should describe a physical phenomenon in a unique way, yielding unique results, which are not dependent upon the frame of reference. The material properties of a body should not depend on the observer, no matter how he moves. However, the material properties may be affected by the motion of the body and hence it is assumed that the material properties are independent of the frame of reference within the set of Galilean frames. This independence of the frame of reference is made precise by the following Principle of objectivity. Constitutive equations must be invariant under changes of frame of reference; frames of reference F , F ∗ are related by a Euclidean transformation t ∗ = t − a, (3.1) x∗ = χ∗ (X, t ∗ ) = c(t) + Q(t)χ(X, t), where Q is a rotation, det Q = 1.

3.2 Objectivity

179

This assumption places restrictions on the possible constitutive equations. To establish objectivity requirements we need to know, at least by assumption, the properties of the involved scalars, vectors, and tensors under change of frame. We say that scalars α, vectors u, and tensors A are objective if the transformation property is u∗ = Qu, A∗ = QAQT . α∗ = α, Indeed, objective scalars are invariants. The tensors Q entering the transformation law (3.1) are subject to QQT = QT Q = 1,

det Q = 1

and are often called proper rotations. They constitute a nonabelian group denoted SO(3) (special orthogonal group). The orthogonal tensors Q with det Q = ±1 constitute a group denoted O(3); they comprise also reflections (improper rotations). A vector u such that Q ∈ O(3) u∗ = (det Q)Qu, is called objective axial vector (or pseudovector). Instead, a vector r such that r∗ = Qr,

Q ∈ O(3)

is called objective vector. Physical arguments indicate that the magnetic vectors B, H, M are axial vectors. An immediate example of axial vector is the vector product. If u, v are two objective vectors and w = u × v, Q = −1 then w∗ = (−u) × (−v) = w consistent with the transformation law w∗ = (−1)(−1)w. Restrictions placed by objectivity will be investigated in connection with the constitutive models of materials. Here, though, we establish some preliminary results. To fix ideas we look at the possible dependence of the stress tensor T and the heat flux q on deformation, kinematic variables, and other pertinent variables. We assume that T and q are objective and hence in the change of frame F → F ∗ they satisfy T∗ = QTQT ,

q∗ = Qq.

As to the Cauchy stress we might start with the assumption that the traction t and the unit vector n transform (as objective vectors) according to t∗ = Q t, Hence

n∗ = Q n.

Qt = t∗ = T∗ n∗ = T∗ Qn

180

3 Generalities on Constitutive Models

implies t = (QT T∗ Q)n whence T∗ = QTQT ; the objectivity of the traction t implies the objectivity of the Cauchy stress T. Further, ∫ T∗ n∗ da = ∫ (QTQT )Qn da = Q ∫ Tn da.

P ∗t

Pt

Pt

Hence it follows the objectivity of the surface force [216], ∫ T∗ n∗ da = Q ∫ Tn da.

Pt∗

Pt

Likewise b∗ = Qb implies ∫ ρb∗ dv = Q ∫ ρb dv.

Pt∗

Pt

The whole force on the body is then objective. The objectivity of the heat flux q, q∗ = Qq, implies the invariance of the heat flow through the surface, ∫ q∗ · n∗ da = ∫ q · n da.

Pt∗

Pt

By the polar decomposition theorem, F = RU, where R is a rotation1 and U is a positive definite, U2 = FT F. Since F∗ = QF it follows that U is invariant and then J = det F = det R det U = det U is invariant too. Since the Cauchy stress T is objective then the first and second Piola stresses obey the transformation laws T∗R R = T R R . T∗R = Q T R , These properties follow by direct evaluation of T R = J TF−T and T R R = J F−1 TF−T , T∗R = J (QTQT )Q−T F−T = QJ TF−T = QT R , T∗R R = J (F−1 Q−1 )(QTQT )Q−T F−T = J F−1 TF−T = T R R . 1

det R = 1 if det F > 0.

3.2 Objectivity

181

Instead, the objectivity of the heat flux q, q∗ = Qq, implies the invariance of the reference heat flux q R , q∗R = (J qF−T )∗ = J (Qq)QF−T = J qF−T = q R . A constitutive tensor function G of K is objective if G (K∗ ) = Q G (K) QT . Depending on the transformation law for K we can find the restrictions placed on G as a consequence of the arbitrariness of Q, subject to det Q = 1. This is now exemplified via the main constitutive properties.

3.2.1 Dependence on the Velocity Gradient Look at the constitutive equation ˆ T = T(L). Objectivity requires that T T ˆ ˆ ∗ ) = T(QLQ ˆ T∗ = Q T QT = QT(L)Q = T(L + )

˙ T . Since this requirement has to hold for any time dependent rotation where  = QQ Q we choose the function Q such that, at the pertinent time t, Q(t) = 1,

˙ Q(t) = −W(t).

Hence, since L − W = D we have ˆ ˆ T(L) = T(D). Consequently, by objectivity the Cauchy stress T is allowed to depend on the velocity gradient L only through the stretching D, that is T cannot depend on the spin W.

3.2.2 Dependence on the Deformation Gradient We now examine the possible dependence of the stress on the deformation gradient. Consider the first Piola stress T R = J TF−T and let T R = Tˆ R (F).

182

3 Generalities on Constitutive Models

Since T∗R = QT R then objectivity requires that Tˆ R (F∗ ) = QTˆ R (F), the constitutive function Tˆ R being the same in F and F ∗ . Hence ˆ R (F) = QT Tˆ R (F∗ ) = QT Tˆ R (QRU). T Letting QT = R = FU−1 we have ˆ R (F) = FU−1 Tˆ R (U). T This means that, apart from the multiplicative (linear) term F, the dependence of T R on F is through U. In addition we find that ˆ R (U)FT . T = (det U)−1 FU−1 T Consequently, by the principle of objectivity the Cauchy stress T takes the form T ˆ T = FT(U)F ,

ˆ where T(U) is a shorthand for (det U)−1 U−1 Tˆ R (U). The conclusion holds unchanged if T depends on the deformation gradient F and on the temperature θ, θ being invariant. Since the finite strain tensor E = 21 (FT F − 1) is invariant then T R cannot be merely a function of E. Instead, T R R = Tˆ R R (E) satisfies the invariance property of T R R whereas T R = FT R R takes the form T R = F Tˆ R R (E).

3.2.3 Thermoelastic Variables In thermoelasticity the independent variables may be F, θ, and ∇R θ. Let ψ be a scalar, q be a vector, and T be a tensor, all of them being objective and hence ψ being invariant. Since F∗ = QF whereas θ and ∇R θ are invariant then

3.2 Objectivity

183

ψ(F, θ, ∇R θ) = ψ ∗ (F, θ, ∇R θ) = ψ(F∗ , θ∗ , ∇R θ∗ ) = ψ(QF, θ, ∇R θ). In light of the polar decomposition, F = RU, it follows that ψ(F, θ, ∇R θ) = ψ(QRU, θ, ∇R θ). Choosing Q = R T we find ψ(F, θ, ∇R θ) = ψ(U, θ, ∇R θ). Likewise for q and T we have Q q(F, θ, ∇R θ) = q(Q F, θ, ∇R θ),

Q T(F, θ, ∇R θ)QT = T(Q F, θ, ∇R θ),

whence q(F, θ, ∇R θ) = R q(U, θ, ∇R θ),

T(F, θ, ∇R θ) = R T(U, θ, ∇R θ)R T .

In the customary Fourier’s law q is a (linear) function of the temperature gradient in the current configuration, ∇θ. Since (∇θ)∗ = ∂x∗ θ = Q∂x θ = Q ∇θ then objectivity requires that Q q(..., ∇θ) = q∗ (..., ∇θ) = q(..., Q ∇θ), the dots indicating possible further dependences. Now, let q = −κ ∇θ. We have

q∗ = Q q = −κQ ∇θ = q(Q ∇θ),

and hence the isotropic Fourier’s law is objective. In anisotropic materials we can write q = −K ∇θ where K is a second-order tensor, possibly dependent on F. We have q∗ = Q q = −Q K ∇θ,

q((∇θ)∗ ) = −K∗ Q ∇θ.

Objectivity, i.e. q∗ (∇θ) = q((∇θ)∗ ), requires that Q K ∇θ = K∗ Q ∇θ

184

3 Generalities on Constitutive Models

for any vector ∇θ whence

K∗ = Q K QT .

Fourier’s law is objective if and only if the conductivity tensor K is objective. In a model of generalized thermoelasticity [288, 335], the dependence on θ˙ is also allowed. Since θ is invariant then θ˙ is invariant too and then the dependence on θ˙ is compatible with the principle of objectivity.

3.2.4 Thermo-Viscous Variables In modelling thermo-viscous materials the dependence on the velocity gradient L and the temperature gradient ∇θ is usually considered. The dependence on the mass density ρ and the temperature θ are also allowed. To fix ideas, let T(ρ, θ, L, ∇θ) be the constitutive equation for the Cauchy stress tensor. Objectivity requires that Q T(ρ, θ, L, ∇θ)QT = T(ρ, θ, L∗ , (∇θ)∗ ), where the invariance of ρ and θ has been used. Since L∗ = Q L QT +  and (∇θ)∗ = Q∇θ we have Q T(ρ, θ, L, ∇θ)QT = T(ρ, θ, Q L QT + , Q ∇θ). Choose Q = 1. Hence T(ρ, θ, L, ∇θ) = T(ρ, θ, L + , ∇θ) ˙ T = Q. ˙ Hence letting Q ˙ = −W has to hold for every skew-symmetric tensor  = QQ we find that T cannot depend on the skew part of L. Therefore we take T in the form T = T(ρ, θ, D, ∇θ) and objectivity requires that Q T(ρ, θ, D, ∇θ)QT = T(ρ, θ, Q D QT , Q ∇θ). As an example, let T = 2μ(ρ, θ)D + λ(ρ, θ)(tr D)1 + ν(ρ, θ)∇θ ⊗ ∇θ. Then we have

3.2 Objectivity

185

Q T QT = 2μQ D QT + λ(tr D)1 + ν Q ∇θ ⊗ Q ∇θ. Since tr Q D QT = tr D we can write Q T QT = 2μQ D QT + λ(tr (Q D QT ))1 + ν Q ∇θ ⊗ Q ∇θ, which makes it apparent that the example provides an objective constitutive equation. Likewise we can show that q cannot depend on the skew part of L and that q = −κ(ρ, θ)∇θ + h(ρ, θ)D ∇θ is an objective constitutive equation. Since D and ∇θ are objective, the scalars D · D, ∇θ · D ∇θ, and ∇θ · ∇θ are invariant. By a direct check we have (∇θ)∗ · D∗ (∇θ)∗ = Q∇θ · QDQT Q∇θ = ∇θQT QDQT Q∇θ = ∇θ · D∇θ and the like for the other terms. Hence the scalars μ, λ, ν, κ, h may depend also on tr D, D · D, ∇θ · D∇θ, ∇θ · ∇θ and meanwhile T and q are objective.

3.2.5 Dependence on Histories The causality of natural processes may be interpreted as implying that the conditions in a body, at time t, are determined by the past history of the body and that no aspect of its future behaviour need be known. This might be phrased by saying that the effect at time t is determined by the history of the cause prior to time t. Given a function F on R we define Ft (s) = F(t − s),

s ∈ R+

and say that Ft is the history of F up to time t. To fix ideas we take the stress tensor T as the effect produced by the motion and take that, at a point X, T(t) = T (Ft ), where T and F are considered at the same point X. Following [428], p. 58, objectivity is said to require that Q(t)T(t)QT (t) = T (Qt Ft ),

186

3 Generalities on Constitutive Models

where (Qt Ft )(s) = Q(t − s)F(t − s),

s ∈ [0, ∞).

Based on this statement we can use the polar decomposition, F = RU, and take Q(t − s) = R T (t − s) to obtain R T (t)T(t)R(t) = T (Ut ), whence

T (Ut )R T (t). T(t) = R(t)T

Since R = FU−1 then T(t) = F(t)U−1 (t) T (Ut )U−1 (t)FT (t). Let Tˆ = U−1 (t) T (Ut )U−1 (t). Since Tˆ is invariant under change of frame then it is apparent that F Tˆ FT is an objective tensor.

3.2.6 Variables in Rate-Type Equations There are cases where the behaviour of the body is taken to be modelled by specifying the time derivative, of appropriate physical quantities, in terms of the independent variables. This is the content of the constitutive rate-type equations. Yet, in deformable bodies, the time derivative is not objective. It is then required by objectivity that rate-type equations involve objective derivatives (e.g. corotational, upper and lower convected). To fix ideas, assume the vector q and the tensor T are given by rate-type equations ◦

ˆ F, ∇R θ), q = q(θ,



ˆ T = T(θ, F, ∇R θ).

Objectivity requires that ˆ F, ∇R θ) = q(θ, ˆ QF, ∇R θ), Qq(θ,

ˆ ˆ QT(θ, F, ∇R θ)QT = T(θ, QF, ∇R θ).

The restrictions are then formally the same as those for finite equations. We then find that ◦ ◦ ˆ ˆ U, ∇R θ), q = R q(θ, T = R T(θ, U, ∇R θ)R T .

3.2 Objectivity

187

3.2.7 Objectivity and Invariants A scalar-valued function f is invariant under changes of frames. If f is a constitutive function which depends on tensors then f can depend only on tensor’s invariants. Given a 3 × 3 tensor A the three quantities I1 = tr A,

I2 = 21 [(tr A)2 − tr A2 ],

I3 = det A

are invariant. To prove this we observe tr A∗ = Aii∗ = Q ik Q i h Akh = δkh Akh = Akk = tr A. Likewise we prove the invariance of tr A2 . As to the determinant we have det A∗ = det[QAQT ] = det A. Yet also the three scalars I1 = tr A,

I2 = tr A2 ,

I3 = tr A3

are invariant, as it follows owing to the trace property. Indeed, by definition I1 = I1 ,

I2 = 21 (I12 − I2 ).

Now, by the Cayley-Hamilton theorem A3 − I1 A2 + I2 A − I3 1 = 0 we obtain tr A3 − I1 tr A2 + I2 tr A = 3 I3 , whence I3 = 13 [I3 − 23 I1 I2 + 21 I13 ]. From a physical point of view tr A and det A are more convenient in the description of constitutive properties. That is why I1 , I2 , I3 are less frequent in the literature. For formal convenience the notation of I1 , I2 , I3 (in place of I A , I I A , I I I A ) is preferred whenever no ambiguity occurs about the tensor under consideration.

188

3 Generalities on Constitutive Models

3.3 Objectivity and Euclidean Invariants in Electromagnetism Whenever a constitutive model arises from an appropriate potential function (free energy, enthalpy) it is crucial to select the pertinent independent variables. For definiteness we examine a scalar-valued function f (F, E, B) of the deformation gradient F, the electric field E, and the induction B. Upon a Euclidean transformation x → x∗ = c(t) + Q(t)x,

det Q = 1,

F, E, and B are transformed into F∗ = QF, E∗ = QE, B∗ = QB.

(3.2)

Hence objectivity requires that f (QF, QE, QB) = f (F, E, B) for any orthogonal tensor Q. To determine consequences of (3.2) we apply a classical argument based on the use of the polar decomposition F = RU. By selecting Q = R T it follows f (F, E, B) = f (R T RU, R T E, R T B) = f (U, U−1 FT E, U−1 FT B). Since U = C1/2 we can write f = fˆ(C, FT E, FT B). Other possible restrictions might be obtained by selecting Q in different ways. Yet this is only a sufficient condition on the function f (F, E, B) induced by the invariance under the rotation associated with the motion of the continuum. We may observe that f is objective if it depends on Euclidean invariants. If A is any Euclidean invariant, i.e. A∗ = A, then the restriction placed by objectivity, f (A) = f (A∗ ) is identically satisfied. Now, C, FT E, FT B are invariants in that C∗ = F∗T F∗ = FT QT QF = FT F = C, F∗T E∗ = FT QT QE = FT E,

3.4 Consistency with the Second Law

189

and the like for FT B. Further, F−1 E and F−1 B are invariants in that F∗−1 E∗ = F−1 Q−1 QE = F−1 E and the like for F−1 B. Since J = det F = det C1/2 also f (J )FT E and f (J )F−1 E are invariants for any function f (J ). In addition, the scalars E2 , B2 , E · B are invariants, i.e. E∗2 = (QE) · (QE) = EQT QE = E2 ,

E∗ · B∗ = (QE) · (QB) = EQT QB = E · B.

All of the invariants are possible variables for a (objective) constitutive function f . Also functions of the invariants are invariants, e.g. FT E · FT E,

(CFT E) · FT E,

(C2 FT E) · FT E

are invariants. Henceforth it is understood that the dependence of a scalar function on the variables F, E, B and their derivatives is in fact given by a composite function of Euclidean invariants. In practice, the selection of the appropriate variables is related to the particular model of the continuum. As we see, e.g. in Sect. 12.3, the dependence on FT E induces the occurrence of a torque and then implies skwT = 0. Instead, no torque arises if the dependence on E is via E2 . In passing, we observe that the selection might be suggested by the physical meaning of the variables. This seems to be the case for the selection of FT E in [137] and J F−1 B in [136]; they are the Lagrangian counterparts of E and B, namely the fields occurring in the material description of Maxwell’s equations. Of course the Lagrangian fields FT E, J F−1 D, J F−1 P, J F−1 B, FT M, FT H are Euclidean invariants.

3.4 Consistency with the Second Law Granted the compatibility with the balance equations (mass, linear momentum, angular momentum, energy), the constitutive equations are required to be consistent with the second law of thermodynamics; the second law then places restrictions on the effective evolutions and hence selects those which are physically admissible. Here we first go back to the statement of the second law and next specify a view underlying the analysis of the consistency with the second law. The requirement of the second law on the thermodynamic process might be stated as a requirement on the constitutive equations. The standard view can be given the following form. Through the balance of entropy, in Sect. 2.6, we have established the second law of thermodynamics by assuming that for every admissible thermodynamic process, characterized by constitutive equations and balance equations, the

190

3 Generalities on Constitutive Models

inequality (2.39) must hold for all times t and points x. In the generalized form of the second law, the entropy production σ, as well as the entropy η and the entropy flux j, is given by a constitutive function. The more general view where also the entropy production is given by a constitutive function per se proves profitable in the modelling of hysteresis such as in ferroelectric, ferromagnetic, and plastic materials. It is worth mentioning that in [93, 335] the (rate of) entropy production σ is defined and required to be non-negative, as in (2.41). Yet no constitutive property is considered for σ. In [209] the entropy production is considered as a constitutive function but the requirement σ ≥ 0 is ignored. Granted the balance equations, the second law becomes a criterion of admissibility of the assumed constitutive functions. In this connection, while exploiting the requirement of the second law it is essential to envisage the freedom allowed in the selection of the admissible thermodynamic processes. In practical problems the supplies b and r are assigned in advance and hence the balance equations of motion and energy result in differential equations in the pertinent unknowns, e.g. ρ, θ, v. Yet we take the view that conceptually the functions b(x, t) and r (x, t) can be selected arbitrarily. Then for any value of the right-hand sides of 1 b = v˙ − ∇ · T ρ

and

1 1 r = ε˙ − T · D + ∇ ·q, ρ ρθ

(3.3)

the balance equations hold because b and r are just taken to be given by (3.3). Analogous claims hold for the space and time derivatives. Hence Eq. (3.3) provides the body force b and the heat supply r , and possibly their derivatives, that must be applied to support the process. This view on the conceptual role of the supplies traces back to Coleman and Noll [93]. Based on this view, for later use we show that, given any spatial point and any ˙ θ, ¨ ∇ θ, ˙ and L, L, ˙ L, ¨ ∇L or time, it is possible to find a process such that ρ, ˙ ρ, ¨ ∇ ρ, ˙ θ, ˙ F¨ have arbitrarily prescribed values at that point and time. First we observe that, F, since ρ˙ = −ρ∇ · v, ρ¨ = −ρ∇ ˙ · v − ρ∇ ˙· v, ∇ ρ˙ = −∇ρ ∇ · v − ρ∇ ∇ · v, then ρ, ˙ ρ, ¨ and ∇ ρ˙ can take arbitrarily prescribed values if so is the case for ∇ · v, ∇ ˙· v, and ∇ ∇ · v; of course ρ˙ and ∇ · v are not independent. This in turn is the case if ˙ and ∇L can take arbitrarily prescribed (tensor) values. L, L, By the equation of motion we can write t

t

t0

t0

v(X, t) = ∫ v˙ (X, ξ)dξ + v(t0 ) = ∫ b(X, ξ)dξ + . . . the dots denoting the remaining terms. Hence we have t

F˙i K (X, t) = ∂ X K vi (X, t) = ∂ X K ∫ bi (X, ξ)dξ + . . . t0

3.4 Consistency with the Second Law

191

F¨i K = ∂ X K v˙i (X, t) = ∂ X K bi (X, t) + . . . . Arbitrarily prescribed values of F¨ and F˙ are allowed by appropriate values of ∇ R b and of the integral of ∇ R b. Within the Eulerian description we observe t t  ∂x j vi = ∂x j ∫ bi (X, ξ)dξ + · · · = ∫ ∂ X K bi (X, ξ)dξ FK−1i + . . . . t0

t0

Now by (1.30) and (1.31) we have L˙ = ∇ v˙ + · · · = ∇b + . . . and

¨ = ∇ v¨ + · · · = ∇ b˙ + . . . . L

˙ L ¨ are allowed by appropriate values of Hence arbitrarily prescribed values of L, L, t

˙ ∫ ∇ R b(X, ξ)dξ, ∇b, ∇ b. t0

˙ ∇ θ, ˙ θ¨ we let the internal energy ε depend on In connection with the values of θ, the temperature θ so that ∂θ ε = 0. The balance equation of energy gives ∂θ ε θ˙ + · · · = r. Consequently  r  r r θ˙ = + ... , ∇ θ˙ = ∇ + ... , θ¨ = ˙+ ... . ∂θ ε ∂θ ε ∂θ ε ˙ ∇ θ, ˙ and θ¨ are allowed by appropriate values Thus arbitrarily prescribed values of θ, of r, ∇r , and r˙ . ˙ implies that of D = symL, ∇ · v = tr L, W = skwL, The arbitrariness of L and L ˙ = symL, ˙ W ˙ = skwL. ˙ Moreover, by (1.26) we have and D ˙ = ∇ θ˙ − LT ∇θ = ∇ r + . . . ∇θ ∂θ ε ˙ is allowed to take and hence, for fixed values of L and ∇θ, the time derivative ∇θ any arbitrarily prescribed value. Especially in connection with the statement and the exploitation of the second law, the literature shows different approaches. The approach so described is often referred to as that of rational thermodynamics and that is the approach we follow throughout the book.

192

3 Generalities on Constitutive Models

3.4.1 Exploitation of the Second Law Granted the compatibility with the balance equations, the constitutive equations are required to be consistent with the second law of thermodynamics. We can then view the balance equations as constraints on the fields that are required to satisfy the second law (via the entropy inequality). Let  be a set of variables and assume the entropy inequality takes the form f ()α + g(, β) ≥ 0.

(3.4)

˙ If α can take arbitrary ˙ β = (ρ, ˙ ∇θ). As an example,  = (ρ, θ, D, ∇θ), α = θ, ˙ D, (positive or negative) values, independent of  and β, then (3.4) implies f = 0 and g ≥ 0. Otherwise, if f = 0 then the arbitrariness of α allows | f α| > |g| and this in turn allows f α + g < 0, which contradicts (3.4). Likewise, if vectors are involved then f ·α+g ≥0 implies f = 0. If tensors are concerned in the form F · A + g ≥ 0, the arbitrariness of A implies F · A = 0 and then F = 0. Rather, if A ∈ Sym (or Skw) then F ∈ Skw (or Sym). This is the essence of the so-called Coleman-Noll procedure ([93]; [216], ch. 39). It is apparent that the linearity is essential in that it allows both positive and negative values of f α if f = 0. To save writing, henceforth we mention the pertinent arbitrariness and understand the self-evident linearity. If α, β, and  are not independent then we cannot conclude f = 0. Instead a representation formula holds as shown in Sect. 3.6.

Exploitation via Lagrange Multipliers Within the classical approach described in the previous section, the balance of mass (i.e. the equation of continuity) is unavoidably a constraint while the balances of linear momentum and energy allow, e.g. for arbitrary values of v˙ and ε. ˙ This is so because the body force b and the heat supply r are arbitrary and can be assigned to satisfy the balance equations for any chosen values of v˙ and ε. ˙ In a slightly different approach established by Liu [279], the entropy inequality ρη˙ + ∇ · φ = ∂t (ρη) + ∇ · (ρηv + j) ≥ 0

3.4 Consistency with the Second Law

193

is exploited for supply-free bodies, i.e. bodies where no body forces and no heat supplies occur. Hence, in view of the balance equations, which are considered as constraints, the entropy inequality can be written in the form ∂t (ρη) + ∇ · (ρηv + j) − ρ [∂t ρ + ∇ · (ρv)] − v · [∂t (ρv) + ∇ · (ρv ⊗ v − T)] −ε [∂t (ρ(ε + 21 v2 )) + ∇ · (ρ(ε + 21 v2 )v − vT + q))] ≥ 0,

ρ , v , ε being the Lagrange multipliers, viewed as additional constitutive functions of the chosen independent variables. For definiteness let η, ε, T, q be functions of ρ, θ, D, ∇θ. The entropy inequality becomes [η + ρ∂ρ η − ρ − ε (ε + 21 v2 − ρ∂ρ ε)]∂t ρ + ρ(∂θ η − ε ∂θ ε)∂t θ − ρ(v + ε v) · ∂t v + ...

˙ ∇ρ, ∇D, ∇∇θ. The arbitrariness and the ˙ ∇θ, the dots denoting linear terms in D, linearity of ∂t ρ, ∂t θ, ∂t v imply that ∂ρ (ρη) − ρ − ε [ 21 v2 + ∂ρ (ρε)] = 0,

∂θ η − ε ∂θ ε = 0,

v + ε v = 0,

whence we obtain formally the Lagrange multipliers ε , ρ , v . Further restrictions are derived via the reduced inequality and appropriate assumptions on the dependence on D, ∇θ. In particular, further assumptions are required to determine the dependence of the Lagrange multipliers on the independent variables.2

3.4.2 Other Formulations of the Second Law Within the classical approach of rational thermodynamics (see, e.g. [93, 95, 99]), the free energy ψ, the entropy η, and the internal energy ε cannot depend on the temperature gradient. In a rational thermodynamics approach such dependence is ˙ allowed provided the extra-entropy flux k is taken to depend on the time derivative θ. Sometimes the second law is modified so that the expected dependence be allowed. Following are two formulations occurring in the literature that are devised to account for the dependence on gradients.

Extended Entropy Rate According to [382], in our notation the entropy inequality is taken in the form q ρr · n da − ∫ dv ≥ 0 S˙ + ∫ Pt θ ∂Pt θ 2

An extensive application of the method of Lagrange multipliers is given in [338] within the theory of extended thermodynamics.

194

3 Generalities on Constitutive Models

but the entropy rate is assumed to be given by ρ ˙ S˙ = ∫ (θη˙ + ∇θ · w)dv, Pt θ

(3.5)

˙ would be an additional entropy supply, w being an appropriate as though ∇θ · w/θ vector field. Hence the (local) entropy inequality becomes 1 1 1 ρ ˙ + ∇ · q − ρr − 2 q · ∇θ ≥ 0. (θη˙ + ∇θ · w) θ θ θ θ Substitution of ∇ · q − ρr from the balance of energy, ∇ · q − ρr = T · D − ρε, ˙ results in 1 ˙ + T · D − ρε˙ − q · ∇θ ≥ 0. ρ(θη˙ + ∇θ · w) θ Let the internal energy be defined by ε := ψ + ηθ + w · ∇θ. It follows

(3.6)

˙ + T · D − 1 q · ∇θ ≥ 0. ˙ − ρw · ∇θ −ρ(ψ˙ + η θ) θ

Let ξ be a suitable variable describing a possible internal structure of the body and assume the constitutive functions depend on ρ, θ, ∇θ, D, ξ. The entropy inequality takes the form ˙ + ρ2 ∂ ψ∇ · v + T · D − ρ∂ ψ · D ˙ − ρ∂ξ ψ ξ˙ − 1 q · ∇θ ≥ 0. −ρ(∂θ ψ + η)θ˙ − ρ(∂∇θ ψ + w) · ∇θ ρ D θ

˙ implies that ˙ ∇θ ˙ θ, The arbitrariness of D, ∂D ψ = 0, and

η = −∂θ ψ,

w = −∂∇θ ψ

1 ρ2 ∂ρ ψ∇ · v + T · D − ρ∂ξ ψ ξ˙ − q · ∇θ ≥ 0. θ

˙ Further restrictions follow by specifying the constitutive equation for the rate ξ. Hence the dependence of the free energy ψ on ∇θ is allowed if both the entropy ˙ · ∇θ and rate S˙ and the internal energy are defined via the ad-hoc insertion of w w · ∇θ. We are unaware of a physical argument to justify (3.5) and (3.6).

3.4 Consistency with the Second Law

195

Second Law via an Entropy Equation Green and Naghdi [209, 210] postulate the balance of entropy in the form q r ρη˙ = ρ( + ξ) − ∇ · , θ θ where ξ is the “internal rate of production of entropy” (per unit mass). As is customary, we replace ρr − ∇ · q via the balance of energy and consider the free energy ψ = ε − θη to find ˙ + T · D − 1 q · ∇θ − ρθξ = 0. (3.7) − ρ(ψ˙ + η θ) θ According to [210], once constitutive equations are specified for ψ, η, q, ξ then (3.7) can be regarded as an identity and can be used to place restrictions on the form of the constitutive functions.3 We might observe that (3.7) can be written in the form ˙ + T · D − 1 q · ∇θ = ρθξ. −ρ(ψ˙ + η θ) θ This condition is equivalent to the statement of the second law via the ClausiusDuhem inequality if ξ is defined by ρξ := ρη˙ −

q ρr +∇ · θ θ

and we assume that ξ ≥ 0 for any admissible process. Without the assumption ξ ≥ 0 we might face, e.g. with the equality [210] q · ∇θ + ρθ2 ξ = 0 and we could not conclude about the sign of q · ∇θ.

Generalized Gibbs Equation and Ginzburg-Landau Entropies In classical thermodynamics of homogeneous processes, the relation θd S = dU + pd V, where S, U , and V are the entropy, the energy, and the volume of the system, is taken to hold for reversible changes in a closed system. For non-homogeneous processes the relation becomes 3 Variants of this view occur if both the absolute temperature θ and an empirical temperature T are considered [210].

196

3 Generalities on Constitutive Models

θη˙ = ε˙ + p υ. ˙ The extension to mixtures is usually referred to as Gibbs equation. Borrowing from the Gibbs equation, the second law is established by means of the time derivative η˙ such that [118, 245] ρη˙ = −∇ · j + σ, where j is the entropy flux and σ is the non-negative entropy production. As we see in a while, though this view is formally correct, physically significant results follow depending on the choice of the independent variables. For formal simplicity we let the body be rigid; hence ρ is constant and we let η stand for ρη. Incidentally, in the modelling for phase transformation theories, as an extreme model we may introduce an interface separating the two phases. More realistically, the interfacial region may have a significant thickness and hence the gradients of the pertinent fields are likely to play an important role. This is merely an example that justifies the recourse to appropriate gradients within the independent variables. First we let η = f (θ, ϕ, q, ϕ) ˙ − 21 α|∇θ|2 − 21 β|∇ϕ|2 , where α, β are constants and ϕ is an appropriate field (e.g. the phase field describing an interface). The gradients make the function to be of the Ginzburg-Landau form. Computation of η˙ and some rearrangements yield ˙ + β ϕ∇ϕ). ˙ η˙ = (∂θ f + αθ)θ˙ + (∂ϕ f + βφ)ϕ˙ + ∂ϕ˙ f ϕ¨ + ∂q f · q˙ − ∇ · (αθ∇θ Hence we may identify j as ˙ + β ϕ∇ϕ. j = αθ∇θ ˙ The remaining inequality allows σ ≥ 0 if ∂ϕ˙ f = 0 and ϕ˙ = λ(βϕ + ∂ϕ f ), θ˙ = μ(αθ + ∂θ f ), λ, μ being non-negative functions of θ, ϕ. The term ∂q f · q˙ might vanish if ∂q f = 0 or might be non-negative with a rate-type equation for q, e.g. q˙ = ∂q f . The partial differential equations for θ and φ have the form of Ginzburg-Landau equations [178]. This is physically reasonable for the phase field ϕ while the temperature θ and the heat flux q are expected to model heat conduction. If instead we replace θ with ε as independent variable it follows η˙ = (∂ε f − ∇ · ∂∇ε f )˙ε + ∇ · (∂∇ε f ε˙ ) + ∂q f · q˙ + (∂ϕ f + βϕ)ϕ˙ + ∂ϕ˙ f ϕ ¨ − ∇ · (β ϕ∇ϕ). ˙

3.4 Consistency with the Second Law

197

In view of the balance of energy, ε˙ = −∇ · q, we obtain η˙ = q · ∇(∂ε f − ∇ · ∂∇ε f ) + ∂q f · q˙ + (∂ϕ f + βϕ)ϕ˙ + ∂ϕ˙ f ϕ¨ ˙ −∇ · (β ϕ∇ϕ ˙ + (∂ε f − ∇ · ∂∇ε f )q − ∂∇ε f ε). As with the previous case we conclude that ∂ϕ˙ f = 0 while j = β ϕ∇ϕ ˙ + (∂ε f − ∇ · ∂∇ε f )q − ∂∇ε f ε. ˙ Now let f (ε, ϕ, q) = fˆ(ε, ϕ) − 21 γq2 ,

η = f (ε, ϕ, q) − 21 ν(∇ε)2 − 21 β(∇ϕ)2 .

Hence we have ˙ + (βϕ + ∂ϕ fˆ)ϕ. σ = η˙ + ∇ · j = q · (∂ε fˆ + νε − γ q) ˙ The independence of q and ϕ implies that both terms are non-negative whence ϕ˙ = λ(βϕ + ∂ϕ fˆ),

˙ q = ξ(∂ε fˆ + νε − γ q),

where λ and ξ are non-negative functions of ε, ϕ. Hence we have q + ξγ q˙ = ξ(∂ε fˆ + νε). This is a Maxwell-Cattaneo type equation for the heat flux q (see, Sects. 6.2.1, 7.3, 6.6), with relaxation time ξγ, embodying the non-local term ξνε.

3.4.3 Remarks on Onsager’s Reciprocal Relations As it emerges in various contexts, the exploitation of the entropy inequality eventually leads to the positive valuedness of an entropy production σ = J · X, where X is a proper set of scalars, vectors, tensors; the components of X are called forces, the associated components of J are called fluxes. Assume that, in linear theories, the constitutive equations have the form J = X, where  is non-singular. Hence the thermodynamic requirement is

198

3 Generalities on Constitutive Models

σ(X) = X · X ≥ 0 for any n-tuple X. It is apparent that σ ≥ 0 if and only if + = sym is positive semi-definite while the skew part − = skw contributes nothing to σ. Incidentally, in the Fourier model of heat conduction J = q,

X = −∇ ln θ.

If w is the axial vector of − we can take X = κ1 + w × X and J · X = κX2 ≥ 0

=⇒

κ ≥ 0.

Of course we may well ask about the physical motivation of the vector w and hence of − . Things are more involved when different phenomena occur like in the case of heat conduction and diffusion. As we see in mixture theory, Chap. 9, for binary mixtures of non-reacting inviscid fluids the entropy inequality reduces to σ = m1 · (v2 − v1 ) + q · (−∇ ln θ) ≥ 0,

(3.8)

where m1 is the force on constituent 1 by way of constituent 2 and q stands for the inner part of the heat flux. In a linear theory we let X = (−∇ ln θ, m1 ) and assume [314] q = −κ∇ ln θ + ξm1 , v2 − v1 = −ν∇ ln θ + ζm1 . Accordingly, ξm1 represents the effect of diffusion on heat conduction (Dufour effect) while ν(−∇ ln θ) is the effect of heat conduction on diffusion (Soret effect). The symmetry of the matrix  results in ξ =ν; physically we say that the transport coefficients for the Soret and Dufour effects equate. Two remarks are in order. In general the symmetry of  is not merely a consequence of thermodynamics; the general class of Onsager’s reciprocal relations (in irreversible processes) are claimed to follow from the assumption of microscopic reversibility [350]; a critical review of the theory of Onsager’s relations is given in [427], Chap. 7. As a second remark we observe that the symmetry of , due to Onsager’s relations, places no bound on the cross-coupling terms, e.g. ξm1 , −ν∇ ln θ. Now, by (3.8) it

3.5 Entropy Equation and Gibbs Equations

199

follows σ = ζm12 + (ξ + ν)m1 · (−∇ ln θ) + κ(∇ ln θ)2 . Let m1 = y(−∇ ln θ) whence σ = [ζ y 2 + (ξ + ν)y + κ](∇ ln θ)2 . Hence σ ≥ 0 if and only if ζ ≥ 0,

κ ≥ 0,

(ξ + ν)2 − 4κζ ≤ 0.

In words, the positive value of the entropy production requires that the cross-coupling terms are not too large relative to the direct terms; formally, the sum ξ + ν cannot be too large with respect to κ and ζ. Indeed, if by chance we use the Onsager relation ξ = ν then we find the bound ν 2 ≤ κζ.

3.5 Entropy Equation and Gibbs Equations While the second law is expressed by an entropy inequality, ρη˙ ≥ ρr/θ − ∇ · j, it is worth looking at equations for the entropy function. The entropy equation results in a relation between entropy and energy. Let w be a generic power, per unit volume, so that the balance of energy is ρε˙ = w + ρr − ∇ · q.

(3.9)

Look at a constitutive setting where the temperature θ and the variable α are the ˆ α). As is most frequently independent variables, hence η = η(θ, ˆ α) and ψ = ψ(θ, ˆ It follows that proved, assume ηˆ = −∂θ ψ. ˙ + θη. ˙ ε˙ = ψ +˙ θη = (∂θ ψˆ + η)θ˙ + ∂α ψˆ · α Hence we find the entropy equation ˙ ρθη˙ = ρε˙ − ρ∂α ψˆ · α.

(3.10)

Some examples are now examined. First, as is the case for fluids, let α = ρ, w = ˆ Since ∇ · v = −ρ/ρ T · D = − p∇ · v + T · D, with p = ρ2 ∂ρ ψ. ˙ then, in view of (3.10), we find the Gibbs equation ρθη˙ = ρε˙ −

p ρ. ˙ ρ

(3.11)

200

3 Generalities on Constitutive Models

By the entropy inequality, 1 ρθη˙ ≥ ρr − ∇ · q + q · ∇θ, θ and use of (3.9) we have 1 1 θσ = ρθη˙ − ρr + ∇ · q − q · ∇θ = T · D − q · ∇θ. θ θ This shows that the entropy Eq. (3.11) holds even if heat conduction and viscosity effects occur. Second, as is shown in the next chapter, for thermoelastic solids we have α = F and 1 1 ˙ w = T R · F, TR , ∂F ψˆ = Jρ J T R being the first Piola stress. Hence ˙ + θη˙ = 1 T R · F˙ + θη. ˙ ε˙ = ψ +˙ θη = ∂θ ψˆ θ˙ + ∂F ψˆ · F˙ + θη Jρ Hence we get the entropy equation in the form 1 ˙ ρθη˙ = ρε˙ − T R · F. J

(3.12)

ˆ Third, let α = p while w = − p∇ · v and η = −∂θ φ(θ, p), with φ = ψ + p/ρ ˆ being the Gibbs free energy. As we show later on, ∂ p φ(θ, p) = 1/ρ. Now, ˙ − θη˙ + p/ρ ˙ 2. ˙ − p ρ/ρ ˙ 2 = −θη˙ + p ρ/ρ ε˙ = φ + θη˙ − p/ρ = ∂θ φˆ θ˙ + ∂ p φˆ p˙ − θη Hence we obtain the entropy equation in the form (3.11). As a comment we observe that entropy equations can take the same form in different continua, e.g. Eq. (3.11) for compressible fluids described by θ, p and inviscid fluids described by θ, ρ. Further, entropy- or Gibbs-equations hold even though dissipative effects (σ > 0) happen. In terms of φˆ we have 1 1 p 1 1 ˆ = ∂θ φˆ − 2 φ = − η − 2 (ε + − θη). ∂θ (φ/θ) θ θ θ θ ρ Letting h = ε + p/ρ be the enthalpy we can write ∂θ

 φˆ  θ

=−

h ; θ2

this result is usually referred to as the Gibbs-Helmholtz relation.

3.6 Second Law and Representation of Constitutive Functions

201

3.6 Second Law and Representation of Constitutive Functions Consider the Clausius-Duhem inequality in the generic form ˙ + w = θσ ≥ 0, − ρ(ψ˙ + η θ)

(3.13)

where w includes e.g. the pertinent expression of the power and the classical term −q · ∇θ/θ2 of heat conduction. Let  be a set of variables, e.g.  = (θ, α, u, K), α being a scalar, u a vector, K a tensor. To fix ideas, by assumption or by proof ˙ is not arbitrary and occurs ˙ If e.g. K let ψ = ψ() while w, σ depend on  and . linearly in (3.13) then we can write ˙ α, ˙ = g(θ, α, u, K, θ, ˙ ˙ u), AK · K

(3.14)

where AK is a second-order tensor. The following statement allows us to derive the ˙ provided only that AK = 0. general restriction placed on K, Lemma 3.1 (Representation) Given a second-order tensor A let N = A/|A|. If Z is a second-order tensor such that only Z · N is known, say Z · N = g, then4 Z = gN + (I − N ⊗ N)G

(3.15)

for any second-order tensor G. Equation (3.15) is merely the representation formula (A.8) where Z · N is indicated as equal to g. Now apply the Representation Lemma to Eq. (3.14) with ˙ A = AK . We find Z = K, ˙ = (K ˙ · AK ) AK + (I − AK ⊗ AK )G = g AK + (I − AK ⊗ AK )G. K |AK |2 |AK |2 |AK |2 |A2K | A suggestive geometrical interpretation follows by observing that ˙ · AK ) (K

AK AK AK ˙ =( ⊗ )K. |AK |2 |AK | |AK |

˙ along AK Hence thermodynamics places a restriction on the longitudinal part of K ˙ to AK unrestricted. A strictly analogous statement but leaves the orthogonal part of K ˙ holds for u.

4

The symbol I denotes the fourth-order identity tensor.

Chapter 4

Solids

The key property of solids is that they hold their shape because their molecules are tightly packed together. Atoms are essentially locked in position and, in the same way that a large solid holds its shape, the atoms inside of a solid are not allowed to move around too much. Electrons instead can move around and hence electromagnetic and thermal phenomena occur. The interest here is restricted to solids as simple materials namely materials described by first-order gradients of deformation, velocity, and temperature. Due to these features of solids, the chapter develops models of linear and nonlinear thermoelastic solids. Correspondingly equations are examined for discontinuity (shock) waves and time harmonic waves. Nonlinear elastic solids are modelled and related nonlinear waves are considered thus arriving at the Christoffel equation. The linear theory of elasticity leads to classical parameters such as the elasticity tensor and the Lamé moduli and to the transverse and longitudinal waves. Next attention is addressed to models of hyperelastic and rubber-like materials. Further, different approaches to the description of dissipation in solids are examined. The concept of phonon is introduced and shown numerically by means of vibrating strings, namely elastic arrangements of atoms in solids.

4.1 Thermoelastic Solids Thermoelasticity accounts for the interaction of the motion, within an elastic solid, with a nonuniform temperature field. Hence we say that a thermoelastic solid is characterized by constitutive functions of the variables F, θ, ∇θ, © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_4

203

204

4 Solids

that is the deformation gradient, the temperature, and the temperature gradient. The mass density ρ is related to the reference mass density ρ R by ρ det F = ρ R . By regarding ρ R as a parameter it follows that ρ is merely a function of F. In linear thermoelasticity ρ is regarded as a constant. For mathematical convenience we describe the stress via the first Piola stress T R . Hence we say that an admissible thermodynamic process is the set of functions χ, F, θ, ψ, η, T R , q, on  × R, where χ is the motion. As always, it is understood that det F > 0, θ > 0. For later convenience we show the following Lemma 4.1 At any point x˜ ∈  and time t˜ ∈ R, we can select admissible thermo˙ ∇F, ∇∇θ. ˙ ∇θ, ˙ θ, dynamic processes with arbitrary values of F, θ, ∇θ, F, To this end let P ⊂  be a neighbourhood of x˜ . Let  be a third-order tensor, K = K T and A be second order tensors, while g and a are vectors. Define the motion and the temperature, ˜ − X) ˜ + f (t)A(X − X) ˜ + [(X − X) ˜ ⊗ (X − X)], ˜ χ(X, t) = x˜ + F(X θ(x, t) = θ˜ + α f (t) + [g + f (t)a] · (x − x˜ ) + (x − x˜ ) · K(x − x˜ ), where θ˜ > 0 and det F˜ > 0. Moreover, f (t˜) = 0, f˙(t˜) = 1, and | f (t)| is small as ˜ t˜) and we please for any t. We find x˜ = χ(X, ˜ F(X, t) = F˜ + f (t)A + (X − X). The arbitrary smallness of f and the diameter of P implies that det F > 0 in B R . Moreover ˙ X, ˜ t˜) = A, F( ˜ t˜) = . ∇F(X, Also

˜ v(X, t) = f˙(t)A(X − X)

˜ and hence at x = x˜ . Now so that v = 0 at X = X ˙ x, t˜) = α θ(˜ while ∇θ(x, t) = g + f (t)a + K(x − x˜ ) whence

4.1 Thermoelastic Solids

205

∇θ(˜x, t˜) = g,

˙ x, t˜) = a, ∇θ(˜

∇∇θ(˜x, t˜) = K,

which concludes the proof.  We now derive the restrictions placed by the second law. Let ψ, η, T R , q, and k be functions of F, θ, ∇θ. Moreover let ψ be differentiable. The result is stated as follows. A necessary and sufficient condition that every admissible thermodynamic process obey the second law is that ˆ ψ = ψ(F, θ),

ˆ R (F, θ), TR = T

ˆ θ), Tˆ R (F, θ) = ρ R ∂F ψ(F, k = 0,

η = η(F, ˆ θ),

ˆ η(F, ˆ θ) = −∂θ ψ(F, θ),

ˆ q(F, θ, ∇θ) · ∇θ ≤ 0.

(4.1) (4.2) (4.3)

To prove this assertion we observe that J T · D = T R · F˙ and hence the entropy inequality (2.40) can be written 1 1 −ρψ˙ − ρη θ˙ + T R · F˙ − q · ∇θ + θ∇ · k ≥ 0. J θ By assumption ψ = ψ(F, θ, ∇θ). Hence the entropy inequality becomes ˙ − −ρ(∂θ ψ + η)θ˙ + (−ρ∂F ψ + T R /J ) · F˙ − ρ∂∇θ ψ · ∇θ

1 q · ∇θ + θ[∂F k · ∇F + ∂θ k · g + ∂∇θ k · ∇∇θ] ≥ 0. θ

In view of Lemma 4.1 we can select the process such that −ρ(∂θ ψ + η)α + (−ρ∂F ψ + T R /J ) · A − ρ∂∇θ ψ · a −

1 q · g + θ[∂F k ·  + ∂θ k · g + ∂∇θ k · K] ≥ 0, θ

˜ g. The arbitrariness of ˜ θ, the functions ∂θ ψ, η, ..., ∂∇θ ψ being evaluated at F, α, A, a, , K implies that the inequality holds only if (4.1), (4.2) hold with ∂F k = 0,

∂∇θ k ∈ Skw,

1 − q · g + θ∂θ k · g ≥ 0. θ

Since k depends only on the scalar θ and the vector ∇θ then we conclude that ∂∇θ k ∈ Skw implies ∂∇θ k = 0 and hence we have k(θ) = 0. Consequently the inequality (4.3) follows. Conversely, the validity of (4.1), (4.2), (4.3) implies that the entropy inequality (2.40) holds.  As a comment, for a thermoelastic solid T R and η are related to the free ˆ ˆ energy ψ(F, θ) by (4.2) while q(F, θ, ∇θ) satisfies the heat conduction inequality ˆ q(F, θ, ∇θ) · ∇θ ≤ 0. Some consequences of the thermoelastic model are of interest. By definition

206

4 Solids

ε˙ = ψ˙ + θη˙ + η θ˙ while, by (4.1) and (4.2),

˙ ˙ R − η θ. ψ˙ = T R · F/ρ

Since ψ = ε − θη it follows the Gibbs equation ˙ R, θη˙ = ε˙ − T R · F/ρ

˙ θη˙ R = ε˙R − T R · F.

˙ − ∇ · q + ρr we obtain In view of the energy equation ρε˙ = T R · F/J ρθη˙ = −∇ · q + ρr

(4.4)

which implies that a thermodynamic process is adiabatic (−∇ · q + ρr ≡ 0) if and only if it is isentropic (η˙ ≡ 0). ˆ θ) satisfy By (4.2), it follows that if ψˆ is a C 2 function then Tˆ R (F, θ)/ρ R and η(F, the Maxwell relation ˆ θ). ∂θ Tˆ R (F, θ)/ρ R = −∂F η(F, The specific heat is defined classically as the amount of energy, per unit mass, required to raise the temperature by one degree (Kelvin). In the continuum scheme, we look at the internal energy per unit mass ˆ ε = ε(F, ˆ θ) = ψ(F, θ) + θη(F, ˆ θ) and say that ˆ θ) c(F, θ) = ∂θ ε(F, is the specific heat. Substitution for εˆ results in ˆ c(F, θ) = ∂θ ψ(F, ˆ θ), θ) + η(F, ˆ θ) + θ∂θ η(F, whence, by (4.2), ˆ θ). c(F, θ) = θ∂θ η(F, By (4.2) we have T=

1 ˆ T R FT = ρ ∂F ψ(F, θ)FT . J

The symmetry of T requires that ˆ ∂F ψ(F, θ)FT ∈ Sym. Now if ψ depends on F through C = FT F then, in component form,

(4.5)

4.1 Thermoelastic Solids

207

∂ Fi K ψ F j K = ∂C P Q ψ ∂ Fi K (Fh P Fh Q )F j K = ∂C P Q ψ (Fi Q F j P + Fi P F j Q ) whence ∂F ψ FT = 2F ∂C ψ FT ∈ Sym and T = ρ ∂F ψ FT = 2ρ F ∂C ψ FT ∈ Sym. The dependence of ψ on C guarantees the symmetry of the Cauchy stress. In the material representation we can write T R (F, θ, ∇R θ),

q R (F, θ, ∇R θ).

Objectivity requires that QTˆ R (F, θ, ∇R θ) = Tˆ R (QF, θ, ∇R θ),

qˆ R (F, θ, ∇R θ) = qˆ R (QF, θ, ∇R θ),

for any orthogonal tensor Q. By the polar decomposition F = RU and the choice Q = R T we obtain Tˆ R (F, θ, ∇R θ) = FU−1 Tˆ R (U, θ, ∇R θ),

qˆ R (F, θ, ∇R θ) = qˆ R (U, θ, ∇R θ)

and hence T R R = F−1 T R is given by ˜ R R (U, θ, ∇R θ). Tˆ R R (F, θ, ∇R θ) = T Thus the objectivity requires that T R R , q R and the scalars ψ, η depends on F through U or C.1 As a comment we now examine a consequence of the heat conduction inequality. Let  K(F, θ) = −∂∇θ q(F, θ, ∇θ) ∇θ=0

be the conductivity tensor. Hence the heat conduction inequality q(F, θ, ∇θ) · ∇θ ≤ 0 and an expansion of q yield q(F, θ, 0) · ∇θ − ∇θ · K∇θ + o(|∇θ|2 ) ≤ 0. This inequality holds only if

1

See, e.g. [81, 405].

208

4 Solids

q(C, θ, 0) = 0,

K > 0.

Hence, in light of the second law of thermodynamics, the heat flux vanishes whenever the temperature gradient vanishes and the conductivity tensor has to be positive definite. A further comment is given about the entropy. Let η be a function of θ and the specific heat be constant. Then (4.5) can be written in the form η (θ) =

c . θ

Let θ0 be a reference temperature. Upon integration, from θ0 to θ, we have η(θ) − η(θ0 ) = c ln(θ/θ0 ). Now, integrate

ψ (θ) = −η(θ) = −η(θ0 ) − c ln(θ/θ0 )

to obtain ψ(θ) − ψ(θ0 ) = −η(θ0 )(θ − θ0 ) − cθ ln(θ/θ0 ) + c(θ − θ0 ). Hence ε = ψ + θη takes the form ε(θ) = ε(θ0 ) + c(θ − θ0 ). Quite often in the literature a simpler representation is adopted by letting η = c ln θ,

ψ = c θ(1 − ln θ),

ε = c θ.

4.1.1 Linear Theory Qualitatively linear theories are based on the view that the reference configuration is natural and furthermore some quantities, such as deformations and temperature variations, are small. Moreover we assume that the reference configuration is stress free. To make this view formal here we let θ0 be a reference temperature and look ˙ ∇θ. Now, since ˙ θ − θ0 , θ, for linear equations in ∇u, ∇ u, ∇R u = ∇u F then F = 1 + ∇R u = 1 + ∇u F = 1 + ∇u(1 + ∇uF)

4.1 Thermoelastic Solids

209

so that F = 1 + ∇u + o(|∇u|), ˙ + O(|∇u|), F˙ = ∇ u(1

C = 1 + 2E = 1 + 2sym∇u + o(|∇u|), ˙ = 2E, ˙ C

˙ E˙ = sym∇ u˙ + O(|∇u|)O(|∇ u|).

Thus we take the approximations F ≈ 1 + ∇u,

E ≈ sym∇u = ε, ˙ F˙ ≈ ∇ u,

C ≈ 1 + 2sym∇u,

˙ E˙ sym∇ u.

By objectivity, ψ is a function of F via C. Yet T R need not be symmetric, T R = ρ∂F ψ = 2 ρ F ∂C ψ. In the linear approximation we have F 1, ρ ρ R , T R R , T R T, and T 2 ρ R ∂C ψ(C, θ). For formal convenience we let ψ be a function of E and ϑ = θ − θ0 . Hence we let ˜ C = ρ R ∂E ∂E ψ(E, ϑ),

˜ M = ρ R ∂ϑ ∂E ψ(E, ϑ),

at E = 0, ϑ = 0. Hence we let T = CE + ϑM to within o(|∇u|, |ϑ|). The fourth-order tensor C is called the elasticity tensor and is symmetric. The second-order tensor M is called the stress-temperature tensor and is symmetric too. The stress-temperature tensor M gives the stress resulting from a given temperature variation when E = 0. As to the heat flux, an expansion with respect to ∇θ and the observation that q = 0 when ∇θ = 0 provide q = −K∇θ to within o(|∇θ|). Here K is the conductivity tensor and is evaluated at E = 0, ϑ = 0, ∇θ = 0. By η = η(E, ˜ ϑ) we obtain the time derivative η˙ in the form ˜ ϑ) · E˙ + ∂ϑ η(E, η˙ = ∂E η(E, ˜ ϑ) · E˙ + ∂ϑ η(E, ˜ ϑ)ϑ˙ = −∂ϑ T(E, ˜ ϑ)ϑ˙ ˙ = −M · E˙ + ∂ϑ η(E, ˜ ϑ)ϑ. Hence, since ρθη˙ = −∇ · q + ρr then we have

210

4 Solids

−ρθM · E˙ + ρθ∂ϑ η(E, ˜ ϑ) ϑ˙ = −∇ · q + ρ r. We thus obtain the basic equations of linearized thermoelasticity in the form ρ R u¨ = ρ R b + ∇ · T,

T = C E + ϑ M,

ρ R c ϑ˙ = −∇ · q + θ0 M · E˙ + ρ R r,

q = −K ∇ϑ.

As a comment, if the elasticity tensor C is invertible we can write E = K T + ϑ A. The tensors

K = C−1 ,

A = −K M

are called the compliance tensor and the thermal expansion tensor, respectively. Hence A determines the strain resulting from a given temperature when the stress vanishes. When the material is isotropic we can write2 C E = 2μE + λ(tr E)1,

M = m 1,

K = k 1.

The scalars μ and λ are called the Lamé moduli, m is the stress-temperature modulus, and k is the conductivity. In such a case the governing equations become ρ R u¨ = ρ R b + ∇ · T,

T = 2μ E + λ(tr E)1 + m ϑ 1,

ρ R c ϑ˙ = −∇ · q + m θ0 tr E˙ + ρ R r,

q = −k ∇ϑ.

Temperature variations result in pressure tensors in that T = m ϑ1 when E = 0. In addition if μ = 0 and 3λ + 2μ = 0 then we obtain the strain as a function of stress and temperature in the form E=

λ 1 T− (tr T)1 + α ϑ 1, 2μ 2μ(3λ + 2μ)

where α=−

(4.6)

m . 3λ + 2μ

The definitions of C and M embody a factor ρ R (or ρ). Hence μ, λ, m too are factored by ρ R . This in turn means that μ/ρ, λ/ρ, m/ρ keep finite, or constant, when ρ → 0.

2

4.1 Thermoelastic Solids

211

Since T = 2μE0 + κ(tr E)1 + mϑ1 then in stress-free conditions we have 0 = κ(tr E) + mϑ. Moreover, since tr E = 21 tr (HT H + HT + H) ∇R · u ∇ · u then3 ∇ · u = 13 αϑ. Since ∇ · u is the relative variation of the volume then α/3 is called the coefficient of thermal expansion.4 It is natural to assume that α > 0, m < 0.

4.1.2 Free Energy and Invariants A body is isotropic if every rotation is a symmetry transformation. For a thermoelastic solid this implies that ψ R (θ, F) = ψ R (θ, FQ) for all rotations Q. By the polar decomposition, F = VR, and the choice Q = R T we have ψ R (θ, F) = ψ R (θ, V) = ψˆ R (θ, B), B being the left Cauchy-Green tensor. The referential free energy ψ R = ρ R ψ and the first Piola stress T R are related by T R = ∂F ψ R (θ, F) = ∂F ψˆ R (θ, B) and hence

T R = ∂B ψˆ R (θ, B)∂F B.

Now, ∂ Fi K B j h = ∂ Fi K [F j P Fh P ] = δi j Fh K + F j K δi h . It then follows

3

T R = 2∂B ψˆ R (θ, B)F.

See, e.g. [272]. A different definition is given in the literature. In view of (4.6), A = α1 is called the thermal expansion tensor and then α, instead of α/3, is called the coefficient of thermal expansion [216].

4

212

4 Solids

−1 T To √ obtain the Cauchy stress T we observe that T = J T R F and J = det F = det B. Thus T is represented as a function of B (and θ) in the form

2 ∂B ψˆ R (θ, B) B. T= √ det B

(4.7)

A representation theorem [120] asserts that an isotropic scalar function of a symmetric tensor B may be expressed as a function of the principal invariants I1 (B) = tr B, I2 (B) = 21 [(tr B)2 − tr B2 ], I3 (B) = det B. Let IB be the triplet of invariants, IB = (I1 (B), I2 (B), I3 (B)), so that we can write ψ R = ψ R (θ, IB ). Hence the Cauchy stress T can be given the form 2 T = √ ∂B ψ R (θ, IB ) B. I3 Since ∂B I1 (B) = 1,

∂B I2 (B) = I1 (B)1 − B,

∂B I3 (B) = I3 (B)B−1 ,

we obtain 2 T = √ [I3 ∂ I3 ψ R (θ, IB ) 1 + (∂ I1 ψ R (θ, IB ) + I1 ∂ I2 ψ R (θ, IB ))B − ∂ I2 ψ R (θ, IB )B2 ]. I3 Now, by the Cayley-Hamilton equation B3 − I1 B2 + I2 B − I3 1 = 0, we have

and hence

B2 = I1 B − I2 1 + I3 B−1 T = β0 1 + β1 B + β2 B−1 ,

where  2 2 β0 = √ (∂ I1 ψ R (θ, IB ) + I1 ∂ I2 ψ R (θ, IB )), β1 = √ ∂ I1 ψ R (θ, IB ), β2 = −2 I3 ∂ I2 ψ R (θ, IB ). I3 I3

4.2 Elastic Solids

213

4.2 Elastic Solids A solid is a continuum characterized by structural rigidity and resistance to changes of shape or volume. This is made in mathematical form by saying that the stress, as well as the other constitutive properties, is related to the deformation. The thermal aspects, that is the dependence on the temperature or on the temperature gradient, and the heat conduction are modelled within thermoelasticity and hence ignored in elasticity. In elastic solids the Cauchy stress is a function of the ˆ deformation gradient, F, T = T(F). By the principle of objectivity, Tˆ is required to satisfy T ˆ ˆ = T(QF), QT(F)Q for any orthogonal tensor Q. By the polar decomposition theorem, F = RU, we can choose Q = R to obtain T ˆ ˆ T(F) = RT(U)R whence

−T −1 ˆ ˆ = U−1 T(U)U . F−1 T(F)F

Since J = det F = det U, the second Piola stress T R R = J F−1 T F−T is then a function of U, ˆ U−1 . T R R = (det U)U−1 T(U) Alternatively we may take T R R as a function of the Cauchy-Green tensor C = U2 . The reference free energy ψ R = ρ R ψ is taken to depend on the deformation gradient and hence, by the principle of objectivity, turns out to be a function of C; let ψ R = ψ˜ R (C). The thermodynamic restrictions are more directly established in the material description. The entropy inequality reduces to ˙ ≥ 0. −ψ˙ R + 21 T R R · C If ψ R is a function of C, ψ R = ψ˜ R (C), then the entropy inequality can be written in the form ˙ ≥ 0. (−∂C ψ R + 21 T R R ) · C By regarding b(X, t) as an arbitrarily assignable body force we can take F and F˙ as arbitrarily prescribed, at any point X and time t. Hence there are motions such that ˙ = F˙ T F + FT F˙ have arbitrarily prescribed symmetric values at any C = FT F and C X and t. Consequently the entropy inequality holds for all motions if and only if T R R = 2 ∂C ψ˜ R (C).

(4.8)

214

4 Solids

Since T = J −1 F T R R FT then the Cauchy stress takes the form T=

2 T ˜ . F ∂C ψ(C)F J

Materials satisfying (4.8) are termed hyperelastic. If the material is hyperelastic then the dependence of T R R on C has to satisfy ∂C T˜ R R (C) = 2∂C ∂C ψ˜ R ∈ Sym(Lin, Lin). / Sym(Lin, Lin) then the stress relation (4.8) does not hold If, instead, ∂C T˜ R R (C) ∈ and the entropy inequality simply says that ˙ ψ˙ R ≤ 21 T R R · C and the balance of energy yields ˙ + rR. ε˙R = ψ˙ R + (θη R )˙ = 21 T R R · C In such a case we say that T˜ R R (C) models an elastic solid, not a hyperelastic solid. ˜ A reference configuration is said to be natural if ψ(C) has a local minimum at C = 1. This means that ˜ ˜ ψ(C) ≥ ψ(1),

|C − 1| < δ,

for a suitable δ > 0. This implies that ˜ = 0 at C = 1. ∂C ψ(C) Hence T, T R , and T R R vanish at C = 1 or F = 1. In words, a natural reference configuration is stress free. Consequently, ˜ is positive semidefinite ∂C2 ψ(1) that is

˜ ≥ 0 ∀A ∈ Sym. A · ∂C2 ψ(1)A

Elastic materials are characterized by constitutive equations where the stress is determined by the strain. Hyperelastic materials are a particular case of elastic materials where the stress is derivable from an energy potential (of the strain). In general hypoelastic materials are characterized by a rate-equation ([428], p. 99) ◦

T= h (T, D).

4.2 Elastic Solids

215

Here we restrict attention to hypoelastic solids, in the Lagrangian description, as given by rate-independent constitutive equations in the form [310] ˙ T (T R R , C) T˙ R R = C (T R R , C) C. If the tensor T (T R R , C) is invertible then we can write ˙ T˙ R R = H (T R R , C) C; this case is investigated in Sect. 13.6.2.

4.2.1 Material Symmetries We now characterize the symmetries of a solid within the reference configuration. Each symmetry results in the invariance, of the second Piola stress and the free energy, to specific symmetry transformations namely rotations about specific axes and reflections about specific planes. Hence a material symmetry consists of a class, usually called symmetry group, of rigid transformations, of the reference shape, which are mechanically undetectable. A symmetry transformation can be represented by an orthogonal second-order tensor Q ∈ Orth; it is called rotation when det Q = 1 and reflection when det Q = −1. A set G of rotations forms a group if it is closed under multiplication and inversion, Q1 , Q2 ∈ G =⇒ Q1 Q2 ∈ G,

Q ∈ G =⇒ QT ∈ G.

The set of all rotations forms the proper orthogonal group Orth+ and G is a subgroup of Orth+ . Let A be a set of tensors, Lin+ , Sym, or Psym. The transform of A ∈ A under the rotation Q is QAQT . With a view to the free energy and the second Piola stress, we say that a scalar-valued function φ on A is invariant under G if φ(QAQT ) = φ(A) ∀A ∈ A and that a tensor-valued function  on A is invariant under G if (QAQT ) = Q(A)QT ∀A ∈ A. Let X be a point in the reference configuration R. If F is a (constant) deformation gradient then fF (Y) = X + F(Y − X) is a homogeneous deformation. If the body is subject to the rotation Q around X,

216

4 Solids

fQ (Y) = X + Q(Y − X) and then to the homogeneous deformation fF then the composite deformation is given by Y−X

Q



Q(Y − X)

F



FQ(Y − X).

The corresponding deformation gradient F2 is F2 = FQ and C2 = F2T F2 = QT C Q, If

C = FT F.

ψ˜ R (C) = ψ˜ R (QT CQ)

for every C ∈ Psym then Q is referred to as a symmetry transformation. The set G of all symmetry transformations is referred to as the symmetry group for the body. That G is indeed a group is shown as follows. If Q1 and Q2 belong to G then by applying first Q1 and then Q2 we have ψˆ R (F) = ψˆ R (FQ1 ), whence

ψˆ R (FQ1 ) = ψˆ R (F Q1 Q2 )

ψˆ R (F) = ψˆ R (F Q1 Q2 );

if Q1 , Q2 ∈ G then Q1 Q2 ∈ G. Now let Q ∈ G and look at any F. Hence ψˆ R (F) = ψˆ R (F Q). Letting F1 = F Q we can write F = F1 QT and hence we have ψˆ R (F1 ) = ψˆ R (F1 QT ), whereby if Q ∈ G then QT ∈ G. This concludes the proof that G is a group. By the stress relation T R R = 2∂C ψ˜ R it follows that the invariance of ψ˜ R implies ¯ = QT CQ and observe that, owing to the that of T R R . To show this property let C invariance of ψ R , ¯ ∂C H I C¯ AB C˙ H I . ψ˙ R (t) = ∂C H I ψ˜ R (C) C˙ H I = ∂C¯ AB ψ˜ R (C) Since ˜ R R (C)] H I , ∂C H I ψ˜ R (C) = [T

¯ = [T ˜ R R (C)] ¯ AB , ∂C¯ AB ψ˜ R (C)

∂C H I C¯ AB = Q H A Q I B ,

4.2 Elastic Solids

we can write

217

¯ − QT T˜ R R (C)Q] · C ˙ = 0. [T˜ R R (C)

˙ allows us to obtain the sought transformation law for the The arbitrariness of C second Piola stress, T˜ R R (QT C Q) = QT T˜ R R (C)Q. If every rotation is a symmetry transformation then we say that the body is isotropic. Accordingly, the body is said to be isotropic if G = Orth+ . If, instead, G is a proper subgroup of G then the body is said to be anisotropic. In particular, if G = Orth+ (a), the collection of all rotations about a fixed axis a, the body is said to be transversely isotropic, in that any plane that is normal to the axis a is a plane of isotropy. We now show some properties of ψ R and T R R for isotropic bodies. We start from the invariance property ψ˜ R (C) = ψ˜ R (QT C Q), for all rotations Q. Really, the property holds by replacing Q by −Q and hence is true for all orthogonal tensors. This means that ψ˜ R is an isotropic function, namely it is invariant under G = Orth. For isotropic bodies the free energy can be represented equivalently in terms of the right and left Cauchy-Green tensors C = FT F,

B = FFT .

By the polar decomposition, F = R U = V R, it follows that B = R C RT . Consequently, letting Q = R T we obtain ψ˜ R (C) = ψ˜ R (B). By

˙ = T R · F˙ ψ˙ R = 21 T R R · C

and taking ψ R = ψ˜ R (B) we have ˙ ψ˙ R = T R · F, Since

˙ ∂B ψ˜ R (B) · B˙ = T R · F.

218

4 Solids

˙ T + FF˙ T = 2sym(FF ˙ T) B˙ = FF and ∂B ψ˜ R (B) is symmetric then ˙ T ) = 2[∂B ψ˜ R (B)F] · F. ˙ ∂B ψ˜ R (B) · B˙ = 2∂B ψ˜ R (B) · (FF Thus, the entropy inequality yields ˜ ˙ = [2 ∂B ψ˜ R (B)F − T R ] · F. ˙ − TR R ] · C 0 ≥ [2 ∂C ψ(C) The arbitrariness of F˙ implies that T R is given by Tˆ R (F) = 2 ∂B ψ˜ R (B)F. Since T = J −1 T R FT and J = det F = (det B)1/2 then it follows that the Cauchy stress is a function of B, ˜ T(B) =

2 ∂B ψ˜ R (B)B. (det B)1/2

If P = M N, where M, N, P ∈ Sym, then NP = NMN = NT MT NT = (NMN)T = (NP)T and, hence, P commutes with N. The identifications P = T, M =

2 ∂B ψ˜ R (B), N = B (det B)1/2

allow us to say that T commutes with B, BT = TB. Let φ : Sym → R. If φ is invariant under G then ∂A φ(A) is also invariant under G. To prove this property, observe that φ(A) = φ(QT AQ)

∀Q ∈ G,

¯ ¯ ∂A A, ∂A φ(A) = ∂A φ(QT AQ) = ∂A¯ φ(A) ¯ = QT AQ. Now, where A ∂ A pq (QT AQ)i j = Q pi Q q j , whence

4.2 Elastic Solids

219

¯ = Q ∂A¯ φ(A) ¯ ∂A A ¯ QT . ∂A¯ φ(A) Upon substitution we obtain the sought relation ¯ = QT ∂A φ(A)Q. ∂A¯ φ(A) As an immediate consequence, we let A = B, φ = ψ˜ R , to find ˜ T BQ) = T(Q =

2 ¯ T BQ ∂ ¯ ψ˜ R (B)Q (det B)1/2 B

2 ˜ QT ∂B ψ˜ R (B)QQT BQ = QT T(B)Q, (det B)1/2

˜ which means that T(B) is isotropic. ˜ The isotropy of T(B) implies that T commutes with C, C T = T C. For this, since B = R C R T we let Q = R T and find T T ˜ ˜ ˜ ˜ C R T )R C R T = RT(C)R RCR T = RT(C)CR . T B = T(B)B = T(R

Likewise we find that Since B T = T B then

T ˜ . B T = R C T(C)R

T T ˜ ˜ = RT(C)CR , R C T(C)R

whence the sought result, C T = T C, follows.

4.2.2 Elastic Waves Nonlinear Waves We now investigate wave solutions, in elastic materials, in the form of singular surfaces of order 2 (acceleration waves or weak waves). Hence v = χ ˙ is continuous across the singular surface whereas v˙ , L and higher-order derivatives suffer jump discontinuities across the singular surface. It is convenient to describe the waves in the reference configuration. Denote by  the singular surface. By means of the first Piola stress T R we can write the equation of motion in the form ρ R v˙ = ∇R · T R + ρ R b,

220

4 Solids

while the constitutive equation is T R = T R (F). We let b be continuous across the singular surface and hence we have the jump condition ρ R [[˙v]] = [[∇R · T R ]]. We assume [[T R ]] = 0, so that Poisson’s condition is satisfied. We now apply repeatedly Maxwell’s theorem and the kinematical condition of compatibility. The continuity of T R implies that there exists a second-order tensor S such that ˙ R ]] = −U R S, [[T S := [[∂n R T R ]]. [[∇R T R ]] = S ⊗ n R , Since [[χ]] ˙ = 0 then ˙ = U R2 a, [[χ]] ¨ = −U R [[∂n R χ]] Moreover

Since

then we obtain

a := [[∂n R χ]].

˙ = [[∇ R v]] = [[∂n R v]] ⊗ n R = −U R a ⊗ n R . [[F]] ˙ R ]]n R U R [[∇R · T R ]] = −[[T ˙ R ]]n R = −ρ R U R3 a. [[T

˙ R ]] is now related to a via the constitutive property T R = T R (F). The jump [[T Denote by A the fourth-order tensor defined by Ai H j K = ∂ F j K Ti RH . Since F is continuous, across , then A is continuous too. Hence we have ˙ ˙ R ]] = A[[F]]. [[T ˙ we find Upon substitution of [[F]] ˙ R ]] n R = −U R A a, [[T where A depends on n R such that

4.2 Elastic Solids

221

Aa = A[a ⊗ n R ]n R . ˙ R ]]n R we find the propagation condition Equalling the two expressions of [[T A a = ρ R U R2 a. The second-order tensor A is called the reference acoustic tensor. Hence the vector a is an eigenvector of A and ρ R U R2 is the corresponding eigenvalue. This eigenvalue problem is parameterized by the direction n R of the surface . The directions n R of the eigenvectors are also called the acoustic axes. The notation A(n) emphasizes that A depends on n. Likewise, taking the current configuration as reference we find that the strictly analogous propagation condition holds with ρ, u n , n instead of ρ R , U R , n R . The results so obtained are stated as follows. Theorem 4.1 (Fresnel-Hadamard) The vector jump a of an acceleration wave in the direction n has to be an eigenvector of the acoustic tensor A(n). The corresponding propagation speed U is given by ρ R U R2 = w · A(n R )w,

w := a/|a|.

As any eigenvalue, ρ R U R2 is the solution to the characteristic equation (ρ R U R2 )3 − I1 (ρ R U R2 )2 + I2 ρ R U R2 − I3 = 0, I1 , I2 , I3 being the principal invariants of A(n R ). Shock waves are characterized by the singularity of the velocity field. We let [[χ]] = 0. Hence the geometrical and the kinematical conditions of compatibility imply [[F]] = [[∂n R χ]] ⊗ n R , [[v]] = −U R [[∂n R χ]]. The jump condition ρ R U R [[v]] + [[T R ]]n R = 0 takes the form ρ R U R2 [[∂n R χ]] − [[T R (∇ R χ)]]n R = 0, that is an equation in the unknown derivatives ∇ R χ = F. To fix ideas let the state ahead be in the reference state, that is F = 1 ahead of the wave. Hence T R (1) is the Piola stress ahead. For the state behind we can write T R = T R (1) + (∂ F jk T R )(F j K − δ j K ) + o(F − 1), the derivatives ∂ F jk T R being evaluated at F = 1. By neglecting the o(F − 1) term we follow the linearized approximation. Hence we have

222

4 Solids

[[T R ]] = A[[F]] and [[T R ]]n R = A[[∂n R χ]]. Consequently, in the linearized approximation, shock waves are governed by the propagation condition (4.9) (A(n R ) − ρ R U R2 1)[[∂n R χ]] = 0. As with the weak waves, in the linearized approximation of T R (F), the shock wave polarization [[∂n R χ]] is an eigenvector of A and ρ R U R2 is the corresponding eigenvalue. The propagation condition (4.9) is also called Christoffel equation while A is called the Green-Christoffel tensor.

Plane Progressive Waves The body is taken to be at rest in the reference configuration so that ∇R · T R + ρ R b = 0. A deformation is now superimposed; the variation of T R is considered in the linearized approximation, T R = A(F − 1) while ρ R is regarded as constant. The equation of motion can then be written in terms of the displacement function u(X, t) in the form ρ R ∂t2 u = A∇ R u,

ρ R ∂t2 u i = Ai H j K ∂ X H ∂ X K u j .

A plane wave propagating with phase speed c in the direction of the unit propagation vector p is represented by the function u(X, t) = f (X · p − ct) d, where f is a scalar-valued function and d is the unit vector defining the direction of motion (or polarization). Upon substitution in the equation of motion and letting φ = X · p − ct we find [ρ R c2 1 − A(p)]d f (φ) = 0, where A(p) = A p ⊗ p,

Ai j = Ai H j K p H p K .

4.2 Elastic Solids

223

Accordingly, plane waves are admissible, for any function f , in the direction of propagation p, if the polarization d is an eigenvector of A with eigenvalue ρ R c2 , A(p) d = ρ R c2 d. This in turn means that waves are allowed, with direction p and polarization d, if 0 < d · A(p) d = (d ⊗ p) · A(d ⊗ p). Strong ellipticity, that is (e ⊗ f) · A(e ⊗ f) > 0 for all vectors e and f, guarantees propagation with any pair of vectors p and d.

4.2.3 Linear Theory The linear theory of elasticity is derived as an approximation of the exact theory. We let the reference configuration be natural, that is stress free, and restrict attention to linear terms in F − 1, which is the content of ‘small deformations’ approximation. Since F = 1 + H, Hi j = ∂ X j u i , u being the displacement, we look for linear representations with respect to the displacement gradient H. The fourth-order tensor C = 2 ∂C T˜ R R (C), at C = 1, is referred to as the elasticity tensor. Since both T R R and C are symmetric it follows that, in the suffix notation, Ci j hk = C ji hk = Ci jkh . These relations are referred to as minor symmetries and hold for any elastic solid. If, further, the solid is hyperelastic then C = 4 ∂C2 ψ˜ R (C),

Ci j hk = 4 ∂Ci j ∂Chk ψ˜ R (C), at C = 1,

and hence C satisfies also the major symmetry conditions Ci j hk = Chki j . It is usually assumed that C is positive-definite, which means that

224

4 Solids

A · CA > 0

∀A ∈ Sym \ {0}.

This amounts to assuming that the free energy has a minimum at the reference state, C = 1. We now show that C is invariant under the symmetry group G of the material. Look at a time dependence C(t) such that ˙ C(0) =A

C(0) = 1, and consider

Hence we have

T R R (t) = T˜ R R (C(t)). ˜ R R (C) C, ˙ T˙ R R = ∂C T

˙ R R (0) = 1 C A. T 2

Now apply the invariance property of T R R , ˜ T C Q). QT T R R Q = T(Q Time differentiation and evaluation at t = 0 gives ˙ QT T˙ R R (0)Q = 21 C QT C(0)Q whence QT [C A]Q = C[QT A Q], that is the transformation law of C for any symmetry transformation Q and every symmetric tensor A. For formal convenience we let the reference configuration be stress free and hence let the second Piola stress T R R (C) satisfy T R R (1) = 0,

(∂C T R R )(1) = 21 C

so that T R R (C) = 21 C(C − 1) + o(|C − 1|). In terms of the Green-St.Venant strain tensor E = 21 (C − 1) we can write T R R = C E + o(|E|). To determine linear approximations it is convenient to refer to the displacement gradient H. Now F = 1 + H,

J = det F = 1 + O(|H|),

J −1 = 1 + O(|H|)

4.2 Elastic Solids

225

and hence T = J −1 FT R R FT = T R R + O(|H|),

T R = FT R R = T R R + O(|H|).

To within O(|H|) terms, the Cauchy and Piola stresses coincide. Moreover, ρ = J −1 ρ R = ρ R + O(|H|). The expansion of ψ R (C) about C = 1 can be written in the form ψ R (C) = ψ R (1) + (∂C ψ R )(1)(C − 1) + 21 (C − 1) · (∂C2 ψ R )(1)(C − 1) + o(|H|2 ). The assumptions that ψ R and T R R vanish at C = 1 and the definition of C, 1 C 2

= (∂C T R R )(1) = 2(∂C2 ψ R )(1),

allow ψ R to be given the form ψ R (C) = 18 (C − 1) · C(C − 1) + o(|H|2 ) = 21 E · CE + o(|H|2 ). The linear theory approximation of elasticity follows by restricting attention to the leading terms in H. We let T = CE, Ti j = Ci j hk E hk ;

ψ R = 21 E · CE, ψ R = 21 Ci j hk E i j E hk

and hence T = ∂E ψ R (E). The elasticity tensor C is assumed to be positive definite. Since ∂x j Ti j = ∂ X k Ti j Fk−1 j = ∂ X k Ti j (δk j + O(|H|)) = ∂ X j Ti j + O(|H|), in the linear approximation ∇ · T ≈ ∇R · T. Hence we can write the equation of motion in the form ρ R u¨ = ∇ R · T + ρ R b,

ρ R u¨ i = ∂ X j Ti j + ρ R bi .

If the body is homogeneous then C is constant and the equation of motion simplifies to ρ R u¨ i = Ci j hk ∂ X j ∂ X k u h + ρ R bi . ρ R u¨ = ∇ R · (CH) + ρ R b,

226

4 Solids

Linear Theory for Isotropic Materials If the body is isotropic then C is an isotropic fourth-order tensor (see Appendix A.2) and we have CE = 2 μ E + λ(tr E)1, where μ and λ are scalar constitutive parameters called the Lamé moduli. To account for the assumed positive definiteness of C we let E0 be the deviatoric part of E and let E = E0 + 13 (tr E)1. Hence the requirement, for E = 0, becomes 0 < E · CE = 2μ|E|2 + λ(tr E)2 = 2μ|E0 |2 + κ(tr E)2 , where κ = 13 (2μ + 3λ). If E = 1, so that tr E = 3 and E0 = 0, it follows κ > 0. If, instead, u = u 1 (X 2 , t)e1 then E = 21 ∂ X 2 u 1 (e1 ⊗ e2 + e2 ⊗ e1 ),

tr E = 0,

|E0 |2 = 21 |∂ X 2 u 1 |2 .

Consequently we find μ > 0. Thus C is positive definite if μ > 0,

2μ + 3λ > 0.

With these restrictions, the stress-strain law for an isotropic body is T = 2μE + λ(tr E)1 and the corresponding energy is ψ R = μ|E|2 + 21 λ(tr E)2 . The stress-strain law may be inverted. Since tr T = 2μ tr E + 3λ tr E then upon substitution we find E=

 λ 1  T− (tr T)1 . 2μ 2μ + 3λ

In the linear approximation ∇ · T ≈ ∇R · T. Now ∇R · T = 2μ∇R · E + λ∇R (tr E) and

4.2 Elastic Solids

227

(2∇R · E)i = ∂ X j (∂ X j u i + ∂ X i u j ) =  R u i + ∂ X i ∇R · u. Hence the equation of motion reads ρ R u¨ = μ R u + (λ + μ)∇R (∇R · u) + ρ R b. Starting from the equation of motion ρ R u¨ = ∇R · T + ρ R b,

T = CE,

and inner multiplying by u˙ we obtain ˙ − T · ∇R u˙ + ρ R b · u. ˙ ρ R ( 21 u˙ 2 )˙ = ∇R · (Tu) Since

T · ∇R u˙ = T · E˙ = 21 (E · CE)˙,

then we can write the global balance of energy in the form d 1 ∫ (ρ R u˙ 2 + E · CE)dv = ∫ Tn · u˙ da + ∫ ρ R b · u˙ dv, dt P 2 P ∂P for any region P ⊂ R. It is worth looking at the stress-strain relation in connection with three statical solutions. First we consider the pure shear u(X) = γ X 2 e1 . Hence the matrices [E] and [T] take the forms ⎤ 0γ0 [E] = ⎣ γ 0 0 ⎦ , 000 ⎡



⎤ 0 μγ 0 [T] = ⎣ μγ 0 0 ⎦ . 0 0 0

This shows the connection between stress and strain through the shear modulus μ. The second solution is that of uniform compressions or expansions, u(X) = u X. Hence we have E = u1,

T = 3(λ + 2μ/3)u 1.

This shows the role of the bulk modulus λ + 2μ/3. The third solution is related to a pure tension, the stress arising from a force density σ in one direction, say e1 ,

228

4 Solids



⎤ σ00 [T] = ⎣ 0 0 0 ⎦ . 000 Hence, by the strain-stress relation we find that ⎡

⎤ ε00 [E] = ⎣ 0  0 ⎦ , 00 where5 ε=

σ , E

 = −νε,

E=

μ(2μ + 3λ) , μ+λ

ν=

λ . 2(μ + λ)

The displacement field u(X) is then given by u(X) = εX 1 e1 + X 2 e2 + X 3 e3 . The material parameter E is the ratio of the tensile stress σ to the strain ε produced in the same direction; it is known as Young’s modulus. The material parameter ν is the ratio of the transverse strain to the longitudinal strain; it is called Poisson’s ratio. It is expected that an elastic solid should increase its length when pulled, decrease its volume when acted upon by a pressure, and respond to a positive shear strain by a positive shear stress. This is the case if E > 0,

2μ + 3λ > 0,

μ > 0.

We also might expect that ν > 0 so that the body undergoes a lateral contraction when it is subject to a longitudinal stress. Yet, μ + λ = 13 (2μ + 3λ + μ) > 0,

μ dν = > 0. dλ 2(μ + λ)2

Hence ν takes values from the minimum, at λ = −2μ/3, and the maximum, as λ → ∞. It then follows that ν ∈ (−1, 21 ). Thus there can be materials, called auxetics, that have a negative Poisson’s ratio. When stretched, they become thicker perpendicular to the applied force. This may occur due to their particular internal structure and the way this deforms when the sample is uniaxially loaded. Also with a view to the principal directions, consider a solid subject to the normal stresses T1 , T2 , T3 on the faces S1 , S2 , S3 . The strains {E i } generated by T1 are 5

The symbol ε is used here to denote the strain.

4.2 Elastic Solids

229

E1 =

1 T1 , E

E 2,3 = −

ν T1 . E

Strictly analogous relations hold for the strains generated by T2 and T3 . Due to linearity, the joint application of T1 , T2 , T3 results in the strains E1 =

1 ν T1 − (T2 + T3 ), E E

E2 =

1 ν T2 − (T1 + T2 ), E E

E3 =

1 ν T3 − (T1 + T2 ). E E

(4.10)

Linear Waves For formal simplicity here we let X = x. We show how the equation of motion of linear isotropic elasticity, ρu¨ = μu + (λ + μ)∇(∇ · u) allows plane wave solutions with constant phase speed. A plane wave propagating with phase speed c in the direction defined by the unit propagation vector p is represented by the function u(x, t) = f (x · p − ct)d, where f is a scalar-valued function and d is the unit vector defining the direction of motion. Let φ = x · p − ct. It follows that ∇u = d ⊗ p f (φ),

∇ · u = (p · d) f (φ), u = f (φ)d,

∇(∇ · u) = (p · d) f (φ)p,

u¨ = c2 f (φ)d.

Substitution in the equation of motion yields [μd + (λ + μ)(p · d)p − ρc2 d] f (φ) = 0, which obtains for any function f if and only if (μ − ρc2 )d + (λ + μ)(p · d)p = 0.

230

4 Solids

A vector equation αd + βp = 0 holds if β = 0 and hence α = 0 or β = 0 and hence d  p. Look at the two solutions, subject to λ + μ = 0. In the first case p · d = 0,

c = cT := (μ/ρ)1/2 .

The motion is normal to the direction of propagation and the solution is called a transverse wave. Since p · d = 0 then ∇ · u = 0 and that is why the solution is called an equivoluminal wave; other terms are also rotational wave, shear wave, and S-wave (secondary, shear). The displacement u can have any direction d in a plane orthogonal to the direction of propagation p. In the second case d = ±p,

c = cL := [(λ + 2μ)/ρ]1/2 .

The displacement u is then parallel to the direction of propagation and the wave is called a longitudinal wave. The irrotationality condition ∇ × u = (p × d) f (φ) = 0 motivates the alternative term irrotational wave. This solution is also called a dilatational wave, a pressure wave, or a P-wave (primary, pressure). Since 2μ + 3λ > 0 then

3λ + 2μ λ + 2μ = 13 + 4 > 43 . μ μ Hence it follows

λ + 2μ 1/2 cL = > 1, cT μ

and that is the motivation for the terms primary and secondary associated with longitudinal and transverse waves. This is true for any pair of parameters λ, μ, subject to 2μ + 3λ > 0, and then also for auxetic materials; ν = λ/2(μ + λ) ∈ (−1, 21 ). Sinusoidal or time-harmonic waves are the particular case where f (φ) = sin φ or f (φ) = exp(iφ)).

Anisotropic Elastic Solids In anisotropic solids the stiffness tensor C is more involved. As with any fourthorder tensor, the matrix associated with C has 34 = 81 entries. Yet the number of independent with entries is significantly smaller. By the symmetry of the stress T and of the strain ε the constitutive relations

4.2 Elastic Solids

231

T = Cε,

C = 4 ∂C2 ψ˜ R (C),

imply the validity of major and minor symmetries. Hence there are at most 21 different entries. This is apparent by observing that, because of the minor symmetries, the significant entries [Ci jkl ] of C are represented by a 6 × 6 matrix [Cαβ ] [Ci jkl ]

⇐⇒

[Cαβ ]

via the correspondence i j = 11; 22; 33; 23, 32; 13, 31; 12, 21

⇐⇒

α = 1, 2, ..., 6

between pairs of indices, with values from 1 to 3, and a single index with values from 1 to 6. Now, because of the major symmetries, the 6 × 6 matrix ⎡

C11 ⎢C21

Cαβ = ⎢ ⎣··· C61

C12 C22 ··· C62

C13 C23 ··· C63

C14 C24 ··· C64

C15 C25 ··· C65

⎤ C16 C26 ⎥ ⎥ ···⎦ C66

is symmetric and hence only 6 + (36 − 6)/2 = 21 entries are independent. Transversely isotropic solids have a single axis of symmetry and hence have 5 independent entries ⎤ ⎡ C11 C11 − 2 C66 C13 0 0 0 ⎢C11 − 2 C66 C11 C13 0 0 0 ⎥ ⎥ ⎢ ⎢

C13 C33 0 0 0 ⎥ C13 ⎥. ⎢ Cαβ = ⎢ 0 0 0 C44 0 0 ⎥ ⎥ ⎢ ⎣ 0 0 0 0 C44 0 ⎦ 0 0 0 0 0 C66 Orthotropic solids have 9 independent entries. The matrix takes the form ⎡

C11 ⎢C12 ⎢ ⎢C13 [Cαβ ] = ⎢ ⎢ 0 ⎢ ⎣ 0 0

C12 C22 C13 0 0 0

C13 C23 C33 0 0 0

0 0 0 C44 0 0

0 0 0 0 C55 0

⎤ 0 0 ⎥ ⎥ 0 ⎥ ⎥. 0 ⎥ ⎥ 0 ⎦ C66

Following Voigt notation, we let T˜1 = T11 , T˜2 = T22 , T˜3 = T33 , T˜4 = T23 , T˜5 = T13 , T˜6 = T12 , and ε˜1 = ε11 , ε˜2 = ε22 , ε˜3 = ε33 , ε˜4 = 2ε23 , ε˜5 = 2ε13 , ε˜6 = 2ε12 .

232

4 Solids

Hence the stress-strain relation becomes T˜α = Cαβ ε˜β . Moreover

Ti j εi j = T˜α ε˜α

and then Ti j εi j takes the form of a inner product. Instead, εi j εi j = ε˜21 + ε˜22 + ε˜23 + 21 (ε˜24 + ε˜25 + ε˜26 ) = ε˜α ε˜α , thus showing that the inner product structure is not preserved. In the Mandel (or Mandel-Voigt) notation the stress and strain components are handled in the same way by considering the sextuples T˜ M = (T11 , T22 , T33 , ε˜M = (ε11 , ε22 , ε33 ,

√ √

2 T23 ,

2 ε23 ,





2 T13 ,

2 ε13 ,





2 T12 ),

2 ε12 ).

Hence the inner product structure is preserved, Ti j Ti j = T˜αM T˜αM ,

εi j εi j = ε˜αM ε˜αM ,

and the corresponding relation

Ti j εi j = T˜αM ε˜αM ,

˜ M ε˜M T˜αM = C αβ β

holds with ⎡

C1111 C1122 C1133 ⎢ C C2222 C2233 2211 ⎢ ⎢ C C ⎢ 3311 3322 M ˜ = ⎢√ √ √C3333 C ⎢ √2 C2311 √2 C2322 √2 C2333 ⎢ ⎣ 2 C1311 2 C1322 2 C1333 √ √ √ 2 C1211 2 C1222 2 C1233

√ √2 C1123 √2 C2223 2 C3323 2 C2323 2 C1323 2 C1223

From the equation of motion, ρu¨ = ∇ · T + ρb, in the linear approximation we have ρ∂t2 u = ∇ · (C ε) + ρb, ρ being a constant. Consequently,

√ √2 C1113 √2 C2213 2 C3313 2 C2313 2 C1313 2 C1213

⎤ √ 2 C 1112 √ ⎥ √2 C2212 ⎥ ⎥ 2 C3312 ⎥ ⎥. 2 C2312 ⎥ ⎥ 2 C1312 ⎦ 2 C1212

4.2 Elastic Solids

233

(1 ∂t2 − A(∇))u = b, where A(∇) is the operator given by A(∇)|ik =

1 ∂x Ci j hk ∂xk , ρ j

and is referred to as the acoustic differential operator. Look for plane waves, of the form u(x, t) = p exp[i(k · x − ωt)], where p is the possible direction of u (polarization) and k is the direction of propagation. These waves are solutions to the equation of motion with zero forcing (b = 0) if and only if (1 ω 2 − A(k))p = 0, so that ω 2 and p constitute an eigenvalue/eigenvector pair of the acoustic algebraic operator 1 A(k)|i h = k j Ci j hl kl . ρ Let n = k/|k|. The propagation condition, which is also known as the Christoffel equation, may be written as A(n)p = c2 p, where c = ω/|k| is the phase velocity in the direction of k.

4.2.4 Elastostatics and Compatibility of Strains Linear elastostatic solids are investigated at equilibrium. The linear approximation is considered and the constitutive equation for the Cauchy stress is taken in the form T = Cε,

ε = sym∇u.

We look for time independent functions u(x). Consistently we assume ρ = ρ0 , ρb = bˆ and hence the body force is constant. The equation of motion simplifies to the equilibrium equation ∇ · T + bˆ = 0; in components

∂x j Ci j hk ∂xk u h + bˆi = 0,

234

4 Solids

whence6

Ci j hk u h, jk + bˆi = 0.

Attention is now restricted to isotropic solids. Hence T = 2με + λ(tr ε)1,

ε=

λ 1 (T − (tr T)1). 2μ 2μ + 3λ

where μ and λ are the Lamé moduli (or parameters). In components we have Ti j = μ(u i, j + u j,i ) + λu k,k δi j and the equilibrium equation reads (μ + λ)u j,i j + μu i, j j = −bˆi .

(4.11)

Since ∇ · bˆ = 0, the divergence of the equilibrium equation results in (2μ + λ)u i,i j j = 0. Consequently ∇ · u = 0 and hence ∇∇ · u = 0.

(4.12)

The Laplacian of (4.11) implies (μ + λ)u j,i jkk + μu i, j jkk = 0,

(μ + λ)∇∇ · u + μu = 0.

In view of (4.12) we find u = 0; the displacement u satisfies the biharmonic equation.

Compatibility Conditions Given a (infinitesimal) strain tensor field ε(x) we look for the conditions guaranteeing that ε is associated with a displacement field u(x). The solution of this problem traces back to Saint-Venant [398] and Beltrami [35]. Given a displacement field u we can write ∇u = ε + ξ ,

6

u i, j = 21 (u i, j + u j,i ) + 21 (u i, j − u j,i ).

For formal convenience here we use subscripts to denote partial derivatives, e.g. , j = ∂x j .

4.2 Elastic Solids

235

Now, ξi j,k = 21 (u i, jk − u j,ik ) = 21 (u i, jk + u k,i j − u j,ik − u k,i j ) = εik, j − ε jk,i . Requiring that ξi j,kl = ξi j,lk we find 0 = ξi j,kl − ξi j,lk = εik, jl − ε jk,il + εil, jk − ε jl,ik . Hence we have the compatibility equations ik jl := εik, jl − ε jk,il − εil, jk + ε jl,ik = 0. Due to the symmetry conditions ik jl =  jlik = ki jl = ikl j there are only six distinct components of  = 0.

4.2.5 Vibrating Strings, Chains, and Phonons Consider a string as a one-dimensional continuum with mass density (per unit length) ρ parameterized by x ∈ [0, L]. The string is assumed to be a plane curve and hence we let (x, y) = (x, u(x)) be the current point. A vanishing displacement in the xdirection implies that the x component of the traction, tx , satisfies tx (x2 ) − tx (x1 ) = 0, ∀x1 , x2 ∈ [0, L]. Let τ = |t| and θ be the angle of t with the horizontal. Then tan θ = ∂x u and  tx = τ cos θ = τ / 1 + (∂x u)2 . We assume |∂x u|  1 and then conclude that τ = constant. The balance of linear momentum in the u-direction implies x2

∂t ∫ ρ∂t u d x = t y (x2 ) − t y (x1 ) = τ [∂x u(x2 ) − ∂x u(x1 )]. x1

Differentiation with respect to x2 yields the wave equation ρ∂t2 u = τ ∂x2 u. It follows that perturbations of u propagate with speed v =

√ τ /ρ.

236

4 Solids

Phonons A phonon is a collective excitation in a periodic elastic arrangement of atoms in solids. Starting from simple discrete models we show how continuum properties of motion are established quite similar to the string oscillations. As a poor, simple model a crystal is associated with a space lattice of points (atoms or molecules). The forces between each pair of points may be characterized by a potential energy that depends on the distance. For simplicity atoms and forces are modelled by masses and springs. Let c be the elastic constant of the springs and m the mass of the atoms of a chain. Assuming interaction between nearest neighbours then the displacement u n (t) of the nth atom is governed by the differential equation m u¨ n = c(u n+1 + u n−1 ) − 2cu n .

(4.13)

To fix ideas we assume the chain consists of N atoms connected with N + 1 springs; the fixed extremes of the chain are placed at x = 0, x = l. At equilibrium, xn = na, a = l/(N + 1). Look for solutions in the form u n (t) = u sin(nβ) sin(ωt + α);

(4.14)

the vanishing of u n at n = 0 and n = N + 1 implies that (N + 1)β = pπ, p ∈ N. Substitution in (4.13) yields −mω 2 sin(nβ) = c[sin(n + 1)β + sin(n − 1)β − 2 sin nβ]. Since sin(n ± 1)β = sin nβ cos β ± cos nβ sin nβ it follows that (4.14) is a solution if c ω 2 = 4 sin2 β/2. m Since β = pπ/(N + 1) then there are solutions parameterized by p, ( p)

u n (t) = u ( p) sin(npπ/(N + 1)) sin(ω p t + α p ),

 ω p = 2 c/m sin( pπ/2(N + 1)).

Only the solutions labelled by p = 1, 2, ..., N are linearly independent; the solutions labelled by p = N + 2, N + 3, ... coincide with those labelled by p = 1, 2, .... Hence the general solution for u n can be written in the form u n (t) =

N

p=1 u

( p)

sin(nk p a) sin(ω p t + α p ).

where k p = pπ/l and a = l/(N + 1). In light of the identity sin θ sin φ = φ) − 21 cos(θ + φ) we can write u n (t) =

( p) 1 N [cos(nk p a p=1 u 2

1 2

cos(θ −

− ω p t − α p ) − cos(nk p a + ω p t + α p )].

4.2 Elastic Solids

237

Thus u n (t) is the superposition of right-propagating waves, cos(nk p a − ω p t − α p ), and left propagating waves, cos(nk p a + ω p t + α p ). Further, ω p is the angular frequency while k p is the wavenumber. The ratio ω p /k p is the phase speed v p ,  sin(k p a/2) v p = 2 c/m . kp The limit of a chain that reduces to a string is performed by letting m = ρa, c = τ /a, while (N + 1)a = l. We obtain vp =

2 a



τ sin(k p a/2) . ρ kp

√ The continuum limit, namely a → 0, yields v p = τ /ρ, as with the string oscillations. For a number of applications the behaviour of diatomic crystals (e.g. NaCl, KCl, KI, ZnS) is well described by a linear chain of atoms of two types. Consider a chain where a mass m is placed at x = a, 3a, ..., (2N − 1)a and a mass M is placed at x = 2a, 4a, ..., 2N a. Further, the chain is fixed at the extremes x = 0, x = (2N + 1)a. The 2N equations of motion are given the form of the system M u¨ 2n + c(−u 2n+1 − u 2n−1 + 2u 2n ) = 0, n = 1, ..., N , m u¨ 2n−1 + c(−u 2n − u 2n−2 + 2u 2n−1 ) = 0, n = 1, ..., N . A solution is sought in the form u 2n (t) = P sin(2nβ) sin(ωt + α), u 2n−1 (t) = D sin((2n − 1)β) sin(ωt + α), with n = 1, 2, ..., N . Upon substitution and the observation that sin((2n ± 1)β) = sin(2nβ) cos β ± cos(2nβ) sin β, sin((2n − 1 ± 1)β) = sin((2n − 1)β) cos β ± cos((2n − 1)β) sin β, we obtain the algebraic system (−ω 2 M + 2c)P − 2c cos β D = 0,

−2c cos β P + (−ω 2 m + 2c)D = 0

in the unknowns P, D. Non-trivial solutions hold if the corresponding determinant vanishes, namely m Mω 4 − 2c(m + M)ω 2 + 4c2 sin2 β = 0. Since M > m it follows that two distinct real positive values of ω 2 hold,

238

4 Solids

ω 2 = c(

1/2 1 c  1 + )± (M − m)2 + 4m M cos2 β . m M mM

The vanishing of u at x = 0 is apparent; we require the vanishing at (2N + 1)a to obtain 0 = u 2N +1 = D sin[(2N + 1)β] sin(ωt + α). It follows that (2N + 1)β = pπ whence β = βp = p Let kp =

π . 2N + 1

βp π =p . a (2N + 1)a

Only the solutions with p = 1, 2, ..., N are distinct and hence there are 2N frequencies, 1/2 1 1 c  2 (M − m)2 + 4m M cos2 k p a (ω ± . + )± p ) = c( m M mM + + As p increases, from p = 1 to p = N , the frequency ω + p decreases from ω1 to ω N ,

 1 1/2

1 1 c 1 1/2 ω1+ = c( + ) + < 2c( + ) , [(M − m)2 + 4m M cos2 k N a]1/2 m M mM m M

1/2  2c 1/2  1 1 c [(M − m)2 + 4m M cos2 k N a]1/2 ω N+ = c( + ) + > . m M mM m − − Conversely, ω − p increases from ω1 to ω N ,

 1 1/2 1 c > 0, ω1− = c( + ) − [(M − m)2 + 4m M cos2 k N a]1/2 m M mM  1 1/2  2c 1/2 1 c [(M − m)2 + 4m M cos2 k N a]1/2 < . ω N− = c( + ) − m M mM M Since

 2c 1/2 M


0.

4.3 Internal Constraints

241

then N = − p 1,

p = −q det F.

The scalar p may be viewed as the reactive pressure. The reaction space N R (F) is the one-dimensional subspace in the direction of the identity 1 and hence N R (F) ∈ Sym. Another example is given by the inextensibility constraint, in a direction e; it is expressed by λ > 0. γ R (F) = Fe · Fe − λ = 0, Since ∂F γ R = 2(Fe) ⊗ e then N = q(Fe) ⊗ (Fe). In general, because of the reaction N, the constitutive relation for a constrained elastic solid may be written as T = N + G R (F), where G R has to be defined only for F ∈ C R . Hence G R (F) is the response function of the constrained (elastic) material relative to the reference configuration. Another approach is now considered subject to the linear approximation of elasticity. We take the view that a uniform internal constraint is a prescription on the set of possible strains [362, 366]. For definiteness, assume the possible strains are those orthogonal to a tensor V ∈ Sym in that V · E = 0, E being the Green-St. Venant strain tensor. Then the associated constraint space C R is given by C R = {E ∈ Sym : V · E = 0}. Hence we can write

Sym = C R ⊕ C R⊥ ,

where C R⊥ is the subspace spanned by V. This in turn allows the splitting of the stress tensor into a reactive and an active part as T = N + T(a) . In linear elasticity T(a) is given by a linear transformation of C R into itself, C : CR → CR , while N ∈ C R⊥ and hence

N = q V,

T(a) = CE q ∈ R.

242

4 Solids

˙ Given a smooth strain function E(t), t ∈ [0, d], on C R then E(t) ∈ C R for any t ∈ (0, d). As a consequence, reactive stresses expend no power in any admissible motion, N · E˙ = 0. A constraint space C R is proper if dim C R < 6. The rigidity constraint corresponds to C R = {0} and hence C R⊥ = Sym. Moreover, T(a) = 0 and hence T = N is arbitrary in that it is not constitutively determined. A simple example of proper internal constraint is the preservation of volume. In this case V = 1 and then C R⊥ = Sph,

C R = Dev = {E ∈ Sym : 1 · E = tr E = 0}.

In the linear approximation tr E tr ε = ∇ · u and hence the V = 1 means incompressibility. Therefore N = − p 1, where − p is the undetermined scalar q. It is natural to view p as the reactive pressure. Again consider the constraint of inextensibility in a direction e, that is Fe · Fe = 1. Since Fe · Fe = e · FT Fe = e · Ce = 1 + 2E · e ⊗ e then the constraint becomes E·e⊗e =0 and hence V = e ⊗ e. Consequently, C R⊥ = {E ∈ Sym : E = λe ⊗ e, λ ∈ R},

C R = {E ∈ Sym : e · Ee = 0}.

It follows that N = q e ⊗ e,

q ∈ R.

We may view q e ⊗ e as the reactive tension along e. We observe that internal constraints affect material symmetries. Indeed, the occurrence of a constraint reduces the domain of the elasticity tensor C to the constraint space C R and then a rotation Q ∈ Orth+ is a symmetric transformation of the material provided that QE ∈ C R for all E ∈ C R . Hence, we may associate to each constraint space C R a constraint group G(C R ) consisting of all rotations which leave C R invariant, G(C R ) = {Q ∈ Orth+ : QC R = C R }.

4.4 Hyperelastic Solids and Rubber-Like Materials

243

This group specifies the maximal material symmetry compatible with the given constraint, in the sense that the symmetry group G of a linearly elastic material is compatible with a constraint C R if G ⊂ G(C R ). For instance, if C R = {E ∈ Sym : e · Ee = 0} then G(C R ) contains Orth+ (e) but not Orth+ itself. Consequently, the constraint of inextensibility is compatible with the response symmetry of transversely isotropic materials (having the same preferred direction) but is not compatible with isotropic materials.

4.4 Hyperelastic Solids and Rubber-Like Materials By (4.7), in isotropic thermoelastic solids the Cauchy stress is given by the (referential) free energy ψ R in the form 2 ∂B ψ R (θ, B) B. T= √ det B

(4.19)

Hyperelastic solids7 are often assumed to be characterized by (4.19) with the restrictions that ψ R is independent of θ and the material is incompressible. If the material is incompressible then det B = 1 and, moreover, there is an undetermined scalar parameter p so that T = 2 ∂B ψ R (θ, B) B − p1. Also to establish known models applied in the literature we let the free energy be given by ψ R (θ, B) = ψ0 (θ) + W (θ, B), where W (θ, B) = 0 when B = 1. Let λ1 , λ2 , λ3 be the eigenvalues of the stretch tensors U and V and hence the square roots of the eigenvalues of B and C. The principal invariants {Ii } of B (and C) are then related to the eigenvalues {λi } by I1 = λ21 + λ22 + λ23 ,

7

I2 = λ21 λ22 + λ22 λ23 + λ23 λ21 ,

I3 = λ21 λ22 λ23 .

In hyperelasticity the stress tensor is assumed to derive from a strain energy function; really the thermodynamic consistency of thermoelastic solids implies that the stress derives from the free energy function.

244

4 Solids

Moreover J = λ1 λ2 λ3 . Denote by

B¯ = (det B)−1/3 B

the unimodular tensor of B. The corresponding invariants are I¯1 = J −2/3 I1 ,

I¯2 = J −4/3 I2 ,

while I¯3 = det B¯ = 1. In the undeformed state, F = 1, it follows J = 1,

I¯1 = I1 = 3,

I¯2 = I2 = 3.

The generalized Rivlin model, or polynomial hyperelastic model, involves the free energy in terms of the invariants in the form W =

N  i, j=0

C hk ( I¯1 − 3)h ( I¯2 − 3)k +

M 

Dm (J − 1)2m ,

(4.20)

m=0

where {C hk }, {Dm } are material parameters, possibly dependent on the temperature θ. Frequently used models in applications are particular cases of (4.20). Neo-Hookean model For general materials the relationship between applied stress and strain is initially linear (Hookean) but then the corresponding curve changes to nonlinear. An incompressible Neo-Hookean material is the simplest nonlinear model. The strain energy W is defined to be W = C10 ( I¯1 − 3), where C10 is a material constant. Incompressibility implies that J = λ1 λ2 λ3 = 1. Treloar model A correction to the Neo-Hookean material is given by the Treloar model [423] where W = C10 ( I¯1 − 3) + D(J − 1)2 . The occurrence of J − 1 means that the material is not strictly incompressible the smallness of J − 1 justifies the dependence of λ1 , λ2 , λ3 via the constraint λ1 λ2 λ3 = 1.

4.4 Hyperelastic Solids and Rubber-Like Materials

245

Mooney-Rivlin model The standard model of Mooney-Rivlin material [315, 379] follows from (4.20) by letting J = 1 (incompressibility assumption) and i, j ≤ 1; the model then is represented by W = C10 ( I¯1 − 3) + C01 ( I¯2 − 3). Signorini model The Signorini model describes hyperelasticity by means of the right Cauchy-Green tensor C in the form W = W v (J ) + W i ( I¯1 , I¯2 ), ¯ = J −2/3 C. In particular, where I¯1 , I¯2 are the invariants of the unimodular tensor C v v W (J ) can be taken as in the Treloar model, so that W (J ) = D(J − 1)2 . Ogden model Differently from the previous relations, Ogden model [347] is characterized by a strain energy W that depends on the principal stretch ratios {λi } in the form W =

N  μi αi (λ1 + λα2 i + λα3 i − 3), α i i=1

where μi , αi are empirically determined material constants. It is mainly applied in connection with rubber and polymer and, due to the material constants, shows usually a better flexibility.

4.4.1 Compressible Mooney-Rivlin Solid Let the referential free energy function be given by ψ R = ψ0 (θ) + W (θ, I¯1 , I¯2 , J ) By (4.19), to determine the Cauchy stress we need to evaluate ∂B W . We first observe that W is a composite function of B and ∂B W = ∂ I¯1 W ∂B I¯1 + ∂ I¯2 W ∂B I¯2 + ∂ J W ∂B J. Now,

∂B I¯1 = ∂B [(det B)−1/3 tr B] = − 13 I¯1 B−1 + (det B)−1/3 1,

246

4 Solids

∂B I¯2 = ∂B [(det B)−2/3 21 [(tr B)2 − tr B2 ] = − 23 I¯2 B−1 + J −4/3 (tr B 1 − B), ∂B J = ∂B (det B)1/2 = 21 (det B)1/2 B−1 = 21 J B−1 . Hence we find that T= √

2 det B

2 −2/3 J (∂ I¯1 W + I¯1 ∂ I¯2 W )B − J −4/3 ∂ I¯2 W B2 J

2 ¯ ( I1 ∂ I¯1 W + 2 I¯2 ∂ I¯2 W ) 1. + ∂J W − 3J (4.21)

∂B ψ R (B) B =

By a direct check it follows that tr {J −2/3 (∂ I¯1 W + I¯1 ∂ I¯2 W )B − J −4/3 ∂ I¯2 W B2 ] − 13 ( I¯1 ∂ I¯1 W + 2 I¯2 ∂ I¯2 W )1} = 0 for any function W . Consequently, tr T = 3 ∂ J W. If we let p be defined by p := − 13 tr T then we may view p as the pressure and find that p = −∂ J W. Hence it follows from (4.21) that T can be written in the form T = −p 1 +

2 ¯ 2 −2/3 J (∂ I¯1 W + I¯1 ∂ I¯2 W )B − J −4/3 ∂ I¯2 W B2 − ( I1 ∂ I¯1 W + 2 I¯2 ∂ I¯2 W )1. J 3J

(4.22)

or T = − p1 + T0 ,

T0 =

2 −2/3 J (∂ I¯1 W + I¯1 ∂ I¯2 W )B − J −4/3 ∂ I¯2 W B2 0 . J

We now specialize the stress relation (4.22) to the compressible Mooney-Rivlin model; the corresponding free energy follows from (4.20) by letting N = 1, M = 1, namely (4.23) W = C1 ( I¯1 − 3) + C2 ( I¯2 − 3) + D1 (J − 1)2 . Hence p = −2D1 (J − 1) and (4.22) becomes

4.4 Hyperelastic Solids and Rubber-Like Materials

247

T = − p 1 + 2 J −1 J −2/3 (C1 + I¯1 C2 )B − J −4/3 C2 B2 − 23 J −1 (C1 I¯1 + 2C2 I¯2 )1. For an incompressible Mooney-Rivlin material J = 1 and hence p = 0, I¯1 = I1 , I¯2 = I2 . As a consequence it follows that T = − p1 + T(a) , p being the reactive pressure (see Sect. 5.3) and T(a) = 2(C1 + I1 C2 )B − 2 C2 B2 − 23 (C1 I1 + 2C2 I2 )1,

(4.24)

which satisfies the normalization condition tr T(a) = 0. Since J = 1, the CayleyHamilton theorem implies B−1 = B2 − I1 B + I2 1. Hence, upon replacing B2 = B−1 + I1 B + I2 1 we find T(a) = − p ∗ 1 + 2C1 B − 2C2 B−1 where

(4.25)

p ∗ := 23 (C1 I1 − C2 I2 )

is not the pressure but merely the isotropic stress determined by the free energy. Since tr T(a) = 0 then −1 T(a) = T(a) 0 = 2C 1 B0 − 2C 2 B0 .

As a comment, we observe that the second Piola stress T(a) R R , associated with (4.24), is given by = J F−1 T(a) F−T = 2(C1 + I1 C2 )1 − 2C2 C − 23 (C1 I1 + 2C2 I2 )C−1 . T(a) RR We recall that the invariants of C are equal to those of B. Hence, since J = 1, the Cayley-Hamilton theorem implies C−1 = C2 − I1 C + I2 1. Upon substitution of C−1 we find = τ1 1 + τ2 C + τ3 C2 , T(a) RR where τ1 = 2(C1 + I1 C2 ) − 23 I2 (C1 I1 + 2C2 I2 ), τ2 = 23 I1 (C1 I1 + 2C2 I2 ) − 2C2 , τ3 = − 23 (C1 I1 + 2C2 I2 ).

248

4 Solids

Differently from (4.24), tr T(a) R R = 0. To show this observe that tr C−1 = tr C2 − I12 + 3I2 . Now, I2 = 21 [(tr C)2 − tr C2 ], and hence

tr C2 = (tr C)2 − 2I2 = I12 − 2I2 , tr C−1 = I2 .

Consequently, = 6(C1 + I1 C1 ) − 2I1 C2 − 23 (C1 I1 + 2C2 I2 )I2 = 0. tr T(a) RR It is of interest to express the Cauchy stress T in terms of the principal stretches λ21 , λ22 , λ23 . By (4.19) we have Tii = Bi p ∂ B pi W = Bii ∂ Bii W = λi2 ∂λi2 W,

i not summed.

Observe that, in general, ∂λi W = ∂λi2 W 2λi and hence Tii = λi2 ∂λi2 W = λi ∂λi W. Thus T11 − T33 = λ1 ∂λ1 W − λ3 ∂λ3 W,

T22 − T33 = λ2 ∂λ2 W − λ3 ∂λ3 W.

For an incompressible Mooney-Rivlin material W is given by (4.23) and hence W = C1 (λ21 + λ22 + λ23 − 3) + C2 (λ21 λ22 + λ22 λ23 + λ23 λ21 − 3), subject to λ1 λ2 λ3 = 1. Therefore we find that λ1 ∂λ1 W = 2C1 λ21 + 2C2 (

1 1 1 1 1 1 + 2 ), λ2 ∂λ2 W = 2C1 λ22 + 2C2 ( 2 + 2 ), λ3 ∂λ3 W = 2C1 λ23 + 2C2 ( 2 + 2 ). λ23 λ2 λ3 λ1 λ2 λ1

Hence the stress differences can be written as T11 − T33 = 2C1 (λ21 − λ23 ) − 2C2 (

1 1 1 1 − 2 ), T22 − T33 = 2C1 (λ22 − λ23 ) − 2C2 ( 2 − 2 ). λ21 λ3 λ2 λ3

(4.26)

4.4 Hyperelastic Solids and Rubber-Like Materials

249

Uniaxial Tension Let the (Mooney-Rivlin) material be subject to a uniaxial elongation (in the 1√ direction, first eigenvector of U) so that λ1 = λ, λ2 = λ3 = 1/ λ. Hence we have T11 − T33 = (2C1 + 2C2 /λ)(λ2 − 1/λ) − 2C2 (1/λ2 − λ),

T22 − T33 = 0.

If, further, the body is free in the 2- and 3-direction then it follows T11 = (2C1 + 2C2 /λ)(λ2 − 1/λ) − 2C2 (1/λ2 − λ),

T22 = T33 = 0.

Experimental results are naturally derived for the first Piola stress, that is the stress relative to the reference area. Now, from T R = J TF−T , since J = 1 then T11R = T11 /λ whence T11R = 2C1 λ + 2C2 − 2C1 /λ2 − 2C2 /λ3 . Upon differentiation with respect to λ we find ∂λ T11R = 2C1 + 4C1 /λ3 + 6C2 /λ4 , ∂λ2 T11R = −12C1 /λ4 − 24C2 /λ5 . Positive values of C1 and C2 make T11R (λ) to be an increasing, concave function. Equibiaxial Tension Apply the equibiaxial tension in the directions 1 and 2 so that the principal stretches are λ1 = λ2 = λ. Owing to incompressibility, λ3 = 1/λ2 . Substitution in (4.26) yields T11 − T33 = T22 − T33 = 2C1 (λ2 − 1/λ4 ) − 2C2 (1/λ2 − λ4 ).

Shear Deformation For definiteness consider a shear deformation, with a non-zero displacement in the e1 -direction, in the form ⎡ ⎤ 1γ0 F = ⎣0 1 0⎦. F = 1 + γe1 ⊗ e2 , 001 The eigenvalues r of B = FFT are given by

250

4 Solids



⎤ 1 + γ2 − r γ 0 γ 1 − r 0 ⎦ = r 2 − (2 + γ 2 )r + 1. 0 = det ⎣ 0 0 1−r Hence we find two solutions,  r1 = 1 + 21 γ 2 + 21 γ 4 + 4γ 2 ,

r2 = 1 + 21 γ 2 −

1 2



γ 4 + 4γ 2 ,

and they are the inverse of each other. Let λ2 = r 1 ,

λ1 = λ, λ2 = 1/λ, λ3 = 1.

Hence I1 = λ2 + 1/λ2 + 1 = I2 . ⎡

⎤ 1 + γ2 γ 0 B = FFT = ⎣ γ 1 0 ⎦ , 0 01

Since



B−1

⎤ 1 −γ 0 = ⎣ −γ 1 + γ 2 0 ⎦ . 0 0 1

and T = − p ∗ + 2C1 B − 2C2 B−1 then ⎡

⎤ − p ∗ + 2(C1 − C2 ) + 2C1 γ 2 2(C1 + C2 )γ 0 ∗ 2 ⎦, T=⎣ − p + 2(C1 − C2 ) − 2C2 γ 0 2(C1 + C2 )γ 0 0 − p ∗ + 2(C1 − C2 )

where

p ∗ := 23 (C1 I1 − C2 I2 ) = 23 (C1 − C2 )(3 + γ 2 ).

4.5 Modelling of Dissipative Solids In modelling dissipative effects in solids analogy with the fluid models seem to be a common reference. Landau and Lifshitz[272], p. 34, model viscous dissipation by adding to the elastic stress tensor a viscosity stress tensor Tvis given by Tvis = 2μ˙ε0 + ζtr ε˙ 1, μ and ζ being the shear and bulk viscosity coefficients. In a sense this dependence is natural in that it is the strict analogue of linear elasticity, where T is a linear function of ε. Yet, ε˙ is not objective and hence dissipation has to be modelled in a different way. An answer to this problem is given in [132] by considering a Piola stress (the first one) instead of the Cauchy stress. Heat conduction is modelled ([272], p. 33) by a Fourier-like law,

4.5 Modelling of Dissipative Solids

251

q = −K∇θ,

(4.27)

the solid character being represented by the (possibly) anisotropic tensor K. For isotropic solids the standard Fourier law q = −κ∇θ is assumed. Yet, to account for deformation an argument can be developed as follows. Let for simplicity r = 0. The equation of balance of energy (4.4) is taken in the approximate form (J ≈ 1) θη˙ R = −∇ · q. Applying the trace to the stress-strain relation (4.6) we obtain ∇ · u = −kp + 3α(θ − θ0 ), where θ0 is the reference temperature, α is the thermal expansion, p = −tr T/3, and k = 3/(3λ + 2μ). Hence p is linear with respect to ∇ · u and θ − θ0 . This suggests that we let ([272], p. 6) ψ R = ψ0 (θ) − kα(θ − θ0 )∇ · u + με0 · ε0 + 21 k(∇ · u)2 . Since η R = −∂θ ψ R then

η R = η0 (θ) + kα∇ · u.

Consequently we have θ(η0 θ˙ + kα∇ ˙· u) = ∇ · (κ∇θ). As a rough approximation we now let the solid be in a stress-free condition, p = 0. Then Eq. (4.6) results in ∇ · u = 3α(θ − θ0 ) and by the balance of energy it follows θ(η0 + 3kα2 )θ˙ = κθ. Hence deformation would affect the heat equation by merely modifying the value of ˙ the coefficient of θ. We now examine how a thermodynamically-consistent scheme can be developed for dissipation in solids.

252

4 Solids

4.5.1 A Model for Viscosity and Heat Conduction If we look at the dependence of the stress on the deformation gradient, as is done in elasticity, by the principle of objectivity it follows that the second Piola stress T R R can depend on F via the Cauchy-Green tensor C. Now C is invariant under Euclidean ˙ are objective. ˙ Hence both T ˙ R R and C transformations and so is the time derivative C. Indeed they both are invariant. The same properties hold for the Green-St. Venant strain E = 21 (C − 1). ˙ or T R R (E, E), ˙ is naturally framed A modelling of solids via a function T R R (C, C), within a Lagrangian description. As to a Fourier-like law, we observe that q R = J qF−T , or

q = J −1 Fq R ,

∇R θ = ∇θ F ∇θ = F−T ∇R θ.

Hence (4.27) is replaced with q R = −J F−1 KF−T ∇R θ. We can then view J F−1 KF−T as the conductivity tensor K R , in the Lagrangian description, associated with the conductivity tensor K in the Eulerian description, K = J −1 FK R FT .

(4.28)

We follow the material description and then write the entropy inequality in the form ˙ + T R R · E˙ − 1 q R · ∇R θ ≥ 0. −(ψ˙ R + η R θ) θ Let ψ R , η R , T R R , q R be functions of ˙ ∇R θ. E, θ, E, Upon computation of ψ˙ R and substitution we have ¨ − ∂∇R θ ψ R · ∇R θ˙ − 1 q R · ∇R θ ≥ 0. −(∂θ ψ R + η R )θ˙ + (T R R − ∂E ψ R ) · E˙ − ∂E˙ ψ R · E θ ˙ θ˙ implies that ¨ ∇R θ, The arbitrariness of E, ∂E˙ ψ R = 0, Let

∂∇R θ ψ R = 0,

η R = −∂θ ψ R .

4.5 Modelling of Dissipative Solids

253

˙ ∇R θ) − TelR R (E, θ) Tdis := T R R (E, θ, E, RR

TelR R (E, θ) := T R R (E, θ, 0, 0), and assume T R R is continuous so that

˙ ∇R θ) → 0 as E, ˙ ∇R θ → 0. (E, θ, E, Tdis RR Hence it follows that TelR R = ∂E ψ R and 1 · E˙ − q R · ∇R θ ≥ 0. Tdis RR θ If

˙ = A R (E, θ)E, Tdis RR

q R = −K R (E, θ)∇R θ

then the reduced inequality holds if and only if A R ≥ 0,

K R ≥ 0.

The positive semi-definiteness of K R = J F−1 KF−T then implies K = J −1 FK R FT ≥ 0. This is so because, for any vector w, J w · Kw = w · FK R FT w = (FT w) · K R (FT w) and the conclusion follows. Moreover, since J = det F > 0 then K R is positive definite if and only if K is positive definite. The definition (4.28) of K, in terms of K R and the deformation gradient F, shows that even though K R is independent of E the spatial tensor K is affected by deformation. In the modelling of solids a properly selected reference configuration R is central in the description of constitutive properties. For instance, the reference configuration might be stress free and isotropic while the current configuration might have properties induced by the deformation. This suggests that the material properties be formulated in the reference configuration and next be extended to the current configuration. Consequently we describe the viscous behaviour by letting ˙ = A R (E, θ)E, Tdis RR

A R ∈ Lin(Sym, Sym),

and look for the corresponding relation T = AD

254

4 Solids

in the current configuration. The connection between T R R and T is different from that between E˙ and D. Since = J F−1 Tdis F−T , Tdis RR

E˙ = FT DF,

in indicial notation we have dis −1 R T J FK−1 i Ti j FH j = A K H P Q FP r Dr s Fs Q ,

whence

dis = J −1 F p K Fq H A KR H P Q Fr P Fs Q Dr s . T pq

In compact notation we can write Tdis = J −1 (F  F)A R (FT  FT )D. Hence

A = J −1 (F  F)A R (FT  FT ) ∈ Lin(Sym, Sym).

Go back to the components and let A pqr s = J −1 F p K Fq H A KR H M N Fr M Fs N . If the deformation gradient is merely a rotation, that is U = 1, F = Q, then it follows A pqr s = Q p K Q q H A KR H M N Q r M Q s N . This relation is also referred to as the basis change formula for the elasticity tensor [65]. Now that we know the relations K R ↔ K,

AR ↔ A

it is worth checking the properties under Euclidean transformations. By definition, K R and A R are assumed to be invariant. Hence K∗ = (J −1 FK R FT )∗ = J −1 F∗ K R (F∗ )T = J −1 QFK R FT QT = QKQT . Accordingly, K transforms as a (objective) second-order tensor. To determine the property of A we observe A∗pqr s = J −1 F p∗K Fq∗H A KR H M N Fr∗M Fs∗N = J −1 Q pi Fi K Q q j F j H A KR H M N Q r h Fh M Q sk Fk N = Q pi Q q j Q r h Q sk Ai j hk

4.5 Modelling of Dissipative Solids

255

and hence A transforms as a (objective) fourth-order tensor. Assume the conductivity tensor and the viscosity tensor are isotropic8 in the reference configuration K R = κ1,

A R = 2μ1  1 + λ1 ⊗ 1,

κ, μ, λ being possibly functions of the temperature θ. Hence q R = −κ∇R θ,

˙ Tdis = 2μE˙ + λ(tr E)1. RR

In the current configuration the heat flux vector q and the dissipative Cauchy stress Tdis take the form (4.29) q = −κJ −1 B∇θ, Tdis = 2μJ −1 B D B + λJ −1 (B · D)B.

(4.30)

Therefore, if heat conduction and viscosity are isotropic in the reference configuration then, in the current configuration, q and Tdis are influenced by deformation via the left Cauchy-Green tensor B = FFT . As is known, this happens as a consequence of the transformation of gradients (∇R θ = FT ∇θ, E˙ = FT DF) and the relations q ↔ ˙ is isotropic in R the corresponding q R , T ↔ T R R . It is of interest that while λ(tr E)1 term in R is a multiple of B. Likewise the shear term 2μE˙ is transformed into 2μJ −1 BDB, not merely a term proportional to D. The constitutive Eqs. (4.29) and (4.30) are consistent with thermodynamics in that this is apparent in the Lagrangian description (reference configuration). Yet it is of interest to examine the thermodynamic consistency of q and Tdis in the Eulerian ˙ Now, the description, that is as functions of ∇θ and D rather than of ∇R θ and E. function ∇θ · B∇θ of ∇θ is positive definite because so is the tensor B. Moreover, ∇θ · B∇θ = |FT ∇θ|2 . Instead, (BDB) · D = (BD) · (DB) and we cannot conclude about the positive definiteness with respect to D. However, (BDB) · D = B ps Dsr Bqr D pq = F p K Fs K Dsr Fq H Fr H D pq = FKTp D pq Fq H FKTs Dsr Fr H = |FT DF|2 . In addition, (B · D)B · D = (B · D)2 and B · D = F p K Fq K D pq = FKTp D pq Fq K = tr (FT DF). Let G = FT DF. Then [2μJ −1 B D B + λJ −1 (B · D)B] · D = 2μJ −1 G · G + λJ −1 (tr G)2 . 8

See Appendix about isotropic tensors. We consider the representation (A.15) in that A is applied to a symmetric tensor.

256

4 Solids

Moreover, letting G = G0 + 13 (tr G)1 we find [2μJ −1 B D B + λJ −1 (B · D)B] · D = J −1 [2μG0 · G0 + ( 23 μ + λ)(tr G)2 ]. As a consequence 1 κ T |F ∇θ|2 . Tdis · D − q · ∇θ = J −1 [2μ G0 · G0 + ( 23 μ + λ)(tr G)2 ] − θ θJ Hence the reduced entropy inequality holds if and only if the standard inequalities μ ≥ 0,

2 μ 3

+ λ ≥ 0,

κ≥0

hold.

4.6 Modelling via Dissipation Potentials The modelling of dissipation is often established by expressing the constitutive properties via rate-type equations (see Chap. 6). The literature shows also constitutive schemes based on the notion of dissipation potential.9 The interest in dissipation potentials seems to be the compact description of the dissipative “driving forces” via a (potential) function namely a dissipation potential. Following is an outline of some approaches.

4.6.1 Convex Dissipation Potential Let T = − p1 + Tdis and consider the entropy inequality in the form ˙ − p∇ · v + Tdis · D − 1 q · ∇θ ≥ 0. −ρ(ψ˙ + η θ) θ Moreover let the free energy ψ = ε − θη be a function of ρ, θ so that the entropy inequality becomes p  1 ρ 2 − ∂ρ ψ ρ˙ − ρ(η + ∂θ ψ)θ˙ + Tdis · D − q · ∇θ ≥ 0. ρ θ 9

See, e.g. [287] and references therein.

4.6 Modelling via Dissipation Potentials

257

We let p = ρ2 ∂ρ ψ,

η = −∂θ ψ

and hence the inequality reduces to 1 D := Tdis · D − q · ∇θ ≥ 0. θ The dissipative stress Tdis is split in the form Tdis = p dis 1 + Tdis 0 , Hence

tr Tdis 0 = 0,

p dis = 13 tr Tdis.

1 D = p dis ∇ · v + Tdis 0 · D0 − q · ∇θ. θ

Let D be given by a function D = ϒ(∇ · v, D0 , ∇θ), and let p dis := ∂∇·v ϒ,

Tdis := ∂D0 ϒ,

1 q := −∂∇θ ϒ. θ

The function ϒ(∇ · v, D0 , ∇θ) is said to be a potential of dissipation if it is convex, vanishing at the origin, and non-negative. An example in the literature [317] involves the thermo-viscous fluid. Look at the classical constitutive equations   Tdis = λ + 23 μ ∇ · v 1 + 2μD0 ,

q = −κ∇θ.

Hence it follows that ϒ(∇ · v, D0 , ∇θ) =

1 2

  λ + 23 μ (∇ · v)2 + μD0 · D0 + 21 θ κ|∇θ|2

is a potential of dissipation. This example with Tdis and q coincides in fact with the Navier-Stokes-Fourier model, Sects. 5.5 and 6.2.4. As to solids, the constitutive Eqs. (4.29) and (4.30) satisfy q = −θ∂∇θ ϒ, where ϒ(∇θ, D) = Since

Tdis = ∂D ϒ

μ λ κ ∇θ · B∇θ + BDBD + (B · D)2 . 2θ J J 2J

258

4 Solids

ϒ(∇θ, D) =

κ μ 1 |FT ∇θ|2 + |(FT DF)0 |2 + ( 23 μ + λ)[tr (FT DF)]2 2θ J J J

it follows that ϒ is also positive definite (as far as κ > 0, μ > 0, 2μ + 3λ > 0) and vanishes at ∇θ = 0, D = 0. Moreover the quadratic dependence proves convexity.

4.6.2 Model with a Scalar Internal Variable A scalar internal variable is introduced to describe dissipative effects. Two views are adopted in the literature. In a case [176] a variable β is considered to describe at the macroscopic level the effects of microfractures and microcavities which result in a decrease of the material stiffness. The basic idea is that the internal variable enters the balance of energy via an appropriate power of the internal forces [175]. In the other case [301] the internal variable, say α, appears only in the free energy function; both α and ∇R α contribute to the dissipated power but do not contribute to the external power. We outline the two schemes. By analogy with [175, 176], the balance of mass and linear momentum are left unchanged while the balance of energy is assumed in the form ρε˙ = T · D + Hβ˙ + H · ∇ β˙ − ∇ · q + ρr, H being viewed as the scalar internal force (of damage) and H as the vector internal force. Hence substitution of −∇ · q + ρr in the entropy inequality results in the modified Clausius-Duhem inequality 1 −ρ(ψ˙ + η η) ˙ + T · D + Hβ˙ + H · ∇ β˙ − q · ∇θ ≥ 0. θ Let T = Ten + Tdis ,

H = Hen + Hdis ,

H = Hen + Hdis ,

the purpose being to distinguish the energetic non-dissipative parts from the dissipative ones. As in the previous model we let Tdis = p dis 1 + Tdis 0 , where tr Tdis 0 = 0. Let ψ be given by a differentiable function ψ(ρ, θ, β, ∇β). The entropy inequality becomes (Ten − ρ2 ∂ρ ψ1) · D − ρ(η + ∂θ ψ)θ˙ + (Hen − ρ∂β ψ)β˙ + (Hen − ρ∂∇β ψ)∇ β˙ 1 dis ˙ dis · ∇ β˙ − q · ∇θ ≥ 0. +ρ∇β ⊗ ∂∇β ψ · L + p dis ∇ · v + Tdis 0 · D0 + H β + H θ

4.6 Modelling via Dissipation Potentials

259

dis dis ˙ ˙ By assuming that p dis , Tdis 0 , H , H , q approach zero as D, β, ∇ β, ∇θ approach zero it follows that

Ten = ρ2 ∂ρ ψ1 + ρ∇β ⊗ ∂∇β ψ, η = −∂θ ψ, Hen = ρ∂β ψ, Hen = ρ∂∇β ψ, and

1 dis ˙ dis · ∇ β˙ − q · ∇θ ≥ 0, D := p dis ∇ · v + Tdis 0 · D0 + H β + H θ

˙ ∇ β, ˙ ∇θ) holds while ∇β ⊗ ∂∇β ψ ∈ Sym. A dissipation potential ϒ(∇ · v, D0 , β, provided dis = ∂β˙ ϒ, Hdis = ∂∇ β˙ ϒ, q = −θ∂∇θ ϒ p dis = ∂∇·v ϒ, Tdis 0 = ∂D0 ϒ, H

while ∂∇·v ϒ∇ · v + ∂D0 ϒ · D0 + ∂β˙ ϒ β˙ + ∂∇ β˙ ϒ · ∇ β˙ + ∂∇θ ϒ · ∇θ ≥ 0. The other model with a scalar internal variable leads to the canonical balance laws; it stems from the view that the internal variable describes a purely dissipative behaviour and that it is controlled by the standard fields of stress and heat flux [301]. As a starting point, the balance equations are considered in the material description; the balance of linear momentum and energy are ρ R v˙ = ∇R · T R + ρ R b,

ρ R (ε˙ + v · v˙ ) = ∇R · (vT R − q R ) + ρ R r.

The second law inequality is then considered in the form ˙ + T R · F˙ − 1 q R · ∇R θ ≥ 0. −ρ R (ψ˙ + η θ) θ Let α be an internal variable and let ψ be given by ψ = ψ(F, θ, α, ∇R α) and the like for T R and η. Hence it follows T R = ∂F ψ,

η = −ρ R ∂θ ψ.

Upon the definitions A := −ρ R ∂α ψ,

B := −ρ R ∂∇R α ψ

we can write the reduced inequality in the form

260

4 Solids

1 D := Aα˙ + B · ∇R α˙ − q R · ∇R θ ≥ 0. θ

(4.31)

Hence, the free energy function ψ and the heat flux q R are required to satisfy the reduced inequality (4.31). This in turn shows that the internal variable α, though occurring only in the free energy ψ, enters the modelling of dissipative effects via the reduced inequality. In addition, the occurrence of the internal variable eventually leads to the canonical momentum balance. In view of the identities ˙ ρ R v˙ F = (ρ R vF)˙− ρ R vF,

ρ R vF˙ = ∇R ( 21 ρ R v2 ),

(∇R · T R )F = ∇R · (FT T R ) − ∂F ψ ∇R F, upon right multiplication by F of the equation of motion (balance of linear momentum) it follows (ρ R vF)˙ = ∇R · (FT T R + 21 ρ R v2 1) − ∂F ψ ∇R F + ρ R bF. Now, ∇R ψ(F, θ, α, ∇R α) = ∂F ψ∇R F + ∂θ ψ∇R θ + ∂α ψ∇R α + ∂∇R α ψ∇R ∇R α = ρ−1 [T R ∇R F − ρ R η∇R θ − A ∇R α − B∇R ∇R α]. R Hence, substitution of ∂F ψ ∇R F = T R ∇R F allows the balance of linear momentum to be written (4.32) (ρ R vF)˙ = ∇R · (FT T R + L 1) + ρ R bF + f th + f intv , where L = ρ R ( 21 v2 − ψ),

f th = −ρ R η∇R θ,

f intv = −A ∇R α − B∇R ∇R α.

The vector ρ R vF is the material linear momentum and the tensor FT T R + L 1 is the Eshelby material stress. The material force f intv occurring in the balance Eq. (4.32) is due to the dependence of the free energy on the internal variable α. Remark 4.1 Whether or not the internal variable α is observable and hence subject to a proper action in the bulk and at the boundary of any region in the body, mathematically an appropriate equation is required so that the evolution of α is determined. While physically an additional equation may be introduced by ascribing to α an angular momentum and a kinetic energy [301], a general setting for internal variables may be given within the framework of rate-type equations (see Chap. 6).

4.6 Modelling via Dissipation Potentials

261

4.6.3 Model with a Vector Internal Variable The starting point is the use of a generalized stress and a generalized strain. For definiteness, though, we let the material be non-conducting and describe the stress by the first Piola stress T R and the deformation by F. Hence we write the entropy inequality in the form −ψ˙ R − η R θ˙ + T R · F˙ ≥ 0. The free energy is assumed to depend on θ, F, and a tensor-valued internal variable α. Upon time differentiation of ψ R (θ, F, α) it follows ˙ ≥ 0. −(∂θ ψ R + η R )θ˙ + (T R − ∂F ψ R ) · F˙ − ∂α ψ R · α If η R and T R depend on θ, F, α, as ψ R does, then the inequality implies η R = −∂θ ψ R ,

T R = ∂F ψ R ,

∂α ψ R · α ˙ ≤ 0.

Instead, the assumption is made that + Tdis , T R = Ten R R and hence

Ten = ∂F ψ R , R

· F˙ − ∂α ψ R · α ˙ ≥ 0. Tdis R

˙ In = Tdis (θ, F, α, F). This position is thermodynamically consistent if we let Tdis R R addition, the tensors Xdis := −Xen Xen := ∂α ψ R , are defined; Xen is viewed as the energetic driving force while Xdis is the dissipative driving force [46]. The formal advantage of considering Xdis is that the reduced inequality can be expressed in the form · F˙ + Xdis · α ˙ ≥ 0, Tdis R

(4.33)

↔ F˙ and Xdis ↔ α ˙ are viewed as dissipation conjugate pairings. Yet, and hence Tdis R ˙ by definition, Xen , and Xdis too, is a function of θ, F, α and hence independent of α. Henceforth the dependence on temperature is understood and not written. A dissipation potential π is considered, via a function ˙ α), π = π(F, ˙ such that = ∂F˙ π, Tdis R

Xdis = ∂α˙ π.

262

4 Solids

Let , ∂F˙ Tdis R

∂α˙ Xdis R

be invertible so that we can determine the functions ˙ dis , F, α), F(T R

α(X ˙ dis , F, α).

Next the dual dissipation potential π ∗ , the convex conjugate to π, is defined by ˙ α). , Xdis ) = Tdis · F˙ + Xdis · α ˙ − π(F, ˙ π ∗ (Tdis R R It follows that dis ˙ ˙ ˙ ∂Xdis π ∗ = α ∂Tdis π ∗ = F˙ + Tdis ˙ + Xdis · ∂Xdis α ˙ − Xdis · ∂Xdis α ˙ = α. ˙ R · ∂Tdis F − T R · ∂Tdis F = F, R

R

R

Therefore the equations π∗ , F˙ = ∂Tdis R

α ˙ = ∂Xdis π ∗ ,

determine the rate of F and α. The dissipation potential π is assumed to be a gauge [287] in that ˙ α) ˙ α, π(F, ˙ ≥ 0, ∀F, ˙

π(0, 0) = 0,

and π is convex and positively homogeneous. Now, ˙ λα). · F˙ + Xdis · α ˙ = ∂F˙ π · F˙ + ∂α˙ π · α ˙ = ∂λ π(λF, ˙ Tdis R By the positive homogeneity it follows ˙ α), · F˙ + Xdis · α ˙ = |λ|k π(F, ˙ k > 0. Tdis R The assumption π ≥ 0 implies the inequality (4.33). Yet, thermodynamic consistency is more severe. By definition, Xdis is independent of α. ˙ Then upon the obvious integration of Xdis = ∂α˙ π we find ˙ α), ˙ + f (F, F, π = Xdis (F, α) · α f being an arbitrary function. This shows that π is linear with respect to α. ˙ Hence the assumption that π be a gauge is contradicted by the condition ∂α˙ π = −∂α ψ R .

Chapter 5

Fluids

Physically a fluid is defined as a substance which deforms continuously under the application of a shear stress. The term fluid generally includes both the liquid and gas phases. Some substances can be both fluid and solid. Viscoelastic fluids appear to behave similar to a solid when a sudden force is applied. Also substances with a very high viscosity such as pitch appear to behave like a solid. Here fluids are characterized via the constitutive properties. It is a common property of fluids that they exhibit elasticity in their response to compression. Fluids do not possess natural reference configurations and that is why it is more convenient to develop the pertinent equations in the spatial form. Fluids show lack of resistance to permanent deformation, resisting only relative rates of deformation in a dissipative, frictional manner. These properties are typically a function of their inability to support a shear stress in static equilibrium. Some models are examined in detail: elastic fluids (along with water wave theories), thermoelastic fluids, the ideal gas and some real gases, heat-conducting viscous fluids, Newtonian fluids, Stokesian fluids, generalized Newtonian fluids, viscoplastic and viscoelastic fluids, models of turbulence.

5.1 Elastic Fluids An elastic fluid is a body governed by the constitutive equations ε = ε(ρ), ˆ

ˆ T = T(ρ).

Since ρ is invariant the objectivity implies that T ˆ ˆ T(ρ) = QT(ρ)Q

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_5

263

264

5 Fluids

for all rotations Q and any value of ρ. Consequently T must be a scalar times the unit tensor 1. Since elasticity occurs only in compression, we take the scalar as − p, p being the pressure, and write T = − p1,

p = p(ρ). ˆ

The elastic fluid is also named piezotropic. The energy supply and the heat flux are taken to be zero and hence the local balance equations are ρ˙ = −ρ∇ · v, ρ v˙ = −∇ p + ρ b, and −ρ ε˙ = − p∇ · v. Then we can write (ρ

p d ε(ρ) ˆ − )ρ˙ = 0. dρ ρ

(5.1)

By the local balance of mass, for any value of ρ we can have any value of ρ˙ depending on the value of ∇ · v. Notice the identity ∇ · v˙ = ∂t ∇ · v + (v · ∇)∇ · v + L · LT . Hence the equation of motion can be written in the form ∂t ∇ · v + (v · ∇)∇ · v + L · LT + ∇ · (

ˆ 1 d p(ρ) ∇ρ) = ∇ · b. ρ dρ

We regard the function b(x, t) as arbitrarily assignable. This allows us to say that for any value of ρ, we can have arbitrary values of ∇ · v, and hence of ρ, ˙ while ∇ · b takes the value required to satisfy the equation of motion. Therefore, given any spatial point and any time, it is possible to find a process such that ρ and ρ˙ have arbitrarily prescribed values at that point and time. An equation of the form f (ρ) ρ˙ = 0 holds for any real value of ρ˙ and any positive value of ρ if and only if f (ρ) = 0. Thus the energy equation (5.1) implies the pressure relation p(ρ) ˆ = ρ2

d ε(ρ) ˆ . dρ

The internal energy determines the pressure through the pressure relation.

5.1 Elastic Fluids

265

To derive a property of the motion we observe p 1 p d ε(ρ) ˆ p 1 ∇ p = ∇ − p∇ = ∇ + ∇ρ = ∇(ε + ). ρ ρ ρ ρ dρ ρ If b is conservative, say b = −∇U and U = gz for the gravity force, then the equation of motion reads p v˙ = −∇(ε + + U ). ρ It follows that ∇ × v˙ = 0. This implies that in the motion of an elastic (or piezotropic) fluid under a conservative body force the acceleration is the gradient of a potential. Thus the motion preserves circulation. Look at a steady flow, where ∂t v = 0 and ∂t ρ = 0. We have v˙ = (v · ∇)v = 21 ∇v2 + (∇ × v) × v. The equation of motion yields 1 ∇v2 2

+ (∇ × v) × v = −∇(ε +

Let ϕ := ε + It follows that if ∇ × v = 0 then

p + U ). ρ

p 1 2 + 2 v + U. ρ

∇ϕ = 0.

Now, irrespective of the value of ∇ × v, v · ∇ϕ = −v · (∇ × v) × v = 0 This means that ϕ is constant on streamlines. If further the flow is irrotational then ϕ is constant on the fluid domain. This result is in fact Bernoulli’s law. In Sect. 2.8 Bernoulli’s law is derived from the equation of motion and the assumption that the fluid is barotropic. Here, for elastic fluids, the equation of motion leads to Bernoulli’s law in the form v · ∇ϕ = 0. Since Bernoulli’s theorem involves ∇ϕ then the two results are equivalent if ε+

p 1 p =∫ dξ + constant. ρ p0 ρ(ξ)

266

5 Fluids

Now look at the two functions as dependent on p and observe p  d εˆ dρ 1 p dρ 1 d  ε(ρ( ˆ p)) + = + − = , dp ρ( p) dρ dp ρ ρ2 dp ρ

d p 1 1 ∫ dξ = ; dp p0 ρ(ξ) ρ

the equivalence is then proved.

5.1.1 Water Wave Theories The simple model of elastic fluid allows us to set up significant systems of water wave equations. Consider a homogeneous, incompressible fluid, with constant mass density ρ. By the continuity equation, the velocity v is solenoidal, ∇ · v = 0. Moreover, the fluid is acted upon by pressure and gravity and hence the force per unit volume is −∇( p + ρgz), z being the upward vertical coordinate. Then circulation is preserved and we let ∇ × v = 0. Since ρ is constant, the equation of motion can be written ∂t v + ∇ 21 v2 + (∇ × v) × v = −∇(

p + gz). ρ

Since ∇ × v = 0 then there is a potential field, v = ∇ψ, and hence ∇(∂t ψ + 21 v2 +

p + gz) = 0. ρ

Thus ∂t ψ + 21 v2 + p/ρ + gz is a function of time. Consistent with the literature, we let this function be zero and also neglect v2 /2. Consequently, ∂t ψ +

p + gz = 0. ρ

Let the fluid move over an uneven bottom specified by x = xe1 + ye2 − h(x, y)e3 , where e3 is the upward unit vector while xe1 + ye2 belongs to a horizontal region D. The free surface of the fluid is described by

5.1 Elastic Fluids

267

x = xe1 + ye2 + η(x, y, t)e3 , η representing the free surface elevation. While h(x, y) is an a-priori given function, η(x, y, t) is unknown. At the free surface of the fluid, the pressure is just the atmospheric pressure. Hence taking the atmospheric pressure as a reference we can take the boundary condition in the form gη + ∂t ψ(η) = 0. Since ∂t η is the vertical velocity then ∂t η = ∂z ψ(η). Hence it follows g∂z ψ + ∂t2 ψ = 0 at z = η.

(5.2)

Moreover the condition ∇ · v = 0, along with v = ∇ψ, results in the differential equation ψ = 0. Deep-water waves We let the depth be so large that we can take z ∈ (−∞, η(x, y)]. We then look for a wave solution with the velocity, in the x-direction, given by the angular frequency ω and the wave number k, ψ = cos(kx − ωt) f (z); to fix ideas let ω, k > 0. We determine the function f by applying ψ = 0 to obtain f  − k 2 f = 0. Both exp(kz) and exp(−kz) are solutions; only exp(kz) is acceptable in that is bounded as z → −∞. Hence we have ψ = A exp(kz) cos(kx − ωt). The boundary condition at the free surface z = η implies (gk − ω 2 )A exp(kη) cos(kx − ωt) = 0. As a consequence we find the propagation condition ω 2 = gk. The speed of propagation (phase speed) is U = ω/k = g/ω.

268

5 Fluids

If, instead, the depth h is finite then both exp(kz) and exp(−kz) are acceptable and the solution is ψ = [A exp(kz) + B exp(−kz)] cos(kx − ωt). We require that the vertical velocity ∂z ψ vanish at the bottom, z = −h. It follows A exp(−kh) − B exp(kh) = 0. We then find that ψ = γ cosh(k(z + h)) cos(kx − ωt), where γ = A exp(−kh). The condition at the free surface (5.2) yields gk sinh(k(η + h)) = ω 2 cosh(k(η + h)). Since η  h we conclude that ω 2 = gk tanh(kh). The phase speed is given by U=

 ω  = g/k tanh(kh). k

Look at long waves namely λ = 2π/k h or kh  1. Since tanh(kh) = kh + o(kh) then as kh  1 we can take the approximation U



gh.

At the limit of long waves the phase speed is independent of the frequency. Shallow-water waves We now follow [23, 208] and describe the fluid motion via a picture of vertical columns, x = r + (ψ + X ϕ)e3 , where r = xe1 + ye2 , ψ = (η − h)/2, and ϕ = η + h. Hence ϕ is the height of the column, or length of the segment, while ψ is the vertical coordinate of the mean point (or centre of mass) of the column (segment). Observe that X ∈ [−1/2, 1/2]; X = −1/2 corresponds to the bed while X = 1/2 corresponds to the free surface. The velocity V of a point is written in the form ˙ 3. V = r˙ + (ψ˙ + X ξ)e

5.1 Elastic Fluids

269

Hence ψ˙ is the vertical velocity of the centre of mass while X ϕ˙ is the vertical velocity, at X , relative to the centre of mass. Let ∇˜ denote the bi-dimensional gradient, ∇˜ = ∂x e1 + ∂ y e2 . The equations of motion are derived by considering the balance of energy for a fluid column. Let P denote a (cylindrical) column and let C be the closed curve given by the intersection of ∂P with an horizontal plane. To compute the power of pressure on ∂P we observe that, on the lateral surface ∂l P, η

∫ pn · V da = ∫ v · n ∫ p dz ds = ∫ v · n ds,

∂l P

−h

C

C

where s is the arclength along C and η

 = ∫ p dz. −h

Moreover,

˜ 2 ]−1/2 [−v · ∇η ˜ + ψ˙ + 1 ϕ], V · n = [1 + (∇η) ˙ 2

at the free surface (X = 1/2) while ˜ − ψ˙ + 1 ϕ], ˜ 2 ]−1/2 [v · ∇h ˙ V · n = [1 + (∇h) 2 at the bottom. Hence 1 ˙ ˜ − ψ˙ − 1 ϕ) ˜ ∫ − pV · n da = − ∫ v · n ds + ∫ [ pa (v · ∇η ˙ 2 ˙ + P(v · ∇h + ψ − 2 ϕ)]dxdy,

∂P

C

D

where D is the region enclosed by C while pa is the atmospheric pressure and P is the pressure at the bottom. Consistent with the balance of energy (2.46) we consider the balance equation d ∫ ρ( 1 V2 + gz)dv = ∫ − pV · n da. dt P 2 ∂P Observe ∫ f dv = ∫ d xd y

P

D

η(x,y,t)



−h(x,y)

η(x,y,t)



f dz,

−h(x,y)

1/2

f (z)dz = ϕ ∫ f (ψ + X ϕ)d X. −1/2

Hence ∫ ρ( 21 V2 + gz)dv = ∫ 21 ρϕ(v2 +

P

and the balance of energy results in

D

1 2 ϕ˙ 12

+ 2gψ)d xd y

270

5 Fluids

d 1 1 2 ∫ ρϕ(v2 + 12 ϕ˙ + 2gψ)d xd y = − ∫ v · n ds dt D 2 C 1 ˙ ˜ ˜ + ∫[ pa (v · ∇η − ψ − ϕ) ˙ + P(v · ∇h + ψ˙ − 1 ϕ)]d ˙ x d y, 2

D

2

(5.3)

We now require that the balance of energy (5.3) be invariant under a superimposed rigid motion so that the change V → V + U leaves (5.3) unchanged. First let v → v + u, u being a horizontal vector. The invariance with respect to u implies that d 1 ˜ + P ∇h]d ˜ ∫ ρϕ(2v · u + u2 )d xd y = −u · ∫ nds + u · ∫[ pa ∇η x d y. dt D 2 C D As to the integral on C we observe that ˜ d x d y. u · ∫ n ds = ∫ ∇˜ · (u) d x d y = u · ∫ ∇ C

D

D

Hence, owing to the arbitrariness of u we deduce that d 1 ∫ ρϕd x d y = 0, dt D 2 d ˜ d x d y + ∫[ pa ∇η ˜ + P ∇h]d ˜ ∫ ρϕvd x d y = − ∫ ∇ x d y. dt D D D Now let ψ˙ → ψ˙ + α. The invariance with respect to α and the arbitrariness of α require that d ∫ ρ ϕ g t d x dy = ∫(P − pa )d x d y. dt D D By the Reynolds transport theorem, in the bi-dimensional domain D, we obtain the corresponding local equations in the form ϕ˙ + ϕ∇˜ · v = 0,

(5.4)

˜ + P ∇h, ˜ ˜ + pa ∇η ρϕ˙v = −∇

(5.5)

0 = P − pa − ρgϕ, where the superposed dot denotes the total time derivative with respect to the hori˜ zontal motion, ϕ˙ = ∂t ϕ + v · ∇ϕ. The shallow-water theory is based on the assumption that the vertical component of the motion has a negligible effect on the pressure1 p in that the depth of the water is 1

See [400], Sect. 2.2.

5.1 Elastic Fluids

271

sufficiently small compared with some other significant lengths (long-wave theory). This assumption amounts to assuming that the pressure p is approximately equal to the hydrostatic pressure, i.e. p(z) = ρg(η − z) + pa . Consequently, P = ρg(η + h) + pa ,

 = [ 21 ρg(η + h) + pa ](η + h).

Equations (5.4) and (5.5) then yield ∂t η + ∇˜ · [(η + h)v] = 0,

˜ = −g ∇η. ˜ ∂t v + (v · ∇)v

(5.6)

Equations (5.6) characterize the nonlinear shallow-water theory for the unknowns functions η(x, y, t), v(x, y, t). The corresponding linearized version is given by ∂t η + ∇˜ · (hv) = 0,

˜ ∂t v = −g ∇η.

Hence it follows the hyperbolic equation for the free surface elevation η, ˜ = 0. ∂t2 η − g ∇˜ · (h ∇η) Korteweg-de Vries equation The Korteweg-de Vries equation (KdV) occurs in various branches of physics and is of interest mathematically in that admits particular solutions, termed solitons [284]. Here we show how the KdV can be derived from, or related to, the shallow-water theory [25]. If ϕ satisfies the differential equation ∂t ϕ + c0 ∂x ϕ = 0 then any function ϕ(x − c0 t) is a solution. If, instead, c = c(x) c0 then we can view ϕ(x − c0 t) as merely an approximation of a solution to ∂t ϕ + ∂x [c(x)ϕ] = 0. It is convenient to apply the change of variable x → χ = x − c0 t and argue with the coordinate χ describing the space dependence relative to the the frame moving with the velocity c0 of the fundamental wave solution. With the purpose of accounting for long waves, slowly varying in time, the new variables

272

5 Fluids

ζ = 1/2 χ,

τ = 3/2 t

are considered. The powers 1/2 and 3/2 of the parameter  characterize the different weight of the derivatives under consideration; so ∂χ ϕ = O(−1/2 ) and ∂t ϕ = O(−3/2 ). Back to the nonlinear system (5.6) we restrict attention to the one-dimensional setting with variables x, t. By (5.4) and (5.5) in the one-dimensional setting we have ϕ˙ + ϕ∂x v = 0,

ϕv˙ = −∂x P,

where −∂x P replaces 1/ρ times the right-hand side of (5.5) and v is the x-component ¨ + gϕ2 /2. of the velocity. For the present purpose it is assumed that P = ϕ2 ϕ/3 Since ϕ˙ = ∂t ϕ + v∂v ϕ, and the like for v, then ϕ∂t ϕ = ∂t (ϕv) − v∂t ϕ = ∂t (ϕv) + v∂x (ϕv). Hence we find the system of equations ∂t ϕ + ∂x (ϕv) = 0,

∂t (ϕv) + ∂x (ϕv 2 + P) = 0.

Alternatively we can apply ∂t ϕ + ∂x (vϕ) = 0,

∂t v + v∂x v + g∂x ϕ = 0.

(5.7)

Since ∂t = 3/2 ∂τ − c0 1/2 ∂ζ ,

∂x = ∂χ = 1/2 ∂ζ ,

then (5.7) can be written ∂τ ϕ + (v − c0 )∂ζ ϕ + ϕ∂ζ v = 0,

∂τ v + (v − c0 )∂ζ ϕ + ϕ−1 ∂ζ P = 0.

Equations free of the parameter  are obtained by replacing ϕ, v, and P with their formal expansions in , ϕ = h 0 + ϕ + 2 ϕ + ...,

v = v  + 2 v  + ...,

P = P0 + P  + 2 P  + ...,

where P0 = 21 c02 h 0 ,

P  = c02 ϕ ,

P  = c02 ϕ + 21 g(ϕ )2 + 13 c02 h 20 ∂ζ2 ϕ .

At the lowest order of approximation we find −c0 ∂ζ ϕ + h 0 ∂ζ v  = 0.

5.2 Thermoelastic Fluids

273

The obvious integration gives v =

c0  (ϕ + β). h0

Mathematically β is an arbitrary function of time. As we see in a moment, a strict connection with the Korteweg-de Vries equation follows by letting β = 0. At the order 2 we find from (5.7) that ∂ τ ϕ +

3 2

c0  ϕ ∂ζ ϕ + 16 c0 h 20 ∂ζ3 ϕ = 0. h0

The identification η = ϕ and using the physical variables x, t result in the Kortewegde Vries equation ∂t η + c0 ∂x η +

3 2

c0 η ∂x η + 16 c0 h 20 ∂x3 η = 0. h0

5.2 Thermoelastic Fluids A thermoelastic fluid is a body governed by the constitutive equations ε = ε(ρ, ˆ θ),

ˆ T = T(ρ, θ).

Since ρ and θ are invariant then objectivity implies that ˆ ˆ T(ρ, θ) = QT(ρ, θ)QT for all rotations Q and any value of ρ, θ. Consequently T must be a scalar times the unit tensor 1. Since elasticity occurs only in compression, we take the scalar as − p, p being the thermodynamic pressure and write T = − p1,

p = p(ρ, ˆ θ),

where the last expression represents the equation of state of the fluid. The dependence of pˆ on ρ is usually assumed to be invertible, namely ρ = ρ( ˆ p, θ), because of the experimental evidence that the isothermal compressibility κ=

1 ∂ p ρˆ ρ

274

5 Fluids

is positive for the most fluids.2 The local balance equations are ρ˙ = −ρ∇ · v, ρ v˙ = −∇ p + ρ b, ρ ε˙ = − p∇ · v − ∇ · q + ρr. Thermodynamic restrictions ˆ θ); the heat flux q and By definition, ψ, η, p are functions of ρ, θ, e.g. ψ = ψ(ρ, the extra-entropy flux k are allowed to depend also on ∇θ. To determine the thermodynamic restrictions we substitute ψ˙ = ∂ρ ψˆ ρ˙ + ∂θ ψˆ θ˙ in the Clausius–Duhem inequality to obtain 1 ˆ θ˙ + (ρ2 ∂ρ ψˆ − p)∇ ˆ · v − qˆ · ∇θ + θ∇ · k ≥ 0. −ρ(∂θ ψˆ + η) θ As always we regard the functions b(x, t) and r (x, t) as arbitrarily assignable. This allows us to say that, at a point x and time t, the functions θ˙ and ∇ · v can take arbitrary scalar values.3 Hence the inequality holds only if ˆ θ), η(ρ, ˆ θ) = −∂θ ψ(ρ, ˆ θ). p(ρ, ˆ θ) = ρ2 ∂ρ ψ(ρ,

(5.8)

The former result is named entropy relation and the latter pressure relation. In particular, ˆ θ)]. ˆ θ) + θ∂ρ ∂θ ψ(ρ, p(ρ, ˆ θ) = ρ2 [∂ρ ε(ρ, The Clausius–Duhem inequality then simplifies to 1 − qˆ · ∇θ + θ∇ · kˆ ≥ 0, θ where qˆ and kˆ are allowed to depend also on ∇θ. Since k is a possible function of ρ, θ, ∇θ then ∇ · kˆ = ∂ρ kˆ · ∇ρ + ∂θ kˆ · ∇θ + ∂∇θ kˆ · ∇∇θ. The arbitrariness of ∇ρ, ∇∇θ implies that ∂ρ kˆ = 0, ∂∇θ kˆ ∈ Skw so that k = k0 (θ) + (θ)∇θ,  ∈ Skw. Since no physically significant vector k0 and skew tensor  occur then we let k = 0. Hence we are left with the heat conduction inequality qˆ · ∇θ ≤ 0,

2

Materials with the negative bulk modulus (inverse compressibility) are quite rare: see, for instance, [264]. 3 See Sect. 3.4.

5.2 Thermoelastic Fluids

275

ˆ a requirement for the constitutive function q(ρ, θ, ∇θ).

5.2.1 Gibbs Relations and Entropy Equation As a consequence of the pressure relation (5.8) we can establish the Gibbs relations p ˙ ψ˙ = 2 ρ˙ − η θ, ρ

ε˙ =

p ρ˙ + θη˙ ρ2

(5.9)

and the Maxwell relation ∂θ p = −ρ2 ∂ρ η. Moreover, by ∂ρ ε = ∂ρ ψ + θ∂ρ η =

p θ − 2 ∂θ p 2 ρ ρ

we obtain the identity p = θ∂θ p + ρ2 ∂ρ ε for the pressure function p(ρ, θ). The partial derivative cv = ∂θ ε(ρ, θ) is called the specific heat at constant volume (constant density). By the Gibbs relation (5.9)2 and the balance of energy we have θη˙ +

p 1 p ρ˙ = 2 ρ˙ + r − ∇ · q, ρ2 ρ ρ

whence we have the entropy equation 1 θη˙ = r − ∇ · q. ρ

(5.10)

It is of interest to contrast the entropy equation (5.10) with the entropy inequality ρη˙ ≥

q ρr 1 1 ρr −∇ · = − ∇ · q + 2 q · ∇θ; θ θ θ θ θ

(5.11)

by (5.11) the time rate of η is greater than the rate given by (5.10) minus |q · ∇θ|/ρθ2 . By the Gibbs relation (5.9)2 and the constitutive relation for p we have θη˙ = ∂ρ ε ρ˙ + ∂θ ε θ˙ −

p θ ρ˙ = cv θ˙ − 2 ∂θ p ρ˙ ρ2 ρ

276

5 Fluids

and hence, in light of the entropy equation (5.10), ρcv θ˙ =

θ ∂θ p ρ˙ + ρr − ∇ · q. ρ

For uniform fields we can multiply by the volume V to obtain a relation for the whole body. If M is the mass then ρV = M,

ρ˙ V˙ =− . 2 ρ M

Hence letting S = M η and C V = M cv we have4 θ S˙ = C V θ˙ + θ ∂θ p V˙ .

5.2.2 Conjugate Variables and Maxwell’s Relations Let f (x, y) be a C 2 function on A ⊆ R2 . Assume that f is convex in x for all y and define z = ∂x f, so that z is the variable conjugate5 to x. We can view z as a function of x, parameterized by y. The convexity of f implies that ∂x z = ∂x2 f > 0. Hence we can define the inverse function x = x(z, ˆ y). Let g(z, y) := f (x(z, ˆ y), y) − z x(z, ˆ y). We find that ∂z g = ∂x f ∂z xˆ − x − z∂z xˆ = −x. Moreover ∂ y g = ∂x f ∂ y xˆ + ∂ y f − z ∂ y xˆ = ∂ y f. In summary, subject to ∂x2 f > 0, or ∂x2 f < 0, 4

Apart from the occurrence of derivatives instead of differentials, this is the classical first T d S equation of thermodynamics (see, e.g. [449], Sect. (13.3). 5 For historical reasons due to the use in analytical mechanics, the conjugate variable is usually denoted by p. Here the conjugate variable is not denoted by p to avoid possible ambiguities with the pressure.

5.2 Thermoelastic Fluids

277

z = ∂x f, g(z, y) = f (x(z, ˆ y), y) − z x(z, ˆ y)

=⇒

x = −∂z g, ∂ y g = ∂ y f. (5.12) The conjugate property z = ∂x f is related to the Legendre transform f ∗ (z) of f (x) defined by f ∗ (z) = sup (z x − f (x)). x∈R

The supremum occurs at x ∗ (z) such that the tangent to y = f (x) is parallel to the straight line y = z x. We now generalize the conjugate variable to the case where x ∈ Rn , y ∈ Rm and f (x, y) is a C 2 function on A ⊆ Rn × Rm . Moreover let   H = ∂x j ∂xk f ,

det H = 0.

Let z k = ∂xk f. The entries of the Jacobian, ∂x j z k = ∂x j ∂xk f, coincide with those of H . Hence det H = 0 allows us to write the inverse function, x = x(z, ˆ y), and to define g(z, y) := f (x(z, ˆ y), y) − z j xˆ j (z, y). Hence we have ∂zk g = ∂x j f ∂zk xˆ j − xk − z j ∂zk xˆ j = −xk , ∂ yi g = ∂x j f ∂ yi xˆ j + ∂ yi f − z j ∂ yi xˆ j = ∂ yi f. We now go back to the entropy and pressure relations, η(ρ, θ) = −∂θ ψ(ρ, θ),

p(ρ, θ) = ρ2 ∂ρ ψ(ρ, θ).

For formal convenience, and for ease of comparison with the physical literature, we use the specific volume υ = 1/ρ in place of the mass density ρ. Since ∂υ = −

1 ∂ρ = −ρ2 ∂ρ υ2

we can use the Helmholtz free energy in the form ψ(υ, θ) and write the entropy and pressure relations as η(υ, θ) = −∂θ ψ(υ, θ),

p(υ, θ) = −∂υ ψ(υ, θ).

278

5 Fluids

To use η and υ as the independent variables we assume that ∂θ η(υ, θ) = −∂θ2 ψ(υ, θ) = 0. Hence we let θ = θ(υ, η), z = ∂θ ψ, and consider the conjugate function ψ(υ, θ(υ, η)) − ∂θ ψ θ(υ, η) = ψ(υ, θ(υ, η)) + η θ(υ, θ(υ, η)) =: ε(υ, η). In view of (5.12), the identifications x = θ, z = −η, f = ψ, g = ε we have θ = ∂η ε(υ, η),

p = −∂υ ε(υ, η).

To use p and η as the independent variables we assume that ∂υ p(υ, η) = −∂υ2 ψ (υ, η) = 0. Hence we let υ = υ( p, η), z = ∂υ ε, and consider the conjugate function ε(υ( p, η), η) − ∂υ ε υ( p, η) = ε(υ( p, η), η) + p υ( p, η) =: h( p, η). The identifications x = υ, z = − p, f = ε, g = h provide υ = ∂ p h,

θ = ∂η h.

Finally, to use p and θ as the independent variables we assume that ∂η θ = ∂η2 h( p, η) = 0. Hence we let η = η( p, θ), z = ∂η h and find that the conjugate function is h( p, η( p, θ)) − ∂η h η( p, θ) = h( p, η( p, θ)) − θ η( p, θ) =: μ( p, θ). Hence we obtain η = −∂θ μ( p, θ),

v = ∂ p μ( p, θ).

The four functions so determined are the specific (per unit mass) Helmholtz free energy ψ, internal energy ε = ψ + θη, enthalpy h = ε + pυ, and Gibbs free energy μ = ε + pυ − θη = ψ + pυ. The function μ is also called the chemical potential. Associated with any of these four functions is a pair of natural independent variables so that the following set of relations holds. ⎧ ψ(υ, θ) : ψ˙ = −η θ˙ − p υ, ˙ ⎪ ⎪ ⎨ ε(υ, η) : ε˙ = θ η˙ − p υ, ˙ h( p, η) : h˙ = υ p˙ + θ η, ˙ ⎪ ⎪ ⎩ ˙ μ( p, θ) : μ˙ = υ p˙ − η θ,

∂θ p(υ, θ) = ∂υ η(υ, θ), ∂υ θ(υ, η) = −∂η p(υ, η), ∂η υ( p, η) = ∂ p θ( p, η), ∂ p η( p, θ) = −∂θ υ( p, θ).

(5.13)

The four equations on the right are referred to as Maxwell’s relations. They follow from the assumption that ψ, ε, h, μ are C 2 functions of the pertinent pair of variables. Hence, for instance, from η = −∂θ ψ(υ, θ), it follows that

p = −∂υ ψ(υ, θ),

5.2 Thermoelastic Fluids

279

∂υ η(υ, θ) = −∂υ ∂θ ψ(υ, θ) = −∂θ ∂υ ψ(υ, θ) = ∂θ p(υ, θ). In classical thermodynamics the systems are often regarded as homogeneous namely the fields υ, θ, p, η are assumed to be independent of the position. In such a case multiplication of (5.13) by the mass M of the body gives A˙ = −S θ˙ − p V˙ , U˙ = θ S˙ − p V˙ ,

˙ H˙ = V p˙ + θ S, ˙ G˙ = V p˙ − S θ,

(5.14) (5.15)

where A = M ψ, U = M ε, H = M h, G = M μ, S = M η, V = M υ. In the next section, further interesting consequences are shown to follow from the set of equations (5.13).

5.2.3 Specific Heats By the balance of energy we have ε˙ + p υ˙ = r − υ∇ · q. The right-hand side is the overall rate at which energy is supplied, per unit mass, by external sources, r , and by heat conduction, −υ∇ · q. By the second line in (5.13) we have ε˙ + p υ˙ = θη. ˙ Hence we define the specific heats, cυ and c p , at constant volume and at constant pressure, as c p = θ ∂θ η( p, θ). cυ = θ ∂θ η(υ, θ), As a consequence we find that cυ = ∂θ ε(υ, θ), and c p = ∂θ ε( p, θ) + p ∂θ υ( p, θ). By the third line in (5.13) it follows that c p = ∂θ h( p, θ). This motivates why sometimes the specific heats are defined in the form6 6

See, e.g. [316], Sect. 3.3.5.

280

5 Fluids

cυ = ∂θ ε(υ, θ), If η = η(υ, θ) then

c p = ∂θ h( p, θ).

˙ η˙ = ∂υ η υ˙ + ∂θ η θ.

From the first line in (5.13) we have ∂v η = ∂θ p and hence, upon multiplication by θ, we can write ˙ θ η˙ = cυ θ˙ + θ ∂θ p υ, which, if expressed by the differentials dη, dθ, dυ is just the local form of the socalled first θ d S equation of classical thermodynamics ([449], Sect. 13.3). An analogous relation holds if η is taken to be given by η = η(θ, p). It follows at once that

˙ θη˙ = θ ∂θ η θ˙ + θ ∂ p η p.

By the fourth line in (5.13) we can replace ∂ p η with −∂θ υ. Hence θη˙ = θ ∂θ η θ˙ − θ ∂θ υ p˙ whence

˙ θη˙ = c p θ˙ − θ ∂θ υ p,

that is the second θ d S equation of classical thermodynamics ([449], Sect. 13.4). Since c p = θ∂θ η(θ, p) it follows that the entropy can be determined in terms of c p (θ) namely θ 1 η(θ, p) = ∫ c p (ϑ)dϑ, θ∗ ϑ (θ∗ , p) being a reference state.

5.3 Ideal Gas An ideal gas is a thermoelastic, non-conducting fluid characterized by the pressuredensity relation Rθ ρ, p= M where M is the molecular weight and R is the gas constant (R = 8.314 107 erg/degrees); ideal gases may differ only by the molecular weight. Since p = ρ2 ∂ρ ψ(θ, ρ) we have

5.3 Ideal Gas

∂ρ ψ =

281

Rθ 1 , M ρ

ψ(θ, ρ) =

Rθ ρ ˜ ln + ψ(θ), M ρ0

η(θ, v) = −

ρ R ˜ ln − ∂θ ψ(θ), M ρ0

where ρ0 is a reference value of ρ. To determine the function ψ˜ we need a further condition. It is a well-known assumption in physics that the internal energy ε of an ideal gas is a function of the ˜ is constant. temperature only and it is a standard approximation that cv = ∂θ ε(θ) Indeed, assume ε(θ) = cv θ + ε0 . Consequently ψ − θ∂θ ψ = cv θ + ε0 yields

(5.16)

ψ˜ − θψ˜  = cv + ε0 .

By means of a integrating factor μ we have μψ˜  −

μ μ ˜ ψ = −cv μ − ε0 θ θ

It follows that μθ is constant, say μθ = k. Hence we have k ψ˜ θ ε0 k = −cv k ln + kν, + θ θ0 θ ν being a constant. Hence ψ(θ, v) = K θ ln

ρ θ − cv θ ln + νθ + ε0 , ρ0 θ0

K =

R , M

(5.17)

while the requirement (5.16) is satisfied identically for any value of ν. By (5.16) we obtain the entropy η in the form η = K ln

ρ θ − cv ln + ν − cv . ρ0 θ0

For practical convenience we may take ν such that η = 0 at the reference state (θ0 , ρ0 ). This is obtained by letting ν = cv . Consequently we have ψ(θ, ρ) = K θ ln

θ ρ + cv θ(1 − ln ) + ε0 , ρ0 θ0

η(θ, ρ) = K ln

Since p = K θρ and p0 = K θ0 ρ0 then ln

p θ ρ = ln + ln . p0 θ0 ρ0

ρ θ − cv ln . ρ0 θ0

282

5 Fluids

In terms of θ and p we then have η(θ, ˆ p) = (cv + K ) ln

θ p − K ln . θ0 p0

(5.18)

5.3.1 Specific Heats and Entropy Functions The quantities cv = θ∂θ η(θ, v),

c p = θ∂θ η(θ, ˆ p)

are the specific heats (per unit mass) at constant volume or at constant pressure. By means of the function v(θ, p) then c p and cv are related by ˆ p) = ∂θ η(θ, v(θ, p)) + ∂v η ∂θ v(θ, p) = cv + θ∂v η ∂θ v(θ, p); c p = ∂θ η(θ, since η(θ, v) = −∂θ ψ(θ, v) then cv = ∂θ ε(θ, v). Observe that ∂v η(θ, v) = −∂v ∂θ ψ(θ, v) = −∂θ ∂v ψ(θ, v) = ∂θ p(θ, v). Hence c p − cv = θ∂θ p ∂θ v. In ideal gases ∂θ p =

R , Mv

∂θ v =

R , Mp

and then c p − cv =

∂θ p∂θ v =

R . M

Alternatively we consider Eq. (5.18) and find ˆ p) = cv + K . c p = θ∂θ η(θ,

R , Mθ

5.4 Models of Real Gases

283

5.4 Models of Real Gases There are a number of models providing a better description of real gases. Despite the relative simplicity, the Van der Waals model holds fairly well with the liquid and vapor phases.

5.4.1 Van der Waals Model Let V be the molar volume. The Van der Waals equation is (p +

a )(V − b) = Rθ; V2

(5.19)

if a, b = 0 then Eq. (5.19) reduces to the equation of an ideal gas. By (5.19) we can derive the function p(V, θ) as p=

Rθ a − 2. V −b V

Consider the isotherms, namely the curves at constant θ. For the ideal gas isotherms are hyperbola in the V − p plane. Instead Eq. (5.19) shows a monotonic behaviour only at high temperatures. Look for horizontal inflection points of (5.19). We have 0 = ∂V p = −

Rθ 2a + 3, 2 (V − b) V

0 = ∂V2 p =

2Rθ 6a − 4. 3 (V − b) V

Solving these equations, along with (5.19), in the unknowns p, V, θ we find the unique solution a 8a , p= . V = 3b, θ= 27 b R 27 b2 Denote these values by Vc , θc , pc . Observe that pc Vc =

a 3 3b = Rθc . 2 27 b 8

Replacing a = 3 pc Vc2 and b = Vc /3 we can write (5.19) in the form (P +

8 3 1 )(V − ) = ϑ, V2 3 3

where P = p/ pc , V = V /Vc , ϑ = θ/θc . It is remarkable that this equation contains only numerical constants and then it is the same for all substances (modelled by the Van der Waals equation).

284

5 Fluids

Incidentally, the ideal gas equation does not allow any inflection point; the ratio Rθ/ pV is constantly equal to 1. Back to (5.19) we now look for thermodynamic potentials. Since V = Mv we let α = a/M 2 and β = b/M so that (5.19) can be written in the form p=

Kθ α − 2. v−β v

Since p = −∂v ψ(θ, v) then we find the Helmholtz free energy ψ in the form ψ = −K θ ln(v − β) −

α + f (θ). v

Consequently the entropy η = −∂θ ψ(θ, v) and the internal energy ε = ψ + θη read η = K ln(v − β) − f  (θ),

ε=−

α + f (θ) − θ f  (θ). v

(5.20)

To determine f we consider the specific heat, at constant volume, cv = ∂θ ε(θ, v). By (5.20) it follows that cv may depend only on temperature. Hence we have f (θ) − θ f  (θ) = Cv (θ) + w, where w is a constant and Cv is the integral of cv ; if cv is constant then Cv = cv θ. For formal convenience we let f −

1 1 w f = − Cv (θ) + . θ θ θ

The integration is performed in the classical way: we multiply by μ(θ) and let −μ/θ = μ to obtain μθ = constant, say c0 , and then (

c0 wc0 c0  f ) = − 2 Cv (θ) + 2 , θ θ θ

whence f (θ) = −F(θ) + cθ ˜ − w, It follows that

θ

F(θ) = θ ∫ θ0

1 Cv (ξ)dξ. ξ2

ˆ η(θ, v) = K ln(v − β) + F  (θ) + c.

If cv is constant then F(θ) = θcv ln θ, F  = cv ln θ + cv and then ¯ η(θ, v) = K ln(v − β) + cv ln θ + c. In adiabatic, or isentropic, transformations η is constant and then we have

5.4 Models of Real Gases

285

K ln(v − β) + cv ln θ = constant, whence (v − β) K θcv = constant and then [θcv (v − β) K ]1/cv = θ(v − β) K /cv = constant. For an ideal gas we would have θv K /cv = constant.

5.4.2 Peng-Robinson Model The Peng-Robinson model proves useful in modelling some liquids in addition to real gases. It is characterized by the constitutive equation p=

α(θ) Kθ − ; v−β v(v + γ) + γ(v − γ)

in the standard Peng-Robinson model γ = Mβ. In light of the identity  1 1  1 1 = √ − √ √ v(v + γ) + γ(v − γ) 2 2 γ v − ( 2 − 1)γ v + ( 2 + 1)γ we rewrite the constitutive equation as p=

 α(θ)  Kθ 1 1 − √ − . √ √ v−β 2 2γ v − ( 2 − 1)γ v + ( 2 + 1)γ

Since p = −∂v ψ we then obtain ψ in the form √ α(θ) v − ( 2 − 1)γ ψ = K θ ln(v − β) − √ ln + f (θ). √ 2 2γ v + ( 2 + 1)γ Consequently √ α (θ) v − ( 2 − 1)γ − f  (θ), η = −∂θ ψ = −K ln(v − β) + √ ln √ 2 2γ v + ( 2 + 1)γ √ θα − α v − ( 2 − 1)γ ε= √ ln + f (θ) − θ f  (θ). √ 2 2γ v + ( 2 + 1)γ The specific heat cv = ∂v ε(θ, v) is a further property to be examined to find the unknown parameters β, γ and the functions α, f . We find that

286

5 Fluids

√ θα (θ) v − ( 2 − 1)γ cv = √ ln − θ f  (θ). √ 2 2γ v + ( 2 + 1)γ Experimental data on cv may give estimates of the parameter γ and the functions α, f while β = γ/M. Further models of real gases There are a number of models (or equations of state) [449] that, also because of a greater number of parameters, can realize a better fit of the relation among p, v, θ. Usually the molar volume V is used instead of the specific volume v. The Dieterici model is expressed by a Rθ exp(− ); p= V −b V Rθ at relatively high temperatures the approximation exp(−a/V Rθ) 1 − a/V Rθ simplifies the formal usage. The Clausius model involves three parameters a, b, c in the form (p +

a )(V − b) = Rθ. θ(V + c)2

The Berthelot model is similar to the Van der Waals model with the change a/V 2 → a/θV 2 , namely a Rθ − . p= V − b θV 2 The Virial model is a source of many equations; it is based on a formal development in inverse powers of V applied to the model of the ideal gas,   b(θ) c(θ) d(θ) + 2 + 3 + ... . pV = Rθ 1 + V V V

5.5 Heat-Conducting, Viscous Fluids Fluids generally exhibit internal friction that opposes to the relative motion of fluid particles. Also they can support heat conduction. The relative motion is commonly described by the velocity gradient L and heat conduction is described by the temperature gradient ∇θ. We then describe the constitutive properties of a heat-conducting, viscous fluid by letting ψ, η, T, q be functions of ρ, θ, L, ∇θ, that is ˆ T = T(ρ, θ, L, ∇θ)

5.5 Heat-Conducting, Viscous Fluids

287

and the like for ψ, η, q. We first examine the consequences of objectivity. The invariance of ρ, θ, ψ, η and the objectivity of T, q, and ∇θ imply that ˆ θ, Q L QT + Q ˙ QT , Q ∇θ), ψ = ψ(ρ, ˆ ˙ QT , Q ∇θ), θ, Q L QT + Q Q T QT = T(ρ, ˙ QT , Q ∇θ), ˆ Q q = q(ρ, θ, Q L QT + Q for any rotation Q. For any arbitrary skew tensor  we can take Q such that, at time t, ˙ Q(t) = 1, Q(t) = ; ˙ + τ ) =  Q(t + τ ), Q(t) = 1. this is the case if we consider the solution to Q(t Hence the objectivity requires that ˆ θ, L + , ∇θ), T = T(ρ, ˆ ˆ ψ = ψ(ρ, θ, L + , ∇θ), q = q(ρ, θ, L + , ∇θ), for all ρ, θ, L, ∇θ, and  ∈ Skw. The arbitrariness of  allows us to select  = −W to obtain ˆ θ, D, ∇θ), T = T(ρ, ˆ ˆ ψ = ψ(ρ, θ, D, ∇θ), q = q(ρ, θ, D, ∇θ). Hence the constitutive relations for a heat-conducting, viscous fluid cannot include the spin as an independent variable. As to the dependence on the stretching D and the temperature gradient ∇θ, the ˆ T, ˆ qˆ are required to satisfy functions ψ, ˆ θ, Q D QT , Q ∇θ), ψ = ψ(ρ, ˆ θ, Q D QT , Q ∇θ), Q T QT = T(ρ,

(5.21)

ˆ Q q = q(ρ, θ, Q D QT , Q ∇θ), for any rotation Q. Pressure ˆ ˆ To understand the properties of T(ρ, θ, D, ∇θ) we observe that T(ρ, θ, 0, 0) represents the stress in the fluid at the (locally) uniform temperature and in the absence of flow. The objectivity condition ˆ ˆ θ, 0, 0) Q T(ρ, θ, 0, 0)QT = T(ρ,

288

5 Fluids

ˆ for any rotation Q implies that T(ρ, θ, 0, 0) is an isotropic tensor and, because elasticity occurs only in compression, we let ˆ T(ρ, θ, 0, 0) =: − p(ρ, ˆ θ)1,

(5.22)

so recovering the constitutive equation of thermoelastic fluids. We may then view p(ρ, ˆ θ) as the thermodynamic pressure; as we see in a moment via the pressure relation (5.23), p(ρ, ˆ θ) is determined by the constitutive function of a thermodynamic potential. Since p(ρ, ˆ θ) is independent of motion it is also named equilibrium pressure. The pressure in a fluid is sometimes defined as p¯ = − 13 tr T. This definition would regard p¯ as a function of D and ∇θ, in addition to ρ and θ. This pressure is then referred to as the total pressure or the mean pressure (in that it represents the mean normal stress). Although the expressions of pˆ and p¯ overlap for special classes of fluids, we hereafter proceed by denoting with p the thermodynamic pressure and let ˆ θ, D, ∇θ) + p(ρ, ˆ θ)1. Tˆ dis (ρ, θ, D, ∇θ) := T(ρ, We can view Tˆ dis (ρ, θ, D, ∇θ) as the dissipative part of the stress in that it is the stress to within the thermodynamic pressure tensor (which can be obtained from a potential and then is conservative). In particular, p(ρ, ¯ θ, D, g) = p(ρ, ˆ θ) − 13 tr Tdis (ρ, θ, D, ∇θ) and, by definition, Tˆ dis (ρ, θ, D, ∇θ) → 0,

p(ρ, ¯ θ, D, ∇θ) → p(ρ, ˆ θ) when D, ∇θ → 0.

Thermodynamic restrictions We now determine the thermodynamic restrictions on the constitutive functions ˆ η, ˆ and qˆ of ρ, θ, D, ∇θ. We compute ψ, ˆ Tˆ dis , p, ˙ ˙ + ∂∇θ ψ · ∇θ, ψ˙ = ∂ρ ψ ρ˙ + ∂θ ψ θ˙ + ∂D ψ · D ˆ θ, D, ∇θ). Upon substitution in the where it is understood that ψ stands for ψ(ρ, Clausius–Duhem inequality we obtain ˙ − ρ∂ ψ ρ˙ − p∇ · v + T · D − 1 q · ∇θ + θ∇ · k ≥ 0. ˙ − ρ∂∇θ ψ · ∇θ −ρ(∂θ ψ + η)θ˙ − ρ∂D ψ · D ρ dis θ

5.5 Heat-Conducting, Viscous Fluids

289

We regard the functions b(x, t) and r (x, t) as arbitrarily assignable. This allows us ˙ can take arbitrary ˙ ∇ · v, D, ˙ and ∇θ to say that, at a point x and time t, the functions θ, 7 scalar, symmetric tensor, and vector values. ˙ θ˙ implies that the inequality holds only if ˙ ∇θ, The arbitrariness of D, ∂D ψ = 0,

∂∇θ ψ = 0,

η(ρ, θ) = −∂θ ψ(ρ, θ).

The inequality then simplifies to 1 (− p + ρ2 ∂ρ ψ)∇ · v + Tdis · D − q · ∇θ + θ∇ · k ≥ 0. θ Since k is a possible function of ρ, θ, D, ∇θ then ∇ · k = ∂ρ k · ∇ρ + ∂θ k · ∇θ + ∂D k · ∇D + ∂∇θ k · ∇∇θ. The arbitrariness of ∇ρ, ∇D, ∇∇θ implies that ∂ρ k = 0, ∂D k = 0, ∂∇θ k ∈ Skw. Hence k = k0 (θ) + (θ)∇θ,  ∈ Skw. For isotropic materials this implies that k = 0. Since, by definition, p is independent of ∇θ then we can restrict attention to ∇θ = 0 to obtain (− p + ρ2 ∂ρ ψ)∇ · v + Tdis · D ≥ 0. Since ∇ · v = tr D, both terms involve the stretching D. To find the consequences of the arbitrariness of D we replace D with λD, and hence ∇ · v with λ∇ · v so that λ(− p(ρ, θ) + ρ2 ∂ρ ψ(ρ, θ))∇ · v + λTdis (ρ, θ, λD, 0) · D ≥ 0. The analysis at small stretching tensors is made formal by looking at small values of λ. We can write the inequality as λ(− p(ρ, θ) + ρ2 ∂ρ ψ(ρ, θ))∇ · v + o(λ) ≥ 0. Consequently, for suitably small values of λ we have sgn [λ(− p(ρ, θ) + ρ2 ∂ρ ψ(ρ, θ))∇ · v + o(λ)] = sgn λ[− p(ρ, θ) + ρ2 ∂ρ ψ(ρ, θ)]∇ · v.

The arbitrariness of λ implies that the inequality holds only if p(ρ, θ) = ρ2 ∂ρ ψ(ρ, θ). Hence we are left with the reduced entropy inequality

7

See Sect. 3.4.

(5.23)

290

5 Fluids

1 Tdis · D − q · ∇θ ≥ 0. θ A comment is in order about the thermodynamic restriction ∂∇θ ψ = 0. This is so because the entropy inequality contains a single term, ˙ ∂∇θ ψ · ∇θ, ˙ = ∇ θ˙ − L∇θ. By the balance of energy we have in ∇θ ˙ = .... + r, ∂θ ε θ˙ + ∂∇θ ε · ∇θ the dots indicating terms related to the constitutive model under consideration. If ˙ an arbitrary value and the balance of energy holds ∂∇θ ε = 0 then we can assign ∇θ thanks to the appropriate value of r . If, instead, ∂∇θ ε = 0 then by ∂θ ε ∇ θ˙ = .... + ∇r, ˙ can be assigned an arbitrary value and the balance of energy holds it follows that ∇θ ˙ implies thanks to the appropriate value of ∇r . In both cases the arbitrariness of ∇θ the requirement ∂∇θ ψ = 0. If, instead, k depends on θ˙ then −ρψ˙ + θ∇ · k = (−ρ∂∇θ ψ + θ∂θ˙ k) · ∇ θ˙ + ... whence −ρ∂∇θ ψ + θ∂θ˙ k = 0. The free energy ψ can depend on ∇θ provided only that the extra-entropy flux k ˙ depends on θ. Among the constitutive functions Tdis and q compatible with the (reduced) entropy inequality we consider the linear functions Tdis = 2μ(ρ, θ)D + λ(ρ, θ)tr D1, q = −κ(ρ, θ)∇θ, in D and ∇θ, which then are also objective. To establish the possible compatibility, look at the traceless (or deviatoric) part D0 of D, D0 = D − 13 (tr D)1, so that Tdis = 2μD0 + ζ(tr D)1,

ζ := λ + 23 μ.

5.5 Heat-Conducting, Viscous Fluids

291

Therefore we have 1 1 0 ≤ Tdis · D − q · ∇θ = 2μD0 · D0 + ζ(tr D)2 − κ∇θ · ∇θ. θ θ The independence of D0 , tr D, and ∇θ implies that the inequality holds if and only if μ ≥ 0, ζ ≥ 0, κ ≥ 0. The restrictions so determined are then necessary and sufficient for the validity of the second law subject to the assumption on p(ρ, θ) and Tdis , q. Some comments are in order about the model so derived. The coefficients μ and ζ are called coefficients of viscosity, notably μ is the shear viscosity and ζ is the second (or bulk or dilatational) viscosity. The independence of μ and ζ of the stretching D allows the model to be referred to as the Newtonian fluid. For rarefied gases it is customary to assume ζ = 0 (referred to as Stokes relation8 ). For liquids, the assumption ζ > μ is often made which means that λ > μ/3. The governing differential equations for the Newtonian fluid are ρ˙ + ρ∇ · v = 0, ρ˙v = −∇ p + ∇(ζtr D) + 2∇ · (μD0 ) + ρb, ρε˙ = 2μD0 · D0 + ζ(tr D)2 − ∇ · (κ∇θ) + ρr. In general ρ and θ, and hence μ and ζ, are not constant throughout the fluid. If, however, the viscosities may be regarded as constant then the equation of motion becomes ρ˙v = −∇ p + μv + (ζ + 13 μ)∇(∇ · v) + ρb, while ζ + μ/3 = λ + μ. This form of the equation of motion is called the Navier– Stokes equation. A classical vorticity transport equation follows within the approximation that the temperature is constant throughout the fluid and the viscosities are constant. In view of the pressure relation p = ρ2 ∂ρ ψ and of the assumption ∇θ = 0 we obtain ∇(ψ +

1 p p 1 ) = ∂ρ ψ ∇ρ − 2 ∇ρ + ∇ p = ∇ p. ρ ρ ρ ρ

Hence, letting b be conservative, b = ∇ϕ, we can write the Navier–Stokes relation in the form λ+μ μ p ∇ · v) + v. v˙ = −∇(ψ + − ϕ − ρ ρ ρ 8

See, e.g. [393], Sect. 62.

292

5 Fluids

Hence ∇ × v˙ is given by ∇ × v˙ = μ∇ ×

∇ × v 1 v = μ( − 2 ∇ρ × v). ρ ρ ρ

By (1.34)



∇ × v˙ =  ˙ − L + (∇ · v)) = , 

 being the convected rate, or the Truesdell rate, of the vorticity  = ∇ × v. Hence it follows that, in isothermal conditions,  satisfies the transport equation 

 = μ(

1  − 2 ∇ρ × v). ρ ρ

Viscosity affects the rate of change of the kinetic energy within a convecting spatial region Pt as well as within a fixed spatial region . Inner product of the equation of motion with v gives ρv · v˙ = v · ∇ · T + ρb · v. Since ρv · v˙ = ρ( 21 v2 )˙ = ∂t ( 21 ρv2 ) + ∇ · ( 21 ρv2 v) we can write ∂t ( 21 ρv2 ) = ∇ · (Tv − 21 ρv2 v) + ρ b · v − T · D. Integration over a fixed region  and use of the divergence theorem imply ∂t ∫ 21 ρv2 dv = ∫ (Tv − 21 ρv2 v) · n da + ∫(ρ b · v − T · D)dv. 

∂



If v = 0 at the boundary ∂ then the rate of change is given by two terms, the power of the body force and that of the stress. Now, −T · D = p∇ · v − 2μD0 · D0 − ζ(∇ · v)2 . The power p∇ · v is positive during expansion (∇ · v > 0) and negative during compression (∇ · v < 0). The power of the dissipative stress, instead, is negative in any motion as a consequence of the thermodynamic requirements μ, ζ ≥ 0. Consequently, the global kinetic energy decreases as a consequence of viscosity whereas, by effect of pressure, it increases in any expansion and decreases in any compression. If, instead, we look at a convecting region Pt , by ρ( 21 v2 )˙ = v · (∇ · T) + ρ b · v

5.5 Heat-Conducting, Viscous Fluids

293

integration over Pt allows us to write ∂t ∫

Pt

1 ρv2 2

dv = ∫ (Tv) · n da + ∫ (ρ b · v − T · D)dv. Pt

∂Pt

Again we find that viscosity results in a decay of the kinetic energy via the power T · D.

5.5.1 Stokesian Fluids To investigate the possible dependence of Tdis on the stretching D, we neglect the dependence of Tdis on ∇θ and exploit the isotropy condition (5.21) and (5.22), namely Q Tdis QT = Tˆ dis (ρ, θ, Q D QT ) ,

Tˆ dis (ρ, θ, 0) = 0.

By adding the requirement that Tˆ dis is a smooth function of D, this expression amounts to Stokes’ postulates of fluidity. By means of the invariants9 Stokes’ postulates lead to the representation formula10 Tdis = α1 + βD + γD2 where ˆ θ, I1 , I2 , I3 ), γ = γ(ρ, ˆ θ, I1 , I2 , I3 ), α = α(ρ, ˆ θ, I1 , I2 , I3 ), β = β(ρ, and α(ρ, ˆ θ, 0, 0, 0) = 0. A proof of this result is given also in [184] and is based on the Cayley–Hamilton theorem. Fluids whose constitutive stress function is given in the form (5.24) T = (− p + α)1 + βD + γD2 , where p = p(ρ, ˆ θ), are usually referred to as Stokesian fluids. Hereafter, for ease in writing, the quantity p − α (which is often but improperly called pressure) will be denoted by p˜ and called nominal pressure. Since p¯ = − 13 tr T then p¯ = p − 13 tr Tdis whence we get the difference between the thermodynamic (or equilibrium) pressure p and the mean pressure p, ¯ p − p¯ = α + 13 βI1 + 13 γI2 . 9

(5.25)

We observe that I1 , I2 , I3 , relative to the stretching D, are denoted by ID , I ID , I I ID in [393]. See, e.g. [393], p. 232, for a simple proof.

10

294

5 Fluids

For Stokesian fluids a sufficient condition for compatibility with the (reduced) entropy inequality is given by Tdis · D ≥ 0,

q · ∇θ ≤ 0,

which implies that the constitutive isotropic functions α, β and γ satisfy the inequality αI1 + βI2 + γI3 ≥ 0 for any admissible motion. In the linear case, α = λ(ρ, θ)I1 ,

β = 2μ(ρ, θ),

γ=0

and the fluid is referred to as Newtonian. In polynomial fluids the components of the Cauchy stress are polynomials of degree n ≥ 2 in the deformation components, namely α, β and γ are polynomials of degree n, n − 1 and n − 2, respectively, in the components of D and α = 0 if D = 0. Since I1 , I2 and I3 are homogeneous of degree 1, 2 and 3 in the components of D, respectively, then α, β and γ turn out to be polynomials of the scalar invariants of D. For instance, if n = 2 the Cauchy stress is taken in the form T = (− p + λI1 + λ1 I12 + λ2 I2 )1 + 2(μ + μ1 I1 )D + 4νD2 ,

(5.26)

where p, λ, λ1 , λ2 , μ, μ1 and ν are functions of ρ, θ only. The entropy inequality requires that λI12 + λ1 I13 + λ2 I1 I2 + 2μI2 + 2μ1 I1 I2 + 4νI3 ≥ 0. The total pressure p¯ in the fluid is then expressed as a quadratic function of the scalar invariants of D, p¯ = p − (λ + 23 μ)I1 − (λ1 + 23 μ1 )I12 − (λ2 + 43 ν)I2 If the fluid is incompressible, the constitutive relation of T simplifies to T = (− p¯ − 43 νI2 (D0 ))1 + 2μD0 + 4νD20 , where I2 (D0 ) = tr D20 . Since ρ is constant, p¯ is viewed as an independent variable while μ and ν are functions of θ only. Hence, the nominal pressure reduces to the total pressure p¯ provided ν = 0 and this is the case compatible with thermodynamics, as discussed in Case II of Sect. 5.5.3. Higher degree stress-stretching relations can be established in a similar way.

5.5 Heat-Conducting, Viscous Fluids

295

5.5.2 Boundary Conditions According to a classical assumption, which traces back to Stokes, a viscous fluid must adhere to a solid boundary , namely v(x, t) = u(x, t)

on 

where u is the velocity of the boundary. This is called no-slip condition and is experimentally verified when moderate pressures and low surface stresses are involved, but do not apply in all cases. In rarefied viscous gases, for instance, the rarefaction effects are modelled through the partial slip at the boundary rigid wall using Maxwell’s velocity slip.11 In general, a phenomenologically reasonable slip condition is vt − ut = s tt on  vn − u n = 0, where n is the boundary outward normal vector at x at , and vn = v · n,

tn = t · n,

vt = v − vn n,

tt = t − tn n.

The positive coefficient s is called skin friction. Assuming that it is not a constant but vanishes at low tangential stresses and high pressures, this boundary condition reduces to no-slip at the opposite extremes. Following [393], Sect. 64, we let s(tn , tt ) =

0 if |tt | ≤ ξ0 |tn |, s0 (1 − ξ0 |tn /tt |) otherwise.

(5.27)

where the friction coefficient ξ0 is a dimensionless parameter which describes an empirical property of the contacting materials, as in the modelling of dynamic dry friction.12 To fix ideas, we now apply the no-slip assumption as the boundary condition. Let  be a fixed (regular) domain, ∂ be its fixed boundary and n be the outward normal at x ∈ ∂. The no-slip condition reads v=0

on ∂.

(5.28)

As a consequence of Stokes’ theorem, the vorticity vector is tangential to the boundary, ·n =0 on ∂, and in addition 11

See, e.g. [253], Chap. 2 A unified slip boundary condition, set up in [411], is applicable to a wide range of fluid flows, from rarefied gases to non-Newtonian fluids.

12

296

5 Fluids

(n × ∇) × v = Dn − (∇ · v)n − 21  × n = 0

on ∂.

whence it follows Dn = (∇ · v)n + 21  × n,

D2 n = (∇ · v)2 n + 21 (∇ · v) × n + 21 D( × n)

and then Dn · n=(∇ · v)2 ,

D2 n · n=(∇ · v)2 + 21 ( × n) · Dn = (∇ · v)2 + 14 ( × n)2

By virtue of these relations we obtain the expression of the stress vector at the fixed boundary for Stokesian fluids t = Tn = (− p + α + βI1 + γI12 )n + 21 (β + 43 γI1 ) × n + 21 γD0 ( × n) and then tn = − p + α + βI1 + γI12 + 41 γ  2 .

(5.29)

This shows that, in general, both tangential and normal stresses at the boundary depend on the local vorticity. When γ = 0, the normal stress is due to pressure and expansion, only, whereas the total stress is unaffected by local vorticity only if β = γ = 0. Finally, comparing tn with the mean pressure on the boundary, we obtain tn = − p¯ + 23 βI1 + γ(I12 − 13 I2 + 41  2 ).

(5.30)

5.5.3 Incompressible Fluids Incompressibility means that only isochoric motions are possible, namely ∇ · v = 0, which implies that the mass density ρ is constant along streamlines. Incompressibility can be viewed as an internal linear constraint by prescribing a subspace M of Sym (the space of all symmetric tensors) to be interpreted13 as the collection of all possible stretching tensors D. In particular, since ∇ · v = 1 · D, we have M = {D ∈ Sym : 1 · D = 0}. Therefore, M is the subspace of all traceless (deviatoric) tensors, whereas M⊥ is the subspace of all isotropic (spherical) tensors. According to this approach, the current stress T in an incompressible material is determined by the deformation process 13

See, e.g. [42], Sect. 5.1, p. 167.

5.5 Heat-Conducting, Viscous Fluids

297

only up to an additive part, Tr , (called reactive) that does no work in any admissible (isochoric) motion, namely Tr · D0 = 0,

D0 ∈ M.

As a consequence, the reactive stress is spherical, Tr ∈ M⊥ , and there exists a timedependent scalar field φ such that Tr = φ1. Unlike the compressible case, φ is an arbitrary multiplier and cannot be determined by constitutive functions. We now investigate the relationship between φ and the pressure. To establish the uniqueness of the decomposition T = Tr + Ta ˆ a (ρ, θ, D, ∇θ) by the condition ([42], Eq. we can normalize the active stress Ta = T (5.5)) Ta · Tr = 0 which implies Ta ∈ M for any admissible (isochoric) deformation process. Accordingly, we have Tr = 13 (tr T) 1 = − p¯ 1, and hence we identify −φ with the (reactive) total pressure p, ¯ and then Tdis = Ta . In view of (5.24), we finally rewrite the stress decomposition for Stokesian incompressible fluids into the form14 T(ρ, θ, D0 ) = − p¯ 1 + β0 D0 + γ0 (D20 − 13 I2 1), where I2 is relative to D0 and p¯ is a parameter (Lagrangian multiplier) independent of the thermodynamic process, D0 is the traceless (or deviatoric) part of D and β0 = βˆ0 (ρ, θ, I2 , I3 ),

γ0 = γˆ 0 (ρ, θ, I2 , I3 ).

If we let ¯ ρ, θ, I2 , I3 ) = p¯ + 13 γ0 I2 p0 ( p, we obtain the well-known constitutive relation of Reiner–Rivlin fluids, T = − p0 1 + β0 D0 + γ0 D20 . 14

tr D20 = I2 > 0 for any D0 = 0.

(5.31)

298

5 Fluids

Aside from the different nature of the pressure, we can state the constitutive and dynamic properties of incompressible fluids as a particular case of compressible ones. Since ρ is constant, the free energy and the entropy are given by ψ = ψ(θ),

η(θ) = −∂θ ψ(θ).

Because of the different stress decomposition, the reduced entropy inequality holds in a modified form, 1 Ta · D0 − q · ∇θ ≥ 0. θ For Reiner–Rivlin fluids a sufficient condition is given by Ta · D0 ≥ 0,

q · ∇θ ≤ 0,

which implies β0 I2 + γ0 I3 ≥ 0.

(5.32)

Now, I3 = 3 det D0 . Moreover, I2 = |D0 |2 ≥ 0 and vanishes only if D0 = 0. In particular, we examine two simple cases. Case I. Letting γ0 = 0 and β0 = 2μ(θ), the linear functions Ta = 2μ(θ)D0 ,

q = −κ(θ)∇θ,

are compatible with the reduced entropy inequality provided μ ≥ 0, κ ≥ 0. This case is referred to as incompressible Newtonian fluid (see Sect. 5.6.5). Case II. Let γ0 = 4ν(θ) and β0 = 2μ(θ). Then, using a special deformation process it can be proved that this model is compatible with the reduced entropy inequality provided ν = 0. Indeed, by considering a shear deformation such that ⎡ 01 γ˙ ⎣ 10 D0 = 2 00

⎤ 0 0⎦ 0

˙ then μ ≥ 0. we have I2 = 21 γ˙ 2 and I3 = 0. From (5.32) it follows μ γ˙ 2 ≥ 0 for any γ, Now, we consider the following stretching tensors, ⎡

⎤ 2 0 0 D0 = γ˙ ⎣ 0 −1 0 ⎦ , 0 0 −1



⎤ −2 0 0 D0 = γ˙ ⎣ 0 1 0 ⎦ . 0 01

For both tensors, I2 = 6γ˙ 2 , but I3 = 6γ˙ 3 for the former and I3 = −6γ˙ 3 for the latter. Hence, from (5.32) it follows that μ and ν must satisfy for any γ˙ both of the following inequalities

5.5 Heat-Conducting, Viscous Fluids

299

μ + 2ν γ˙ ≥ 0,

μ − 2ν γ˙ ≥ 0

As a consequence, we obtain ν = 0,

μ ≥ 0,

and we conclude that the incompressible Newtonian fluid is the only Reiner–Rivlin model with β0 and γ0 independent of D which is compatible with the entropy inequality.

5.5.4 Oberbeck–Boussinesq Approximation The Oberbeck–Boussinesq approximation [60, 346] takes the fluid as incompressible in that the velocity field is assumed to be solenoidal, ∇ · v = 0, but allows the mass density to be a (linear) function of the temperature. This is somewhat contradictory because incompressibility would imply that θ and hence the mass density be constant along streamlines. The approximation is justified as follows. Assume the fluid is at equilibrium at the reference state with mass density ρ0 = ρ R and temperature θ0 . Hence ∇ p(ρ0 ) + ρ0 g = 0. Letting p = p0 + P, with p0 = p(ρ0 ), we can write the equation of motion in the form ρ˙v = −∇P + μv + (ρ − ρ0 )g. By linearization, ρ˙v ρ0 v˙ ,

ρ − ρ0 = −αρ0 (θ − θ0 )g

we can write the equation of motion in the form v˙ = −

1 ∇P + νv − α(θ − θ0 )g, ρ0

where ν = μ/ρ0 is the kinematic viscosity. The scheme is completed by letting q = −κ∇θ so that the balance of energy becomes θ˙ = κθ + h, where h = (ρr + T · D)/ρ c, c being the specific heat (per unit volume).

300

5 Fluids

Of course it is no more than an approximation to let the mass density be a function of temperature and hence disregarding the constancy of ρ along streamlines, required by the incompressibility assumption, ∇ · v = 0.

5.6 Newtonian Fluids Among the constitutive functions Tdis and q compatible with the (reduced) entropy inequality we consider the linear functions Tdis = 2μ(ρ, θ)D + λ(ρ, θ)(tr D)1,

q = −κ(ρ, θ)∇θ,

in D and ∇θ, which then are also objective. They are recovered as a special case from Stokesian fluids by letting γ = 0, β = 2μ and α = λI1 . To establish the thermodynamic compatibility we express Tdis in the form Tdis = 2μD0 + ζ(tr D)1,

ζ := λ + 23 μ,

where D0 is the traceless part of D. As a consequence we have 1 1 0 ≤ Tdis · D − q · ∇θ = 2μD0 · D0 + ζ(tr D)2 − κ∇θ · ∇θ. θ θ The independence of D0 , tr D, and ∇θ implies that the inequality holds if and only if μ ≥ 0, ζ ≥ 0, κ ≥ 0. The restrictions so determined are then necessary and sufficient for the validity of the second law subject to the assumption on p(ρ, θ) and Tdis , q. Some comments are in order about the model so derived. The coefficients μ and ζ are called coefficients of viscosity, notably μ is the shear viscosity and ζ is the second (or bulk or dilatational ) viscosity. The independence of μ and ζ of the stretching D allows the model to be referred to as the Newtonian fluid. In addition, from (5.25) the difference between pressure and mean pressure in a compressible Newtonian fluid is proportional to ζ, namely p − p¯ = ζI1 , and then vanishes for rarefied gases (where ζ = 0). The governing differential equations for the Newtonian fluid are ρ˙ + ρ∇ · v = 0, ρ˙v = −∇ p + ∇(ζtr D) + 2∇ · (μD0 ) + ρb, ρε˙ = 2μD0 · D0 + ζ(tr D)2 − ∇ · (κ∇θ) + ρr.

5.6 Newtonian Fluids

301

In general ρ and θ, and hence μ and ζ, are not constant throughout the fluid. If, however, the viscosities may be regarded as constant then the equation of motion can be written as ρ˙v = −∇ p + μv + (ζ + 13 μ)∇(∇ · v) + ρb, while ζ + μ/3 = λ + μ. This form of the equation of motion is called the Navier– Stokes equation. Assuming a no-slip boundary condition, by virtue of (5.29) the normal and tangential stresses at a fixed rigid wall takes the form tn = − p + (λ + 2μ)I1 ,

tt = μ  × n.

According to (5.27) and remembering that  is tangential (normal to n), for a given friction coefficient ξ0 and a given pressure p(ρ, θ) the no-slip condition is valid provided that the velocity field v on the boundary satisfies μ |∇ × v| ≤ ξ0 | p − (λ + 2μ)∇ · v|.

5.6.1 Models with Thermal Expansion and Pressure-Dependent Viscosities The specific volume is assumed to depend only on temperature. Hence we assume (see [374]) J = f (θ) whence ˙ ∇ · v = α(θ)θ,

α(θ) =

1 f  (θ). f (θ)

Thus the fluid sustains only isochoric motions in isothermal conditions. A thermodynamically consistent model is now established. Since ρ˙ = −ρ∇ · v then ρ˙ v˙ ˙ = − = −α(θ)θ. v ρ Hence α(θ) =

1  v (θ) v(θ)

is the coefficient of thermal expansion. Take the stress T in the form T = − p1 + T ,

T → 0 as D → 0,

302

5 Fluids

and hence p = −(tr T)/3 as D = 0. Let θ, p, D, ∇θ be the set of independent variables while ψ, η, T , q are given by constitutive functions under consideration. ˙ Computing ψ˙ and substituting we obtain the Clausius– Observe that p∇ · v = α p θ. Duhem inequality in the form ˙ + T · D − 1 q · ∇θ ≥ 0. ˙ − ρ∂∇θ ψ · ∇θ −ρ∂ p ψ p˙ − (ρ∂θ ψ + ρη + α p)θ˙ − ρ∂D ψ · D θ

˙ implies that ˙ and ∇θ The arbitrariness of p, ˙ D, ∂ p ψ = 0,

∂D ψ = 0,

∂∇θ ψ = 0.

Hence ψ = ψ(θ, p) and the arbitrariness of θ˙ implies that η = −∂θ ψ −

α p. ρ

The internal energy ε then takes the form ε = ψ(θ, p) − θ∂θ ψ(θ, p) −

θα(θ) p . ρ(θ)

Hence we are left with the reduced dissipation inequality 1 T · D − q · ∇θ ≥ 0. θ

(5.33)

As with T and q we can improve the standard Navier–Stokes–Fourier and assume T = 2μ(θ, p)D + λ(θ, p)(tr D)1,

q = −κ(θ, p)∇θ.

Inequality (5.33) holds with the classical conditions μ ≥ 0,

2μ + 3λ ≥ 0,

κ≥0

for any dependence on θ and p. Qualitative differences occur in the equation of motion and the balance of energy as a consequence of the dependences on θ, p. The equation of motion reads ρ˙v = −∇ p + μv + (μ + λ)∇(∇ · v) + 2D∇μ + (∇ · v)∇λ + ρb. Now observe that

  θα   p ∂θ ε = − θ∂θ2 ψ + ρ

5.6 Newtonian Fluids

303

can be viewed as the specific heat at constant pressure; denote it by c p . Hence the balance of energy can be expressed in the form ρc p θ˙ − αθ p˙ = κθ + ∇κ · ∇θ + 2μD0 · D0 + (

2μ + λ)(∇ · v)2 + ρr. 3

As a particular case, Newtonian incompressible fluids, with pressure-dependent viscosity, follow by letting ∇ · v = 0 and α = 0. We now examine compressible fluids with pressure-dependent viscosities.

5.6.2 Fluids with Pressure-Dependent Viscosities The fluid is described with the independent variables θ, p, D, ∇θ. Still we let T = − p1 + T ,

T → 0 as D → 0.

Consider the Clausius–Duhem inequality ˙ − p∇ · v + T · D − 1 q · ∇θ ≥ 0 −ρ(ψ˙ + η θ) θ and observe that v˙ p∇ · v = p , v

−ρψ˙ − p

v˙ ˙ − v p] = −ρ[ψ˙ + pv ˙ = −ρφ˙ + p, ˙ v

where φ = ψ + pv is the Gibbs free energy. Hence we write the inequality in the form ˙ + p˙ + T · D − 1 q · ∇θ ≥ 0. −ρ(φ˙ + η θ) θ Letting φ, η, ρ, T , q be given by functions of θ, p, D, ∇θ and computing φ˙ we find ˙ + T · D − 1 q · ∇θ ≥ 0. ˙ − ρ∂∇θ φ · ∇θ −ρ(∂θ φ + η)θ˙ − (ρ∂ p φ − 1) p˙ − ρ∂D ψ · D θ ˙ θ, ˙ p˙ implies ˙ ∇θ, The arbitrariness of D, ∂D φ = 0, ∂∇θ φ = 0, η = −∂θ φ, v = ∂ p φ. Letting T = 2μ(θ, p)D + λ(θ, p)(tr D)1,

q = −κ(θ, p)∇θ

304

5 Fluids

we find that the reduced dissipation inequality T · D − (1/θ)q · ∇θ ≥ 0 holds with the standard restrictions μ(θ, p) ≥ 0,

2μ(θ, p) + 3λ(θ, p) ≥ 0,

κ(θ, p) ≥ 0.

If the fluid is incompressible then it follows η = −∂θ ψ, ∂ p ψ = 0, μ(θ, p) ≥ 0, κ(θ, p) ≥ 0. Yet, instead of incompressibility the model of thermal expansion looks more realistic. Some models of temperature- and pressure-dependent viscosities Roughly, viscosity increases with decreasing temperature. Some examples are applied in the literature. Poiseuille (1840) proposed a formula for the dependence of μ on temperature by analogy with the mass-density dependence, μ(θ) =

μ0 . 1 + μ1 θ + μ2 θ2

This equation is also applied by replacing the absolute temperature θ with the Celsius temperature and redefining the values of the parameters μ0 , μ1 , μ2 . Based on the Arrhenius law for the chemical reaction rates, k = k0 exp(−E/Rθ), where R is the gas constant, De Guzman - Andrade law [12] has been considered in the form μ = μ0 exp(a/Rθ), a being a positive parameter. Viscosity increases with increasing pressure because the amount of free volume in the internal structure decreases due to compression. However, compared to the temperature influence, most viscous liquids are influenced very little by the applied pressure. For most liquids, a considerable change in pressure from 0.1 to 30 MPa causes about the same change in viscosity as a temperature change of about 1 K. Even for the enormous pressure difference of 0.1 to 200 MPa the viscosity increase for most low-molecular liquids amounts to a factor 3 to 7 only. The reason is that viscous liquids (other than gases) are almost non-compressible at low or medium pressures. A simple model of the dependence of (shear) viscosity on temperature, pressure and mass density is given by Andrade’s formula μ(θ, p, ρ) = Aρ1/2 exp[s( p + r 2 ρ)/θ]

5.6 Newtonian Fluids

305

where A, r , and s are constants. Letting μ∞ (ρ) = Aρ1/2 and a( p, ρ) = s( p + r 2 ρ) this formula looks like a generalization of the De Guzman-Andrade law. If the fluid is regarded as incompressible then ρ is merely a parameter. In compressible fluids, where a constitutive relation gives p as a function of ρ and θ, once, e.g. p is replaced with the pertinent function p(ρ, θ) then μ becomes a function of ρ and θ.

5.6.3 Vorticity and Enstrophy Transport Equation A classical vorticity transport equation follows for a compressible fluid within the approximation that the temperature is constant throughout the fluid and the viscosities are constant. In view of the pressure relation p = ρ2 ∂ρ ψ and the assumption ∇θ = 0 we obtain 1 p 1 p ∇(ψ + ) = ∂ρ ψ ∇ρ − 2 ∇ρ + ∇ p = ∇ p. ρ ρ ρ ρ Hence, letting b be conservative, b = ∇ϕ, we can write the Navier–Stokes relation in the form p λ+μ μ v˙ = −∇(ψ + − ϕ − ∇ · v) + v. ρ ρ ρ Consequently, ∇ × v˙ is given by ∇ × v˙ = μ∇ ×

∇ × v 1 v = μ( − 2 ∇ρ × v). ρ ρ ρ

Observe ∇ × v˙ = ∂t ∇ × v + ∇ × [(v · ∇)v]. By means of the vorticity vector  = ∇ × v and the identities (v · ∇)v = 2Wv + ∇ 21 v2 , 2Wv =  × v, it follows ∇ × [ × v] = ∇ × [(v · ∇)v]. By direct calculation we find ∇ × [ × v] = (∇ · v) + (v · ∇) − L. Thus ∇ × [(v · ∇)v] = (∇ · v) + (v · ∇) − L  and then

306

5 Fluids 

∇ × v˙ =  ˙ − L  + (∇ · v) = ,

(5.34)



 being the Truesdell rate of . Hence in isothermal conditions the vorticity  satisfies the transport equation    1  − 2 ∇ρ × v . = μ ρ ρ Restrict attention to incompressible fluids. Let ν = μ/ρ. Since v˙ = −∇ p + g + νv,

∇ × v˙ = ν

then the vorticity transport equation (5.34) reduces to  ˙ = L + ν. Enstrophy is viewed as the kinetic energy associated with vorticity. Let e be the enstrophy per unit mass, e = 21 ||2 . Hence e˙ =  · L + ν · . Consider the identities u · u = ∂xk (u · ∂xk u) − (∂xk u) · (∂xk u),

 21 u2 = ∂xk (u · ∂xk u),

and apply them to u = . Moreover observe that  · L =  · D. Hence we find e˙ =  · D − ν|∇|2 + νe, usually referred to as the enstrophy-transport equation.

5.6.4 Viscosity and Energy Decay Viscosity affects the rate of change of the kinetic energy within a convecting spatial region Pt as well as within a fixed spatial region . Inner product of the equation of motion with v gives ρv · v˙ = v · ∇ · T + ρb · v. Since ρv · v˙ = ρ( 21 v2 )˙ = ∂t ( 21 ρv2 ) + ∇ · ( 21 ρv2 v)

5.6 Newtonian Fluids

307

we can write ∂t ( 21 ρv2 ) = ∇ · (Tv − 21 ρv2 v) + ρ b · v − T · D. Integration over a fixed region  and use of the divergence theorem give ∂t ∫ 

1 ρv2 2

dv = ∫ (Tv − 21 ρv2 v) · n da + ∫ (ρ b · v − T · D)dv. ∂



If v = 0 at the boundary ∂ then the rate of change is given by two terms, the power of the body force and that of the stress. Now, −T · D = p∇ · v − 2μD0 · D0 − ζ(∇ · v)2 . The power p∇ · v is positive during expansion (∇ · v > 0) and negative during compression (∇ · v < 0). The power of the dissipative stress, instead, is negative in any motion as a consequence of the thermodynamic requirements μ, ζ ≥ 0. Hence the global kinetic energy decreases as a consequence of viscosity whereas, by effect of pressure, it increases in any expansion and decreases in any compression. If, instead, we look at a convecting region Pt , by ρ( 21 v2 )˙ = v · ∇ · T + ρ b · v integration over Pt allows us to write d ∫ 1 ρv2 dv = ∫ (Tv) · n da + ∫ (ρ b · v − T · D)dv. dt Pt 2 Pt ∂Pt Again we find that viscosity results in a decay of the kinetic energy.

5.6.5 Incompressible Newtonian Fluids According to Sect. 5.5.3, the stress constitutive function for an incompressible Newtonian fluid is given by T = − p¯ 1 + 2μ(θ)D0 , and then nominal and total pressures are the same. The kinetic energy balance holds for any convecting region Pt as with compressible fluids except for the vanishing of the power ζ(∇ · v)2 and we have d ∫ 1 ρv2 dv = ∫ (Tv) · n da + ∫ (ρ b · v − T · D)dv, dt Pt 2 Pt ∂Pt Let μ be constant. Since

T · D = 2μD0 · D0 .

308

5 Fluids

2∇ · D0 = v, the Navier–Stokes equation of motion takes the form ρ˙v = −∇ p¯ + μ v + ρ b. The velocity v is also subject to ∇ · v = 0. These are the incompressible Navier–Stokes equations in the unknowns v, p. ¯ Let the body force be conservative, b = ∇, so that  p¯  1 ∇ p¯ − b = ∇ − . ρ ρ The equation of motion can be written as v˙ = −∇

 p¯ ρ

 −  + νv,

where ν = μ/ρ. The quantity ν is called kinematic viscosity and it is natural to regard ν as a diffusivity for the velocity v in that the equation for v has formally the structure of the diffusion equation for v. This feature holds also for the vorticity  = ∇ × v. In view of (5.34) and the incompressibility condition ∇ · v = 0 we obtain  ˙ − L  = ν or  ˙ − D0  = ν since L = W + D0 and W = 0. Under no-slip boundary conditions, by virtue of (5.30) the normal stress of an incompressible Newtonian fluid at a fixed rigid wall equals the opposite of the mean pressure, ¯ tn = − p. Then, by virtue of (5.27), for a given friction coefficient ξ0 and a given pressure p¯ the no-slip condition v = 0 is valid provided that the velocity field v on the boundary satisfies ξ0 ¯ |∇ × v| ≤ | p|, μ that is to say, provided that the boundary vorticity ω is low if compared to the pressure.

5.7 Generalized Newtonian Fluids

309

5.7 Generalized Newtonian Fluids A Newtonian fluid is a fluid in which the (Cauchy) viscous stress, arising from its flow, is linearly proportional to the stretching D. Owing to linearity, Newtonian fluids are the simplest mathematical models of fluids that account for viscosity. While water and air can be regarded as Newtonian fluids, more involved, non-Newtonian models are in order for other fluids. Generalized Newtonian fluid models are widely adopted in the literature and account for a variety of behaviours. In the shear-thickening fluid or dilatant fluid, the viscosity appears to increase when the shear-rate increases. Corn starch dissolved in water is a common example15 : when stirred slowly it looks milky, when stirred vigorously it feels like a very viscous liquid. In the shear-thinning fluid the viscosity appears to decrease when the shear-rate increases. A familiar example in this sense is wall paint. Another example of a shear-thinning fluid is blood. This application is highly favoured within the body, as it allows the viscosity of blood to decrease with increased shear strain rate. A constitutive relation between stress and stretching for generalized Newtonian fluids is obtained from (5.24) by letting16 γ = 0,

α = α(ρ, ˜ θ, I1 , I2 ),

˜ θ, I1 , I2 ). β = β(ρ,

Neglecting the dependence of p, α˜ and β˜ on ρ and θ, such fluids are characterized by the relation ˜ 1 , I2 )D, (5.35) T = [− p + α(I ˜ 1 , I2 )]1 + β(I and are then referred to as generalized Newtonian fluids. The relationship between stress and stretching for incompressible generalized Newtonian fluids takes the form ¯ θ, I2 )D0 , T = − p¯ 1 + β˜0 ( p,

(5.36)

where the viscosity function β˜0 is expected to be non-negative valued and possibly depends on the mean pressure (the independent variable that replaces ρ). By properly choosing the expression of this constitutive function special rheological models for incompressible fluids have been proposed (see, e.g. [18, 318]). Power-law model. Relative to equation (5.36), β0 is taken in the form σ−1 2

β0 (θ, I2 ) = 2μ(θ) I2

,

(5.37)

where μ ≥ 0, as required by the entropy inequality. The real parameter σ > 0 is the power-law index of the model. The constitutive relation of a Newtonian fluid is 15 16

This non-Newtonian fluid is called oobleck. This form may be reasonable for real Stokesian fluids; see e.g. [18].

310

5 Fluids

recovered when σ = 1. When σ > 1 the model describes a shear-thickening (also termed dilatant) fluid in which viscosity increases with the rate of shear strain, whereas the constitutive equation is shear-thinning when 0 < σ < 1. It is a drawback of this model that it does not match experimental results merely because the viscosity function β0 is unbounded and vanishes as D0 → 0. To avoid this drawback some models have been established subject to the limit conditions lim β0 = 2μ0 ,

I2 →0

Equations

lim β0 = 2μ∞ .

I2 →∞

1 β0 = μ∞ + (μ0 − μ∞ ) , 2 1 + [κ2 I2 ](1−σ)/2 β0 a/2 = μ∞ + (μ0 − μ∞ ) [1 + κa I2 ](σ−1)/a , a > 0, 2 √ β0 sinh−1 (κ I2 ) = μ∞ + (μ0 − μ∞ ) , √ 2 κ I2

are referred to as Cross model, Carreau–Yasuda model, Powell-Eyring model. In all of these models μ0 , μ∞ and κ are constitutive functions of θ. In addition, κ is a time and hence κ2 I2 is a pure number. Indeed, κ is viewed as a relaxation time; it characterizes a threshold relative to D0 . At low values of the stretching, I2  1/κ2 , Cross fluids behave as Newtonian fluids with β0 = 2μ0 , at large values of the stretching, I2 1/κ2 , the behaviour is that of a power-law fluid. It is worth considering a successful class of nonlinear models that reproduce different viscous responses of the fluid through some properly defined shear-rate critical value. This quantity is somewhat related to the notion of yield stress that is peculiar to viscoplastic fluids (see Sect. 5.8.1). Casson model. The viscosity function β0 is taken in the form

√ 2μ∞ + τ y / I2 if I2 > I2c , β0 = otherwise, 2μ0 where τ y is the yield stress of the fluid and I2c is the critical value of I2 . They are related by τ y2 I2c = , (μ0 − μ∞ )2 where μ0 is the (Newtonian) viscosity at low stretching; the parameter μ∞ is called the plastic dynamic viscosity. If τ y = 0 and μ0 = μ∞ the model reduces to the Newtonian fluid. A Casson fluid exhibits a yield stress τ y > 0. If a shear stress, smaller than the yield stress, is applied then the fluid behaves like a Newtonian fluid, whereas if a

5.8 Viscoplastic and Viscoelastic Fluids

311

shear stress is greater than the yield stress the fluid moves like a power-law fluid with index n = 0. These properties make the Casson fluid a realistic model for the blood circulation in narrow arteries and chocolate flow.17 Herschel–Bulkley model. It is a model of viscosity where both the yield stress and the power-law dependence occur. The function β0 is given by ⎧ ⎨

σ−1 2

√ + τ y / I2 if I2 > I2c , β0 = σ−1  ⎩ 2μ∞ I2c 2 + τ y / I2c if I2 < I2c , 2μ∞ I2

where τ y is the yield stress of the fluid and I2c is the shear-rate critical value, as in the Casson model. For large values of the yield stress the fluid will significantly flow in response to a large applied force. If σ < 1 the fluid is shear-thinning, whereas if σ > 1 the fluid is shear-thickening. If σ = 1 the model reduces to the Casson model.

5.8 Viscoplastic and Viscoelastic Fluids The vast majority of non-Newtonian fluid models are concerned with simple models (like power-law, Carreau–Yasuda and Casson) which are special versions of the generalized Newtonian model. Nevertheless, a lot of composite and biological materials (e.g. polymeric solutions, melts, muds, condensed milk, glues, printing ink, emulsions, soaps, chocolate, paints, shampoos, toothpaste and tomato paste) show properties of fluid flows that don’t match with Stokes’ assumptions. In particular, Stokesian fluid models are unable to predict either memory or other elastic/plastic effects exhibited by these substances. The governing equations expected for nonStokesian fluids are then much more complicated if compared to those of Newtonian fluids. Due to complexity, no single constitutive equation exhibiting all properties of such fluids is available. So, several models of non-Stokesian fluids are exhibited in the literature through the differential, integral, and rate-type categories. Roughly, we may divide them into two main classes: viscoplastic and viscoelastic fluids.

5.8.1 Viscoplastic Fluids The viscoplastic behaviour involves a threshold value of the stress, termed yield stress. When the applied stress remains below this value, the fluid doesn’t flow, but it is allowed to move as a rigid solid. When the applied stress increases beyond a finite threshold, termed yield stress, then the fluid exhibits a viscous (possibly nonlinear) stress–strain relationship so that it begins to flow. 17

Remarkable applications of this model can be found in [181].

312

5 Fluids

The viscoplastic behaviour is usually not observed in pure materials, but may occur in concentrated suspensions of particles. First, Bingham [43] observed this type of behaviour in paints, but it also applies to drilling mud, toothpaste, margarine, mustard and clay suspensions. Bingham model. It is the most celebrated viscoplastic incompressible fluid model. A Bingham fluid exhibits a yield stress τ y , as the Casson model, but its dynamical features look quite different. Indeed, if a shear stress less than the yield stress is applied to the fluid, it behaves like a (rigid) solid, whereas if a shear stress greater than the yield stress is applied, it starts to move like a Newtonian fluid. The constitutive assumption is then expressed as a relation where the strain rate is given as a function of the stress. Instead of the usual relations that hold for generalized (incompressible) Newtonian fluids, tr T = −3 p, ¯ T0 = β˜0 (θ, I2 (D0 ))D0 in a Bingham fluid the latter is replaced by D0 = δ˜0 (θ, I2 (T0 ))T0

(5.38)

˜ where √ δ0 is expected to be a non-negative valued function. In particular, letting τ = I2 (T0 ) and τ y its yielding value, we may choose ⎧ ⎨ 1 (1 − τ /τ ) if τ > τ , y y δ0 = 2μ∞ ⎩ 0 otherwise where the viscosity μ∞ is a constitutive functions of θ and represents the limiting viscosity of the fluid when τ τ y . As is apparent, the constitutive relation cannot be reversed for all values of τ . Nevertheless, if τ > τ y from (5.38) we infer  τ = τ y + 2μ∞ I2 (D0 ) and then we may write  β0 (θ, I2 (D0 )) = 2μ∞ + τ y / I2 (D0 ) which looks like the Casson model for high strain rate values. The main difference occurs when τ < τ y : according to Bingham the fluid moves as a rigid solid, whereas according to Casson the fluid merely exhibits a very high viscosity. Geometry has a relevant role in the Bingham fluid dynamics. Indeed, in simple shear, the shear stress is constant throughout the flow field, but in more complex flows, such as flow in a straight pipe, the shear stress is not constant and may be regions where the fluid is flowing (since the yield criterion is reached) while in other regions the shear stress lies below the yielding limit and the fluid moves as a rigid body (D = 0).

5.8 Viscoplastic and Viscoelastic Fluids

313

Generalized Bingham model. Generalizations of the Bingham model are obtained by choosing δ˜0 in (5.38) in the form ⎧ ⎨ 1 (1 − τ /τ )1/σ if τ > τ , y y δ 0 = κ∞ τ ⎩0 otherwise where κ∞ is a constitutive functions of θ. Although we cannot reverse this relation, we may proceed as before: if τ > τ y from (5.38) we infer σ

τ = τ y + 2μ∞ [I2 (D0 )] 2 where μ∞ = κσ∞ /2, and then we may write β0 (θ, I2 (D0 )) = 2μ∞ [I2 (D0 )]

σ−1 2

 + τ y / I2 (D0 )

which looks like the Herschel–Bulkley model for high strain rate values. For σ < 1 the fluid is shear-thinning, whereas for σ > 1 the fluid is shear-thickening. As σ = 1 we get the Bingham model. Pseudo-plastic model. In a lot of real biological materials as ketchup and mayonnaise the viscoplastic behaviour is quite similar, but the existence of a true yield stress has no experimental evidence.18 For this reason and for ease in computing it is preferable to adopt a modified Bingham model (also termed pseudo-plastic) with index σ, tr T = −3 p, ¯

√  τ y [1 − exp(−σ I2 (D0 ))]  D0 , T0 = 2μ0 + √ I2 (D0 )

where, the yield-stress parameter τ y has a conventional meaning, as in Casson and Herschel–Bulkley models. Indeed, the modified Bingham model turns out to be a special case of the generalized Newtonian model for incompressible fluids. Taking into account that this modified stress-stretching relation leads to     τ = 2μ0 I2 (D0 ) + τ y 1 − exp(−σ I2 (D0 )) , we may conclude that it recovers the Newtonian model when σ = 0 and approaches the Bingham model as σ → ∞. Viscoelastic fluids: differential and rate-type models Viscoelastic fluids are a common form of non-Stokesian fluid. They can exhibit a response that resembles that of an elastic solid under some circumstances, or the 18

See, e.g. [30].

314

5 Fluids

response of a viscous liquid under other circumstances. Typically, fluids that exhibit this behaviour are macromolecular in nature, such as polymeric fluids (melts and solutions) used to make plastic articles, food systems such as dough used to make bread and pasta, and biological fluids such as synovial fluids found in joints. In the case of polymeric fluids, the nature of polymeric molecules has a direct influence on the processing behaviour and in some cases on the performance of the polymer in a given application. In order to describe a so complex behaviour, the stress–strain constitutive relation of a viscoelastic fluid involves memory effects: the actual value of the stress accounts for not only the actual value of the strain rate but also its past values. In the literature three different approaches have been devised in this concern. First, the stress at some instant may depend not only on the strain rate but also on its time derivatives at the same instant. This differential approach is suitable in modelling fluids with short memory, for instance fluids of second (or higher) grade, and must involve objective time derivatives. Alternately, the hereditary character of the fluid may be implicitly established through a rate-type relation (either linear or nonlinear) between the actual values of the stress, the strain rate and their time derivatives. Relations of this type have a somewhat complex structure whose compatibility with thermodynamics and the principle of frame-indifference is not obvious. For this reason, rate-type models will be scrutinized in a separate section. Finally, assuming that the actual value of the stress depends on the whole past deformation history. This is the so-called integral (or long-memory) approach and traces back to Maxwell, Boltzmann and Volterra. Models of this type are mainly linear and may be applied to both solid and fluids. Their constitutive structure falls outside this chapter, so their analysis will be carried out within the second part of this book concerning non-simple materials.

5.9 Models of Turbulence Turbulence denotes a fluid motion characterized by chaotic changes in pressure and flow velocity. It is viewed in contrast to a laminar flow, which occurs when a fluid flows in parallel layers, with no disruption between those layers. With this view it is standard to describe fluids where the velocity and the pressure are decomposed into ensemble means and fluctuating parts. Also, there is a view that turbulence is caused by excessive kinetic energy in parts of a fluid flow, which overcomes the damping effect of the fluid viscosity; that is why the so-called k-equation is applied in the modelling of turbulence, k being a turbulent kinetic energy.

5.9.1 Incompressible Fluids For simplicity consider an incompressible viscous fluid; the velocity field v is subject to

5.9 Models of Turbulence

∇ · v = 0,

315

ρ∂t v + ρ(v · ∇)v = −∇ p + 2μ∇ · D.

The fields v, p are (assumed to be) random functions of space and time. Let v = V + u,

p = P + π,

where u and π are the fluctuating parts. It is assumed that u = 0, ∂t u = 0, ∇u = 0,

π = 0, ∂t π = 0, ∇π = 0.

the superposed bar denoting the time average (over a suitably large period of time). Accordingly ∇ · u = 0 and 0 = ∇ · v = ∇ · (V + u), 0 = ∇ · V + ∇ · u, =⇒

∇ · V = 0, ∇ · u = 0.

Moreover, (v · ∇)v = ∇ · (v ⊗ v), (v · ∇)v = ∇ · v ⊗ v = (V · ∇)V + ∇ · u ⊗ u. Substituting v and p in the equation of motion and taking the time average we obtain D + ρτ ) ρ∂t V + ρ(V · ∇)V = −∇ P + ∇ · (2μD where D = sym∇V is the mean stretching tensor and τ = −u ⊗ u. Hence we can say that ∇ · τ is the sole contribution of the turbulent fluctuations to the mean motion, with velocity V, and this distinguishes a turbulent flow from a laminar counterpart. Accordingly ρτ is an additional stress due to the turbulent fluctuations; τ is usually referred to as Reynolds stress. The decomposition of the instantaneous properties into mean and fluctuating parts has resulted in the occurrence of a new unknown, the Reynolds stress, without no additional equations. The system of equations is not closed. To close it we need additional equations; there are various approaches, leading to different turbulence models, that result in the closure of the system. In so doing often recourse is made to the turbulence kinetic energy defined by k=

1 2

u · u = − 21 tr τ .

Boussinesq model Boussinesq closure identifies the Reynolds stress as

316

5 Fluids

τ = 2νT D − 23 k1, where νT is the kinetic eddy viscosity. It follows that tr D = 0, which is consistent in that tr D = ∇ · V = 0. To make this assumption operative a balance equation for k is needed. Spalart–Allmaras model In this model k is not involved and the Reynolds stress is defined τ = 2νT D though now νT is defined in terms of quantities subject to appropriate transport equations. The k − ω Model The kinetic eddy viscosity νT is defined as the ratio νT =

k , ω

with k, ω satisfying the evolution (or transport) equations ∂t k + V · ∇k = τ · L − β ∗ kω + ∇ · [(ν + 21 νT )∇k], ω ∂t ω + V · ∇ω = α τ · L − βω 2 + ∇ · [(ν + 21 νT )∇ω], k with properly defined parameters α, β, β ∗ , ν, νT . It is worth observing that both equations can be viewed as diffusion equations, say k˙ = −∇ · h,

h = −γ∇k,

supplemented by appropriate terms in τ .

5.9.2 Compressible Fluids We let p = −tr T/3, T = T0 − p1 and write the balance equations in the Eulerian form

5.9 Models of Turbulence

317

∂t ρ + ∇ · (ρv) = 0, ∂t (ρv) + ∇ · (ρv ⊗ v) = −∇ p + ∇T0 , ∂t [ρ(ε + 21 v2 )] + ∇ · [ρv(h + 21 v2 )] = ∇ · (T0 v) − ∇ · q + ρr, where h = ε + p/ρ is the enthalpy density (see (2.45)). The space-time dependence of ρ motivates a time averaging that traces back to Favre [167]. For any density, per unit mass, ϕ we define ϕ˜ :=

ρϕ , ρ

ϕ1 := ϕ − ϕ. ˜

It follows that ϕ − ϕ = 0 whereas ϕ1 = ϕ − ϕ˜ = 0. We then review the balance ˜ In this connection observe equations in terms of the means ϕ, ϕ. ρϕ = ρϕ + ρϕ − ρϕ = ρϕ˜ + ρϕ − ρϕ, whence ρϕψ = ρ(ϕ˜ + ϕ1 )(ϑ˜ + ϑ1 ),

ρϕ = ρϕ, ˜

ρϕ1 = ρϕ − ρϕ˜ = ρϕ˜ − ρϕ˜ = 0.

Consequently ρϕϑ = ρ(ϕ˜ + ϕ1 )(ϑ˜ + ϑ1 ),

ρϕϑ = ρϕ˜ ϑ˜ + ρϕ1 ϑ1 .

The structure of the balance equations suggests that we let ρ = ρ + , v = v˜ + v1 ,

p = p + π, ε = ε˜ + ε1 ,

q = q + ζ, h = h˜ + h 1 ,

T0 = T0 + T , r = r˜ + r1 ,

often referred to as Reynolds decomposition and Favre decomposition, respectively. Substitution in the balance equations and evaluation of the average result in ∂t ρ + ∇ · (ρ˜v) = 0,

(5.39)

∂t (ρ˜v) + ∇ · (ρ˜vv˜ ) = −∇ p + ∇ · T0 − ∇ · (ρv1 ⊗ v1 ),

(5.40)

∂t [ρ(ε˜ + 21 v˜ · v˜ ) + 21 ρv1 · v1 ] + ∇ · [ρ˜v(h˜ + 21 v˜ · v˜ ) + 21 ρv1 · v1 v˜ ] = ∇ · [T0 − ρv1 ⊗ v1 v1 − q − ρh 1 v1 + T0 v1 − 21 ρ(v1 · v1 )v1 ] + ρ˜r .

318

5 Fluids

Two kinetic energies occur, namely the averaged kinetic energy (per unit mass) and the turbulent kinetic energy 1 ρ v˜ 2

· v˜ ,

1 ρ v1 2

· v1 .

Also by analogy with the case of incompressible fluids, the kinetic energy of turbulent character is defined to be k = 21 v 1 · v1 so that 1 ρ v1 2

· v1 = ρk.

If attention is restricted to (5.39) and (5.40) then only the quantity ρ v1 · v1 occurs of turbulent character. Still recourse is made to Reynolds stress τ by letting ˜ − 1 ∇ · v˜ 1) − 2 ρ k 1, ρτ = −ρ v1 · v1 = 2μT (D 3 3 ˜ = sym∇ v˜ . As a consequence, the effective stress in (5.40) is T0 + ρτ only where D and τ involves k of turbulent character. Turbulent models are then completed by adding an equation for k.

Part III

Non-simple Materials

Chapter 6

Rate-Type Models

Spatial interaction and memory effects are among the possible features of non-simple materials. As to memory effects in fluids, the inadequacy of the classical NavierStokes theory to describe rheological complex fluids such as geological materials, liquid foams, and polymeric fluids has led to the development of several theories of non-Newtonian fluids; Chap. 7 describes memory effects via functionals on appropriate histories. To simplify the models, a natural idea is to reduce the knowledge from that of a whole history to that of a set of time derivatives. In this sense many materials models are given in the literature to describe the non-Newtonian behaviour. Here, among others, the elastic-plastic and the Kelvin-Voigt solids and the Bingham, the Maxwell-Wiechert, and the Jeffreys fluids are shown to derive through rheological models. A material of grade n denotes a model where the constitutive functions depend on time derivatives of appropriate variables up to order n. Among such models of fluids, second grade fluids are most often investigated, which means that the pertinent variables are the velocity (gradient) and the derivative of the velocity (gradient), the Newtonian fluid being a fluid of grade 1. Especially in connection with fluids, the more generic expression fluids of differential type is used to denote fluids of higherorder grade. The restrictions placed by the objectivity principle and the second law of thermodynamics are investigated both in the Eulerian and in the Lagrangian description. In particular, the Oldroyd-B fluid and the White-Metzner fluid are examined in detail. Temperature-rate dependent thermoelastic materials are modelled and the effects on discontinuity waves are investigated. Likewise, spatial interaction might be described by functionals on the fields of appropriate physical variables. Again for operative convenience, the models are often considered by functions of space derivatives up to an order n of appropriate variables (often the displacement).

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_6

321

322

6 Rate-Type Models

6.1 Rheological Models The modelling of material behaviour is often established by combining elastic, viscous, and plastic properties. Now, springs, dashpots, and friction dampers are familiar mechanical devices which exhibit elastic, viscous, and plastic response, respectively. It is then natural that constitutive equations describing the material behaviour are often produced by relating stress and strain as are force and displacement in appropriate combinations of springs, dashpots, and friction sliders. The combinations of these elements are usually denoted as rheological models. They are especially convenient in the description of substances that have a complex microstructure (polymers, biological materials, soft matter). The adjective rheological is motivated by the fact that rheology is the study of the flow of matter, primarily in a liquid state, but also as solids under conditions in which they respond in non-elastic way. Incidentally, the term rheology was coined by E.C. Bingham. The term was inspired by the aphorism of Simplicius (often attributed to Heraclitus), pánta rheî, “everything flows”, and was first used to describe the flow of liquids and the deformation of solids. There are three basic rheological (one-dimensional) models: the linear elastic spring, the linear viscous damper, and the friction slider (or St. Venant element). It is customary to denote1 the stress by σ and the strain by ε. The linear elastic spring is governed by Hooke’s law; stress is directly proportional to strain so that, borrowing by the proportionality between force and displacement, we write σ (s) = k ε(s) ,

(6.1)

k > 0 being Young’s modulus, the analogue of the spring elastic constant. For the dashpot, (6.2) σ (d) = η ∂t ε(d) , η > 0 being the viscosity coefficient. For the friction slider ⎧ (f) ⎪ ⎨= 0 if |σ | < σ y , ∂t ε( f ) > 0 if σ ( f ) = σ y , ⎪ ⎩ < 0 if σ ( f ) = −σ y ,

(6.3)

σ y being the sliding limit (or yield stress). This device is also called rigid-perfectly plastic model. The superscripts (s), (d) and ( f ) denote quantities (stress or strain) pertaining to springs, dashpots and friction elements, respectively (Figs. 6.1 and 6.2).

1 To adhere to the literature, in rheological models σ is the stress and ε is the strain, and η is the viscosity coefficient, not to be confused with the entropy production, the internal energy, and the entropy densities.

6.1 Rheological Models

323

Fig. 6.1 A picture of spring, dashpot, and slider elements

Fig. 6.2 A picture of the rigid-perfectly plastic and elastic-perfectly plastic units

6.1.1 Rigid-Perfectly Plastic Model with Kinematic Hardening The arrangement of an elastic spring and a sliding frictional element in parallel leads to ε = ε(s) = ε( f ) , σ = σ (s) + σ ( f ) = k ε(s) + σ ( f ) , 

and then ∂t ε =

0 1 k

if |σ − kε| < σ y , ∂t σ otherwise.

(6.4)

6.1.2 Elastic-Perfectly Plastic Solid The serial arrangement of an elastic spring and a sliding frictional element dates back to Reuss and gave rise to the additive decomposition of strain into elastic and plastic components. Namely, the stress is the same whereas the strain is shared by both components, ε = ε(s) + ε( f ) = and then

1 k

σ (s) + ε( f ) ,

σ = σ (s) = σ ( f ) ,

⎧ 1 ⎪ ⎨= k ∂t σ if |σ| < σ y , ∂t ε > 0 if σ = σ y , ⎪ ⎩ 0 = ∂t  if σ = σ y , ⎪ ⎩ −σ y ∂t ε > 0 = ∂t  if σ = −σ y .

6.1.3 Elastic-Plastic Model with Kinematical Hardening The model consists of a spring and an elastic-perfectly plastic (EP) unit connected in parallel. Denote by k ( E P ) , σ ( E P ) , ε( E P ) the quantities pertaining to the EP unit and by k (s) , σ (s) , ε(s) those pertaining to the spring in parallel. Since σ = σ ( E P ) + σ (s) = σ ( E P ) + k (s) ε(s) ,

ε = ε( E P ) = ε(s) ,

we then obtain  ∂t σ =

[k (s) + k ( E P ) ] ∂t ε if |σ − k (s) ε| < σ y , if |σ − k (s) ε| = σ y . k (s) ∂t ε

(6.7)

It looks natural to take the energy for this model as the sum of the energies due to the two linear springs in the unit, E P =

2 1 (s) 2 1 k ε + ( E P ) σ − k (s) ε . 2 2k

In view of (6.7) ∂t  E P



k ( E P ) ∂t ε if |σ − k (s) ε| < σ y , 1 (s) = k ε ∂t ε + ( E P ) σ − k ε k 0 if |σ − k (s) ε| = σ y . (s)

and then the imbalance inequality (6.6) holds (Fig. 6.3).

6.1 Rheological Models

325

Fig. 6.3 A picture of the elastic-plastic unit with kinematical hardening

6.1.4 Maxwell-Wiechert Fluid In the Maxwell, or Maxwell-Wiechert, (MW) model a spring and a dashpot are placed in series so that the strain ε is shared by both components whereas the stress σ is the same, σ = σ (s) = σ (d) . ε = ε(s) + ε(d) , Owing to (6.1)–(6.2), time differentiation of σ (s) and the summation ∂t σ (s) /k + σ (d) /η give 1 1 ∂t σ + σ = ∂t ε. (6.8) k η The response to a given stress loading follows by a direct integration to obtain ε(t) − ε(t0 ) =

1 1 t [σ(t) − σ(t0 )] + ∫ σ(s)ds. k η t0

Looking at a constant stress σ, the strain increases unboundedly; that is why the scheme is regarded as a fluid model. Moreover, it follows from (6.8) that σ = η∂t ε, which is the classical behaviour of a Newtonian fluid (Fig. 6.4). Letting σ(t0 ) = σ0 and solving (6.8) in the unknown σ we find t     σ(t) = σ0 exp − (t − t0 )/τ + k ∫ exp − (t − s)/τ ∂s ε(s)ds, t0

where τ = η/k. At constant strain the stress decays with the time constant τ . We say that the model accounts for relaxation and hence that τ is the relaxation time. In particular, as t → ∞ we have σ → 0, a property that is characteristic of a fluid. Letting t0 → −∞ we obtain t   σ(t) = k ∫ exp − (t − s)/τ ∂s ε(s)ds. −∞

By the change of variable t − s = y we find

Fig. 6.4 The Maxwell-Wiechert fluid model

326

6 Rate-Type Models ∞

σ(t) = k ∫ exp(−y/τ ) ∂t ε(t − y)dy.

(6.9)

0

Thus σ(t) depends linearly on the past values of the strain rate ∂t ε via a weighted integral with the memory kernel k exp(−y/τ ). Finally, an integration by parts yields σ(t) = k ε(t) −

k∞ ∫ exp(−y/τ ) ε(t − y)dy. τ 0

(6.10)

It is natural to take the energy as that of the spring,  M W = 21 k [ε(s) ]2 =

1 2 σ . 2k

Hence, the energy of the MW fluid is a function of σ. It follows from (6.8) that ∂t  M W = (1/k)σ∂t σ = σ∂t ε − (1/η)σ 2 ≤ σ∂t ε.

6.1.5 Bingham Model The arrangement of a dashpot and a sliding frictional element in parallel gives rise to a rigid-perfectly viscoplastic unit usually denoted as Bingham fluid model or Bingham plastic element. A Bingham plastic is a viscoplastic material that behaves as a rigid body at low stresses but flows as a viscous fluid at high stress. Letting σ = σ (d) + σ ( f ) = η ∂t ε(d) + σ ( f ) , we obtain ∂t ε =

⎧ ⎪ ⎨0 1 η ⎪ ⎩1 η

ε = ε(d) = ε( f ) ,

if |σ| < σ y , (σ − σ y ) if σ > σ y , (σ + σ y ) if σ < −σ y .

(6.11)

It is used as a mathematical model of mud flow along a pipe in drilling engineering. Another common example is toothpaste. If σ y = 0 and the viscosity of the dashpot, η, has the Norton form η0 n−1 η = η0 , n ∈ N, |σ| η0 being the kinematic viscosity, then the corresponding constitutive equation is named Norton-Hoff model (Fig. 6.5).

6.1 Rheological Models

327

Fig. 6.5 A picture of the Bingham (left) and Bingham-Maxwell (right) units

6.1.6 Bingham-Maxwell Model The model consists of a spring and a Bingham unit connected in series, but can also be achieved by arranging a Maxwell unit in parallel with a sliding frictional element (Fig. 6.5). Letting ε = ε(s) + ε( B ) = we obtain

1 k

σ (s) + ε( B ) ,

⎧1 ⎪ ⎨ k ∂t σ ∂t ε = k1 ∂t σ + ⎪ ⎩1 ∂σ+ k t

1 η 1 η

σ = σ (s) = σ ( B ) ,

if |σ| < σ y , (σ − σ y ) if σ > σ y , (σ + σ y ) if σ < −σ y ,

(6.12)

When η takes the Norton form, η = η0



η0 n−1 , |σ| − σ y

n ∈ N,

then the corresponding model is named Bingham-Norton solid. Rheological models of this kind are typically developed for metals and steel but are, to some extent, used to characterize viscoplastic effects in geomaterials. A direct check shows that the imbalance inequality (6.6) holds with  B M = 2k1 σ 2 . Indeed, in view of (6.12),

∂t  B M

⎧ if |σ| < σ y , ⎪k ∂t ε 1 1 ⎨ = σ∂t σ = σ k ∂t ε − ηk (σ − σ y ) if σ > σ y , k k ⎪ ⎩ k ∂t ε − ηk (σ + σ y ) if σ < −σ y , ⎧ if |σ| < σ y , ⎪ ⎨0 = σ ∂t ε − η1 σ(σ − σ y ) if σ ≥ σ y , ≤ σ ∂t ε, ⎪ ⎩1 σ(σ + σ y ) if σ ≤ −σ y , η

328

6 Rate-Type Models

Fig. 6.6 The Kelvin-Voigt solid model

6.1.7 Kelvin-Voigt Solid The model consists of a spring and a dashpot connected in parallel (Fig. 6.6). The analogue between σ, ε and the force-displacement pair indicates that the stress is shared by both components whereas the strain is the same, namely σ = σ (s) + σ (d) ,

ε = ε(s) = ε(d) .

Hence the formal sum of (6.1) and (6.2) results in the constitutive equation σ = k ε + η ∂t ε.

(6.13)

The response to a given strain loading is then given directly. Instead, the response to stress loading is determined by solving the differential problem ∂t ε +

1 k ε = σ, η η

ε(t0 ) = ε0 ,

in the unknown ε to obtain   1 t   ε(t) = ε0 exp − (t − t0 )/τ + ∫ exp − (t − s)/τ σ(s)ds, η t0

(6.14)

where τ = η/k is referred to as retardation time. If the loading is a step stress, σ(t) = σ0 H (t − t0 ), then    σ0  ε(t) = ε0 exp − (t − t0 )/τ + 1 − exp − (t − t0 )/τ . k Hence, in the Kelvin-Voigt (KV) model the creep compliance, that is the strain response per unit stress when ε0 = 0, is given by ε(t) = K (t − t0 )σ0 ,

K (t − t0 ) =

  1 1 − exp − (t − t0 )/τ . k

The KV model is relative to a solid, as apparent by observing that, asymptotically, we recover the strain-stress relation of an elastic solid: as t → ∞,

6.1 Rheological Models

329

ε → σ0 /k. Letting t0 → −∞ in (6.14), the change of variable t − s = y yields ε(t) =

1∞ ∫ exp(−y/τ ) σ(t − y)dy, η 0

and upon an integration by parts we obtain ε(t) =

1∞ 1 σ(t) − ∫ exp(−y/τ ) ∂t σ(t − y)dy. k k 0

(6.15)

For the Kelvin-Voigt solid, it follows from (6.13) that the imbalance inequality (6.6) holds with  K V = 21 k ε2 .

6.1.8 Jeffreys’ and Burgers’ Fluids A more realistic model for a composite viscoelastic fluid is obtained by adding a second dashpot in parallel with a Maxwell-Wiechert (MW) unit. In the literature of viscoelastic fluids (including polymeric liquids) this model is known as the Jeffreys fluid 2 or the anti-Zener model.3 Denote by k ( M ) , η ( M ) , σ ( M ) , ε( M ) the quantities pertaining to the MW unit and by η (d) , σ (d) , ε(d) those pertaining to the dashpot in parallel. The common value ε = ε( M ) = ε(d) of the strain is such that σ (d) = η (d) ∂t ε,

1 k (M)

∂t σ ( M ) +

1 η(M)

σ ( M ) = ∂t ε.

Substitution of σ ( M ) = σ − σ (d) and multiplication by η ( M ) yield σ + τ ∂t σ = (η ( M ) + η (d) )∂t ε + τ η (d) ∂t2 ε,

(6.16)

where τ = η ( M ) /k ( M ) is the same relaxation time as MW unit. This equation involves three independent parameters, say η (d) , η ( M ) and k ( M ) (or τ ). Let σ(t0 ) = σ0 and the strain on [t0 , t] be given. By solving the differential equation (6.16) in the unknown σ we obtain   t   1 ( M )  (η σ(t) = σ0 exp − (t − t0 )/τ + ∫ exp − (t − s)/τ + η (d) )∂s ε(s) + η (d) ∂s2 ε(s) ds. τ t0

2 3

See e.g. [244] and the review paper by Bird and Wiest [47]. See e.g. [79, 293].

330

6 Rate-Type Models

An integration by parts results in t     σ(t) = [σ0 − η (d) ∂t ε0 ] exp − (t − t0 )/τ + η (d) ∂t ε(t) + k ( M ) ∫ exp − (t − s)/τ ∂s ε(s)ds, t0

where ∂t ε0 = limt→t0+ ∂t ε(t). As in the Maxwell-Wiechert fluid, also in this case at constant strain we have limt→∞ σ(t) = 0. Letting t0 → −∞ we obtain t   σ(t) = η (d) ∂t ε(t) + k ( M ) ∫ exp − (t − s)/τ ∂s ε(s)ds. −∞

The change of variable t − s = y yields ∞

σ(t) = η (d) ∂t ε(t) + k ( M ) ∫ exp(−y/τ ) ∂t ε(t − y)dy.

(6.17)

0

Accordingly, σ(t) is given by the present value of the strain rate ∂t ε(t) and by the values of its past history up to the present time t via a weighted integral with the memory kernel k ( M ) exp(−y/τ ). Finally, an integration by parts shows that σ(t) = η (d) ∂t ε(t) + k ( M ) ε(t) −

k (M) ∞ ∫ exp(−y/τ ) ε(t − y)dy. τ 0

(6.18)

Taking the energy for the Jeffreys fluid as that of the spring in the MW unit we obtain  J as a function of σ and ∂t ε,  J (σ, ∂t ε) =

1  ( M ) 2 1 σ = ( M ) (σ − η (d) ∂t ε)2 . ( M) 2k 2k

In view of (6.16) a direct check shows that the imbalance inequality holds, τ 1 ∂t  J = ( M ) (σ − η (d) ∂t ε)(∂t σ − η (d) ∂t2 ε) = ( M ) (σ − η (d) ∂t ε)(−σ + η (d) ∂t ε + η ( M ) ∂t ε) η η  2 1 = − ( M ) (σ − η (d) ∂t ε)2 − η (d) ∂t ε + σ∂t ε ≤ σ∂t ε. η

Otherwise, we can define  J∗ (σ, ∂t ε) =

η(M) (σ − η (d) ∂t ε)2 , 2k ( M ) (η ( M ) + 2η (d) )

and the imbalance inequality holds anyway. Indeed we have ∂t  J∗ = −

σ2 η (d) (η ( M ) + η (d) )  2 − ∂t ε + σ∂t ε ≤ σ∂t ε. η ( M ) + 2η (d) η ( M ) + 2η (d)

6.1 Rheological Models

331

Fig. 6.7 The Jeffreys and Burgers models

Moreover, each function obtained by the convex combination of  J and  J∗ satisfies the same inequality, in that   ∂t λ J + (1 − λ) J∗ = σ∂t ε − λξ 2 − (1 − λ)ζ 2 ≤ σ∂t ε,

λ ∈ [0, 1],

where ξ2 =

 2 (σ − η (d) ∂t ε)2 + η (d) ∂t ε , (M ) 1

η

ζ2 =

σ2 η (d) (η ( M ) + η (d) )  2 + ∂t ε . η ( M ) + 2η (d) η ( M ) + 2η (d)

We therefore conclude that there are an infinite number of energy functions for the Jeffreys fluid that verify the imbalance inequality (Fig. 6.7). Burgers’ fluid A collection of two MW units in parallel is referred to as Burgers’ fluid model. Since the stress is shared by both components whereas the strain is the same, then σ = σ1 + σ2 ,

ε = ε1 = ε 2 ,

where, in force of (6.8), ηi ∂t σi + σi = ηi ∂t εi . ki

i = 1, 2.

(6.19)

Upon time differentiation and multiplication by η j /k j , j = i, the formal sum of (6.19) results in η1 η2 2 η1 η2 η1 η2 ∂ σ + ∂t σ 2 + ∂t σ 1 = (k1 + k2 )∂t2 ε. k1 k2 t k1 k2 k1 k2 On the other hand the direct sum of (6.19) yields η1 η2 ∂t σ1 + ∂t σ2 + σ = (η1 + η2 )∂t ε. k1 k2 and then

η η1 η2 η2  1 ∂t σ + σ − (η1 + η2 )∂t ε. ∂t σ 2 + ∂t σ 1 = + k1 k2 k1 k2

(6.20)

332

6 Rate-Type Models

In view of this identity, (6.20) gives the constitutive rate equation of Burgers’ model η η1 η2 2 η2  η1 η2 1 ∂t σ + σ = (η1 + η2 )∂t ε + ∂t σ + + (k1 + k2 )∂t2 ε. k1 k2 k1 k2 k1 k2

(6.21)

It might seem natural to take the energy for Burgers’ model as that of the two springs. Yet, this quantity cannot be expressed as a function of ε, σ and their time derivatives. This in turn indicates that for involved materials the physical energy need not be the pertinent function of the imbalance inequality and hence the recourse to additional (internal) variables is justified. An integral form of this constitutive equation can easily be obtained by adding constitutive relations of two MW elements in the form (6.9). We obtain ∞

σ(t) = ∫ [k1 exp(−y/τ1 ) + k2 exp(−y/τ2 )] ∂t ε(t − y)dy,

(6.22)

0

where τi = ηi /ki , i = 1, 2. Various combinations of (KV, MW, SLS) units may result in models capable of reproducing the behaviour of real materials [220]. In particular, generalized KelvinVoigt and Maxwell-Wiechert models are some of the most well-known models to characterize the behavior of polymers [392].

6.1.9 Standard Linear Solid A more involved model for solids is given by the so called standard linear solid (SLS) introduced by Zener [44]. It is obtained by adding a second spring in parallel with a MW unit. Denote by k ( M ) , η ( M ) , σ ( M ) , ε( M ) the quantities pertaining to the MW unit and by k (s) , σ (s) , ε(s) those pertaining to the spring in parallel. The common value ε = ε( M ) = ε(s) of the strain is such that σ (s) = k (s) ε,

1 1 ∂t σ ( M ) + ( M ) σ ( M ) = ∂t ε. k (M) η

Substitution of σ ( M ) = σ − σ (s) and multiplication by η ( M ) yields σ + τ ∂t σ = k (s) ε + (k (s) + k ( M ) )τ ∂t ε,

(6.23)

where τ = η ( M ) /k ( M ) is the relaxation time. This equation involves three parameters, say k (s) , k ( M ) , and η ( M ) (or τ ). That is why the SLS is also named three parameter model (Fig. 6.8). The energy for the SLS is the sum of the energies of the two linear springs in the unit. Owing to (6.23) and letting

6.1 Rheological Models

333

Fig. 6.8 The standard linear solid model

S L S (ε, σ) =

2 1 (s) (s) 2 1  1 1 k [ε ] + ( M ) σ ( M ) = k (s) ε2 + ( M ) σ − k (s) ε)2 , 2 2k 2 2k

we obtain τ (σ − k (s) ε)(∂t σ − k (s) ∂t ε) η(M) 1 = k (s) ε ∂t ε + ( M ) (σ − k (s) ε)(−σ + k (s) ε + η ( M ) ∂t ε) η 1 = − ( M ) (σ − k (s) ε)2 + σ∂t ε ≤ σ∂t ε. η

∂t S L S = k (s) ε ∂t ε +

so proving the imbalance inequality. Assuming σ(t0 ) = σ0 and a given strain on [t0 , t] we solve the differential equation (6.23) in the unknown σ to obtain t

 k (s)

t0

τ

σ(t) = σ0 exp(−(t − t0 )/τ ) + ∫ exp(−(t − s)/τ )

 ε(s) + (k (s) + k ( M ) )∂s ε(s) ds.

Let ε0 = ε(t0 ). Integrating by parts we find σ(t) = [σ0 − (k (s) + k ( M ) )ε0 ] exp(−(t − t0 )/τ ) + (k (s) + k ( M ) )ε(t) k (M) t ∫ exp(−(t − s)/τ )ε(s)ds. − τ t0

(6.24)

Likewise we can solve (6.23) in the unknown ε, ε + τ ∗ ∂t ε =

1 τ σ + (s) ∂t σ, k (s) k

where τ ∗ = (1 + k ( M ) /k (s) )τ ; τ ∗ is called retardation time. We obtain   ε(t) = ε0 exp − (t − t0 )/τ ∗ +

1 k (s) τ ∗

t   ∫ exp(−(t − s)/τ ∗ ) σ(s) + τ ∂s σ(s) ds. t0

An integration by parts and some rearrangements result in

334

6 Rate-Type Models

ε(t) = ε0 −

k (s)

  1 1 σ0 exp − (t − t0 )/τ ∗ + (s) σ(t) ( M) +k k + k (M) t k ( M ) /k (s) ∫ exp(−(t − s)/τ ∗ )σ(s)ds. + ∗ (s) ( M) τ (k + k ) t0

(6.25)

The SLS accounts for both relaxation and retardation. These features are shown by means of relaxation and creep tests, respectively. A relaxation test consists of monitoring the time-dependent stress σ(t) resulting from the application of a steady strain. A creep test consists of measuring the time dependent strain ε(t) from the application of a steady stress. It is worth looking at the results of relaxation and creep tests by applying (6.24) and (6.25), respectively. Let t0 = 0. At constant strain, ε(t) = ε0 as t > 0, by (6.24) we find that the stress decays with time constant τ , σ(t) = (σ0 − k (s) ε0 ) exp(−t/τ ) + k (s) ε0 . Asymptotically the model behaves like a spring: σ → k (s) ε0 =: σ∞ as t → ∞. Likewise, at constant stress, σ(t) = σ0 as t > 0, the strain (6.25) decays with time constant τ (d) , ε(t) = (ε0 − σ0 /k (s) ) exp(−t/τ (d) ) + σ0 /k (s) . Hence, as t → ∞ we have ε → σ0 /k (s) =: ε∞ . Creep and relaxation tests can then verify whether the model fits the data and accordingly provide the values of k (s) , τ , τ (d) . Letting t0 → −∞ in (6.24) and (6.25) we obtain t

σ(t) = (k (s) + k ( M ) )ε(t) − (k ( M ) /τ ) ∫ exp(−(t − s)/τ )ε(s)ds, ε(t) =

−∞ (M)

t 1 k ∫ σ(t) + exp(−(t − s)/τ ∗ )σ(s)ds. k (s) + k ( M ) τ ∗ k (s) (k (s) + k ( M ) ) −∞

The change of variable t − s = y yields ∞

σ(t) = (k (s) + k ( M ) )ε(t) − (k ( M ) /τ ) ∫ exp(−y/τ ) ε(t − y)dy, 0

∞ 1 k (M) ∫ exp(−y/τ ∗ )σ(t − y)dy. ε(t) = (s) σ(t) + ( M) ∗ (s) (s) ( M) k +k τ k (k + k ) 0

(6.26)

Finally, we integrate by parts to find ∞

σ(t) = k (s) ε(t) + k ( M ) ∫ exp(−y/τ )∂t ε(t − y)dy,

(6.27)

0

ε(t) =

1 k

σ(t) − (s)

∞ k (M) ∫ exp(−y/τ ∗ )∂t σ(t − y)dy. k (s) (k (s) + k ( M ) ) 0

(6.28)

6.1 Rheological Models

335

6.1.10 Generalized Models A generalized Kelvin-Voigt (GKV) model is a collection of n KV units in series plus a single spring. In the j-th unit σ j = k j ε j + η j ∂t ε j ,

j = 1, ..., n.

(6.29)

constitutive The possible single spring is governed by σ0 = k0 ε0 . To determine the n εi , we equation, that is the relation between the common value of σ and ε = i=0 first solve the differential equation (6.29) for ε j , j = 1, ..., n, to obtain ε j (t) = ε j (t0 ) exp[−(t − t0 )/τ j ] +

1 t ∫ exp[−(t − s)/τ j ]σ(s)ds. η j t0

where τ j = η j /k j . The limit as t0 → −∞ results in ε j (t) =

1 t ∫ exp[−(t − s)/τ j ]σ(s)ds. η j −∞

Hence, upon summation and the change of variable t − s = y, we find the constitutive equation ∞ 1 1 (6.30) ε(t) = σ(t) + ∫ nj=1 exp[−y/τ j ]σ(t − y)dy. k0 ηj 0 Equation (6.30) is an example of constitutive relation where the strain ε, at time t, depends on the present value σ(t) of the stress and on the history of σ up to time t. The function n j=1 (1/η j ) exp(−y/τ j ) is known in the literature as the Prony series and is referred to as the creep function of the GKV model. As is apparent, it is the sum of the creep functions of each KV unit. A generalized Maxwell-Wiechert (GMW) model is a collection of n MW units in parallel plus a single spring. For any j-th MW unit, ∂t ε j =

1 1 ∂t σ j + σ j , kj ηj

j = 1, ..., n,

(6.31)

whereas n the single damper is governed by σ0 = η0 ∂t ε0 . To find the relation between σi and the common value of ε, we have to solve the differential equation σ = i=0 (6.31) for σ j , j = 1, ..., n. Letting τ j = η j /k j and taking the limit as t0 → −∞ we find that t σ j (t) = k j ∫ exp[−(t − s)/τ j ]∂t ε(s)ds. −∞

336

6 Rate-Type Models

Fig. 6.9 The generalized Maxwell-Wiechert and Jeffreys models

If k0 is the constant of the single damper then, upon summation and the change of variable t − s = y, it follows ∞

σ(t) = k0 ε(t) + ∫ 0

n j=1 k j

exp[−y/τ j ]∂t ε(t − y)dy,

whence, upon an integration by parts, σ(t) = [k0 +

n

j=1 k j ]ε(t)

∞

−∫ 0

kj n j=1 τj

exp[−y/τ j ]ε(t − y)dy.

(6.32)

The relaxation function of the GMW model n

j=1 (k j /τ j ) exp[−y/τ j ]

is the sum of the relaxation functions of each MW unit (Fig. 6.9). We consider again a collection of n MW units connected in parallel with each other. If we add a single damper (rather than a single spring) in parallel we obtain the generalized Jeffreys (or anti-Zener) model. For any j-th MW unit, ∂t ε j =

1 1 ∂t σ j + σ j , kj ηj

j = 1, ..., n,

(6.33)

whereas n the single damper is governed by σ0 = η0 ∂t ε0 . To find the relation between σi and the common value of ε = εi , i = 0, 1, ..., n, we have to solve σ = i=0 the differential equation (6.33) for σ j . Letting τ j = η j /k j and taking the limit as t0 → −∞ we find that t

σ j (t) = k j ∫ exp[−(t − s)/τ j ]∂t ε(s)ds, −∞

j = 1, ..., n.

6.1 Rheological Models

337

Upon summation and the change of variable t − s = y, we obtain ∞

σ(t) = η0 ∂t ε(t) + ∫ 0

n j=1 k j

exp[−y/τ j ]∂t ε(t − y)dy.

Finally, an integration by parts yields σ(t) = η0 ∂t ε(t) +

n

j=1 k j ε(t)

∞

−∫ 0

n j=1 (k j /τ j ) exp(−y/τ j )ε(t

− y)dy.

 We say that nj=1 (k j /τ j ) exp[−y/τ j ] is the relaxation function of the generalized Jeffreys model. As with the GMW model, it is the sum of the relaxation functions of each unit connected in parallel. The generalized Zener model is given by n SLS units connected in parallel. For any j-th SLS unit, since ε j = ε we have (see Sect. 6.1.9) ∞

(M) (M) σ j (t) = (k (s) j + k j )ε(t) − (k j /τ j ) ∫ exp(−y/τ j ) ε(t − y)dy.

(6.34)

0

Upon summation we obtain σ(t) =

∞ n  (s) (M)  ε(t) − ∫ nj=1 (k (jM ) /τ j ) exp(−y/τ j )ε(t − y)dy, j=1 k j + k j 0

which matches (6.32) provided that k j = k (jM ) and k0 =

n

(s) j=1 k j .

We then conclude that the generalized Zener model is equivalent to the GMW model.

6.1.11 Relaxation Modulus and Creep Compliance It is customary to denote by relaxation modulus the function G on [0, ∞) such that ∞

σ(t) = ∫ G(s)∂t ε(t − s)ds. 0

Likewise, the creep compliance is the function J on [0, ∞) such that ∞

ε(t) = ∫ J (s)∂t σ(t − s)ds. 0

338

6 Rate-Type Models

According to (6.9), (6.17) and (6.27) the relaxation moduli G ( M ) , G ( J ) and G (S) for the Maxwell-Wiechert, Jeffreys and SLS models are G ( M ) (s) = k exp(−s/τ ), G ( J ) (s) = η (d) δ(s) + k ( M ) exp(−s/τ ), (S)

G (s) = k

(s)

+k

(M)

(6.35)

exp(−s/τ ).

By (6.15) and (6.28) we find that the compliance moduli J ( K ) and J (S) for the KelvinVoigt and SLS models are 1 1 − exp(−s/τ ), k k 1 k (M) (S) exp(−s/τ ∗ ). J (s) = (s) − (s) (s) k k (k + k ( M ) )

J ( K ) (s) =

(6.36)

Similar expressions can be recovered when generalized models are considered. Rheological models constitute helpful suggestions in the modelling of material behaviours. In a sense they are rate-type models (see next section), in that the resulting constitutive equations involve derivatives of the pertinent variables σ, ε. Also, they may be quite involved, depending on the number of (KV, MW, SLS) units. Yet they are quite elementary in that they are linear and involve stress and strain in onedimensional pictures. We now look for rate-type models in a more general setting.

6.2 Rate-Type Models of Fluids Rate equations occur in various models of continuous bodies, mainly for materials with internal variables. Modelling the constitutive properties via internal variables may be appropriate to describe microstructural effects on the macromotion or memory features associated with motion and thermal behaviour of the body. Internal variables trace back to Duhem [427], Sect. I.8, and Bridgman [66]. The first analysis within continuum thermodynamics is given in [99] while a review on the subject is given in [303]. For definiteness, suppose the state of a solid is characterized by deformation, temperature, and an internal scalar variable α so that the evolution is given by α˙ = f (F, θ, ∇θ, α), the stress T, the free energy ψ, the entropy η, and the heat flux q too being functions of the variables F, θ, ∇θ, α. Assume the entropy flux is taken as q/θ so that the Clausius-Duhem inequality reads

6.2 Rate-Type Models of Fluids

339

1 1 −ψ˙ − η θ˙ + T · D − q · ∇θ ≥ 0. ρ ρθ ˙ θ, ˙ F˙ we find Upon computation of ψ˙ and using the arbitrariness of ∇θ, ∂∇θ ψ = 0,

η = −∂θ ψ, −∂α ψ f −

T = ρ ∂F ψ FT ∈ Sym,

1 q · ∇θ ≥ 0. ρθ

If, in particular, f is independent of ∇θ then we have ∂α ψ f ≤ 0, that is a condition on ∂α ψ and on the evolution function f . Things are more involved if α is a vector or a tensor function because α˙ is no longer an objective derivative.

6.2.1 Rate Equation for Thermo-Viscous Fluids in the Eulerian Description Borrowing from the standard linear solid (6.23) and the Maxwell-Wiechert fluid (6.8) one might guess that in the three-dimensional setting Eqs. (6.23) and (6.8) become ˙ αε˙ + βε = γT + δ T,

˙ ε˙ = cT + d T.

However T˙ and ε˙ are not objective. We might replace ε˙ with D, which is objective. Hence a conceptually-correct form of constitutive assumption might be to let the objective rate of T be a function of the chosen variables. Indeed, we regard the body as a fluid and hence it seems natural to consider the rate equation for the dissipative stress T = T + p1. Likewise, borrowing from the wide literature on the Maxwell-Cattaneo equation for the heat flux4 q + τ q˙ = −k∇θ, we might write the constitutive equation for q as a rate-type equation involving an objective derivative. It might seem natural to let p = p(ρ, ˆ θ) be related to the free energy. We will see in a moment that the occurrence of rate equations results in more involved thermodynamic schemes, unless the fluid is incompressible.

4

See, e.g., Sect. 7.3 and [402].

340

6 Rate-Type Models

For definiteness and simplicity we take rate equations involving the corotational derivative. Hence we let ◦

T = Tˆ (ρ, θ, D, ∇θ, T , q),



ˆ q = q(ρ, θ, D, ∇θ, T , q).

We therefore let the free energy ψ and the entropy η depend on the variables ρ, θ, D, ∇θ, T , q. The time derivatives of T and q take the form T − T W + Tˆ (ρ, θ, D, ∇θ, T , q), T˙ = WT ˆ q˙ = Wq + q(ρ, θ, D, ∇θ, T , q). For convenience in calculations we consider separately the rate equations of T = tr T and T 0 = T − 13 T 1; it follows T˙ 0 = WT T 0 − T 0 W + Tˆ 0 (ρ, θ, D, ∇θ, T , q),

T˙ = Tˆ (ρ, θ, D, ∇θ, T , q).

Thermodynamic consistency ˆ p depend on the variables The constitutive functions ψ, η, Tˆ0 , Tˆ , q, ρ, θ, D, ∇θ, T 0 , T , q. To examine the thermodynamic consistency of these constitutive assumptions we start with the entropy inequality ˙ + 1 T · D − 1 q · ∇θ ≥ 0. −(ψ˙ + η θ) ρ ρθ Upon computation of ψ˙ and substitution we find the inequality ˙ − ∂ ψ · (WT ˙ − ∂∇θ ψ · ∇θ T 0 − T 0 W + Tˆ 0 ) −(∂θ ψ + η)θ˙ + ρ∂ρ ψ∇ · v − ∂D ψ · D T0 p 1 1 1 ˆ + T 0 · D0 + T ∇ · v − ∇ · v − q · ∇θ ≥ 0. −∂T ψ Tˆ − ∂q ψ · (Wq + q) ρ 3ρ ρ ρθ ˙ θ˙ implies that ˙ ∇θ, The arbitrariness of D, ∂D ψ = 0,

∂∇θ ψ = 0,

Hence the entropy inequality simplifies to

η = −∂θ ψ.

6.2 Rate-Type Models of Fluids

341

T 0 − T 0 W + Tˆ 0 ) − ∂T ψ Tˆ − ∂q ψ · (Wq + q) ˆ ρ∂ρ ψ∇ · v − ∂T 0 ψ · (WT p 1 1 1 + T 0 · D0 + T ∇ · v − ∇ · v − q · ∇θ ≥ 0. ρ 3ρ ρ ρθ While this is required to hold for any pair of functions Tˆ 0 , Tˆ , qˆ of ˆ ρ, θ, D, ∇θ, T , q, detailed restrictions follow by specifying the functions Tˆ 0 , Tˆ , q. Here we define Tˆ , Tˆ , and qˆ by generalizing the classical Navier-Stokes and Fourier laws and to recover them in stationary conditions. We let 1 T 0 − 2μ D0 ], Tˆ 0 = − [T τT

1 Tˆ = − [T − (2μ + 3λ)∇ · v], τT

qˆ = −

1 [q + κ ∇θ], τq

where τT , τq , μ, λ, κ are functions of ρ and θ; moreover τT , τq > 0. Thus the evolution of T and q is governed by 1 1 1 T 0 − T 0W − T 0 − 2μD0 ], T˙ = − [T − (2μ + 3λ)∇ · v], q˙ = Wq − [T [q + κ∇θ]. T˙ 0 = WT τT τT τq

Upon substitution of the rates, the entropy inequality has the form T 0 − T 0 W) − ∂q ψ · Wq + ( −∂T 0 ψ · (WT

κ 1 ∂q ψ − q) · ∇θ + ... ≥ 0, τq ρθ

where the dots stand for terms independent of W and ∇θ. Hence it follows that ∂q ψ −

τq q = 0, ρθκ

Consequently

T 0T + (∂T 0 ψ)T T 0 + ∂q ψ ⊗ q ∈ Sym. ∂T 0 ψT

˜ θ, T 0 , T ) + τq q2 , ψ = ψ(ρ, 2ρθκ

and therefore ∂q ψ ⊗ q ∈ Sym. Further, since T ∈ Sym then it follows ∂T 0 ψ T 0 ∈ Sym. The reduced inequality can be written [ρ∂ρ ψ −

1 1 2μ + 3λ p 2μ ∂T ψ − + T ]∇ · v + ( T 0 − ∂T ψ) · D0 τT ρ 3ρ ρ τT 0 1 1 + (∂T 0 ψ · T 0 + ∂T ψ T ) + ∂q ψ · q ≥ 0 τT τq

and implies that

342

6 Rate-Type Models

ρ∂ρ ψ −

1 2μ + 3λ p ∂T ψ − + T = 0, τT ρ 3ρ

1 2μ T0− ∂T ψ = 0, ρ τT 0

1 1 (∂T 0 ψ · T 0 + ∂T ψ T ) + ∂q ψ · q ≥ 0. τT τq The first equation shows an indeterminacy between p and T /3 in that p − T /3 occurs in the thermodynamic restriction. It seems natural to let p = ρ2 ∂ρ ψ, as it happens formally in thermoelastic fluids. Here, though, ψ depends also on T 0 , T , q and hence we cannot require that p be a function of ρ, θ. A direct integration of τT τT T, ∂T 0 ψ = T0 ∂T ψ = 3ρ(2μ + 3λ) 2ρμ yields ψ = (ρ, θ) +

τq τT τT T 0 ·T 0 + T2+ q2 . 4ρμ 6 ρ(2μ + 3λ) 2ρθκ

Thus the inequality is 1 1 1 2 T 0 ·T 0 + T2+ q ≥ 0. 2ρμ 3ρ(2μ + 3λ) ρθκ The independence of T 0 , T , q implies that μ > 0,

2μ + 3λ > 0,

κ > 0.

We have thus obtained the classical thermodynamic requirements on the shear viscosity μ, the bulk viscosity μ + 3λ/2, and the heat conductivity κ.

6.2.2 Invariant Fields and Objective Rates Let T = T + p1. The invariance of the free energy ψ under Euclidean transformations is made obvious by letting ψ depend on T and q through invariants. Let R be the rotation matrix associated with the deformation gradient and given by the polar decomposition, F = RU. Under a Euclidean transformation with rotation matrix Q T QT (QR) = R T T R, R T T R → (QR)T QT Hence R T T R and R T q are invariant. Denote by

R T q → (QR)T Qq = R T q.

6.2 Rate-Type Models of Fluids

343

T := R T T R,

Q := R T q

the two invariants. To distinguish formally the behaviour in dilatations and deformations we decompose T in the trace-free part T 0 and the trace T , T = T 0 + 13 T 1. Correspondingly the invariant tensor T satisfies T = T 0 + 13 T 1,

T = tr T = T ,

T 0 = R T T 0 R.

˙ T and hence R ˙ = ZR. Observe Let Z = RR T − T ZT )R, T˙ = R T (T˙ − ZT whence

T˙ = R T T R,

˙ = R T (q˙ − Zq)R, Q

˙ = RT q Q .

The time derivative of the invariants T , Q is the transform of the (objective) Green

Naghdi derivatives T , q. Let (ρ, θ, T, Q, D, ∇θ) be the set of independent variables. Hence the free energy



ψ, the entropy η, the pressure p, and the derivatives T , q are assumed to be functions of (ρ, θ, T , Q , D, ∇θ). Evaluation of ψ˙ and substitution in the entropy inequality result in ˙ ˙ − ρ∂D ψ · D ˙ + ∂∇θ ψ · ∇θ ρ(∂ρ ψ − p/ρ2 )ρ˙ + ρ(∂θ ψ + η)θ˙ + ρ∂T ψ · T˙ + ρ∂Q ψ · Q 1 T · D − q · ∇θ ≥ 0. +T θ By the standard arguments it follows ∂D ψ = 0,

∂∇θ ψ = 0,

η = −∂θ ψ.

Again we observe ρ˙ is not independent of D. Yet we can assume p = ρ2 ∂ρ ψ, as we did in the previous section and as it happens formally in thermoelastic fluids. Here, though, we cannot require that p be a function of ρ, θ. As a consequence, the inequality reduces to ˙ + T · D − 1 q · ∇θ ≥ 0. ρ∂T ψ · T˙ + ρ∂Q ψ · Q θ

344

6 Rate-Type Models

If ψ depends separately on T 0 and T then the reduced inequality becomes 1 T∇ 3

1 ˙ ≥ 0. ˙ + ρ∂T 0 ψ · T˙0 − ρ∂Q ψ · Q · v + T 0 · D0 − q · ∇θ − ρ∂T ψ · T θ

By T 0 = R T T 0 R and Q = R T q we have



∂T 0 ψ = R∂T 0 ψR T , T˙ 0 = R T T 0 R, ∂T 0 ψ · T˙0 = ∂T 0 ψ· T 0 ,

˙ = R T q, ˙ = ∂q ψ· q ∂Q ψ · Q . ∂q ψ = R∂Q ψ, Q

Hence we formally consider T , T , q as independent variables in place of T, T , Q and take the reduced inequality in the form 1 T 3

1

∇ · v + T 0 · D0 − q · ∇θ − ρ∂T ψ · T˙ − ρ∂T 0 ψ· T 0 −ρ∂q ψ· q ≥ 0. (6.37) θ

We now consider rate equations for T 0 , T , q and show two views to determine the consequences of (6.37). Rate equations and thermodynamic restrictions For definiteness we let T 0 , T , q be given by the rate equations τT T˙ + T = (3λ + 2μ)∇ · v,

T 0 = 2μD0 , τT T 0 +T

τq q +q = −κ∇θ, (6.38) so that the Navier-Stokes-Fourier scheme is recovered in stationary conditions; in terms of T 0 , T, Q the rate equations are ˙ + T = (3λ + 2μ)∇ · v, τT T

τT T˙ 0 + T 0 = 2μRD0 R T ,

˙ + Q = −κR∇θ. τq Q

Substitute ∇ · v, D0 , ∇θ from (6.38) into (6.37) to obtain (−ρ∂T ψ +

τq τT τT

T )T˙ + (−ρ∂T 0 ψ + T 0 )· T 0 +(−ρ∂q ψ + q)· q 3(2μ + 3λ) 2μ κθ 1 1 1 T2+ T 0 ·T 0 + q · q ≥ 0. + 3(2μ + 3λ) 2μ κθ

Since ∇ · v, D0 , ∇θ are allowed to take arbitrary values then the same is true for

T˙ , T 0 , q. Hence the inequality holds if and only if

−ρ∂T ψ +

τT T = 0, 3(2μ + 3λ)

−ρ∂T 0 ψ +

τT T 0 = 0, 2μ

−ρ∂q ψ +

τq q = 0, κθ

6.2 Rate-Type Models of Fluids

345

1 1 1 T2+ T 0 ·T 0 + q · q ≥ 0. 3(2μ + 3λ) 2μ κθ The inequality then implies the classical restrictions μ > 0,

2μ + 3λ > 0,

κ > 0.

(6.39)

By direct integrations, the free energy takes the form ψ = (ρ, θ) +

τT 1 τT T2+ T 0 ·T 0 + q · q. 6ρ(2μ + 3λ) 4ρμ 2ρκθ

(6.40)

Since τT and τq are assumed to be positive then the free energy has a minimum at equilibrium, that is when T , T 0 , and q are zero.

If, instead, we substitute T˙ , T 0 , q from (6.38) into (6.37) and use the arbitrariness of ∇ · v, D0 , ∇θ we eventually find

ρ(2μ + 3λ) ∂T ψ = 13 T , τT

2ρμ ∂T 0 ψ = T 0 , τT

ρκ 1 ∂q ψ = q, τq θ

1 1 1 ∂T ψ T + ∂T 0 ψ · T 0 + ∂q ψ · q ≥ 0. τT τT τq The usual integrations provide the free energy function and hence the restrictions on μ, 2μ + 3λ, κ; the results (6.39) and (6.40) follow again.

6.2.3 Rate Equations in the Material Description It is crucial for the next developments that The material fields q R and T R R are invariant under changes of frame. To check these invariance properties we observe that under the change of frame F → F ∗ we have q∗R = J Q q(QF)−T = J Q q F−T Q = J QT Q q F−T = q R and T∗R R = J (QF)−1 QTQT (QF)−T = J F−1 QT QTQT QF−T = J F−1 TF−T = T R R . Thus the time derivatives q˙ R and T˙ R R too are invariant under a change of frame. Also by analogy with the composite models of springs and dashpots, we let the elastic and viscous behaviours of the solid be described by a stress Y, subject to a rate equation, and by the complementary part T R R − Y; it is natural to view Y as the

346

6 Rate-Type Models

dissipative stress and T R R − Y as the elastic stress. We then assume that Y and q R are governed by rate equations of the form q˙ R = −λ(q R + K∇R θ),

(6.41)

˙ = −(Y − GC). ˙ Y

(6.42)

We assume that the scalars λ, , the second-order tensor K and the fourth-order tensor G are invariant so that the rate equations (6.41) and (6.42) are invariant. We assume that λ ≥ λ0 > 0,  ≥ 0 > 0 and that K and G are invertible. In stationary conditions ˙ Y = G C. q R = −K ∇R θ, Hence K and G can be viewed as the conductivity tensor and the viscosity tensor (loss modulus) in R. We now examine the compatibility of (6.41) and (6.42) with the second law of thermodynamics via the Clausius-Duhem inequality (2.43). We let the extra-entropy flux be zero and show that the scheme proves to be compatible with the second law. Let ψ, η, q R , T R R be given by continuous functions of the set of independent variables ˙  = (θ, C, q R , Y, ∇R θ, C). Moreover let ψ be a C 2 function. The scalars λ,  and the tensors K, G are allowed to depend on the temperature θ. Inequality (2.43) then reads 1 q R ) · ∇R θ θ ¨ ≥ 0. +ρ R λ∂q R ψ · q R + ρ R ∂Y ψ · Y − ρ R ∂∇R θ ψ · ∇R θ˙ − ρ R ∂C˙ ψ · C

˙ + (ρ R λ K T ∂q ψ − −ρ R (∂θ ψ + η)θ˙ + ( 21 T R R − ρ R ∂C ψ − ρ R GT ∂Y ψ) · C R

˙ C, ¨ θ˙ implies that The arbitrariness of the values ∇R θ, ∂∇R θ ψ = 0,

∂C˙ ψ = 0,

η = −∂θ ψ.

Moreover we make the identification Y := T R R − 2ρ R ∂C ψ. Hence the inequality reduces to ˙ + (ρ R λ K T ∂q ψ − 1 q R ) · ∇R θ + ρ R λ∂q ψ · q R + ρ R ∂Y ψ · Y ≥ 0. ( 21 Y − ρ R GT ∂Y ψ) · C R R θ

˙ and ∇R θ implies that The arbitrariness of C ρ R GT ∂Y ψ = 21 Y,

ρ R λ K T ∂q R ψ =

1 qR , θ

(6.43)

6.2 Rate-Type Models of Fluids

347

ρ R λ∂q R ψ · q R + ρ R ∂Y ψ · Y ≥ 0.

(6.44)

Since G and K are invertible we can write ∂Y ψ =

1 G−T Y, 2ρ R 

and observe ∂Y ∂Y ψ =

1 G−T , 2ρ R 

∂q R ψ =

1 K−T q R ρ R λθ

∂q R ∂q R ψ =

1 K−T . ρ R λθ

The symmetry of ∂Y ∂Y ψ and ∂q R ∂q R ψ implies the symmetry of G and K, G = GT ,

K = KT .

Consequently, by integration of Eqs. (6.43) we have ψ = (θ, C) +

1 1 Y · G−1 Y + q R · K−1 q R . 4ρ R  2ρ R λθ

Hence inequality (6.44) results in Y · G−1 Y ≥ 0,

q R · K−1 q R ≥ 0.

This means that the viscosity tensor G and the conductivity tensor K are required to be positive definite. We now look for the rate equations, in the spatial description, associated with (6.41) and (6.42). First we observe that, by viewing ψ as a function of F through C, the chain rule implies ∂F ψ = 2 F ∂C ψ. Thus ρ ∂F ψ F T = 2 ρ F ∂C ψ F T and hence

2 ρ R ∂C ψ = ρ J F−1 ∂F ψ = J F−1 (ρ ∂F ψ FT )F−T .

This in turn requires that Y := T R R − 2 ρ R ∂C ψ = J F−1 (T − ρ ∂F ψ FT )F−T . Hence T := T − ρ ∂F ψ FT is the dissipative stress in the spatial description (dissipative Cauchy stress).

348

6 Rate-Type Models

Substitution of

and use of

q R = J F−1 q, J˙ = J ∇ · v,

yield

Y = J F−1T F−T ˙ F−1 = −F−1 F˙ F−1

q˙ R = J F−1 (q˙ − Lq + ∇ · v q), ˙ = J F−1 (T˙ + ∇ · v T − LT T − T LT )F−T . Y

By (6.41) and (6.42) we obtain λ q˙ − Lq + ∇ · v q + λq = − F K ∇R θ, J  ˙ FT . T − T LT + ∇ · v T + T T = F GC T˙ − LT J To express the right-hand sides by spatial gradients we use the identities F ∇R θ = F FT ∇θ, Hence we have

˙ FT F = FT (L + LT )F.

λ q˙ − Lq + ∇ · v q + λq = − F K FT ∇θ, J

(6.45)

2 T − T LT + ∇ · v T + T T = T˙ − LT F G FT DF FT . J

(6.46)

If we approximate F on the right as the unit tensor we find q˙ − Lq + ∇ · v q + λq = −λ K∇θ T − T LT + ∇ · v T + T T = 2 G D. T˙ − LT The rate equations (6.45) and (6.46) are compatible with thermodynamics, the compatibility being proved within the material description. Thus the Truesdell rates are compatible with thermodynamics and this follows by the identities 

q˙ R = J F−1 q,



˙ = J F−1 T F−T . Y

In undeformable bodies (L = 0, J = 1) Eq. (6.45) reduces to the classical Maxwell-Cattaneo model for heat conduction. In deformable materials we see that

6.2 Rate-Type Models of Fluids

349

the Truesdell derivatives for the rate equations are compatible with thermodynamics if appropriate terms involving ∇θ and D are considered. To determine T and q we go back to (6.42) and (6.41). Let λ, K, , G be known functions of time. Then by (6.42) we find t

t

−∞

τ

˙ )dτ , Y(t) = ∫ (τ )G(τ ) exp(− ∫ (s)ds)C(τ where the assumption  ≥ 0 > 0 is used. Since Y = J F−1 T F−T it follows T (t) =

t t 1 ˙ )dτ FT (t). F(t) ∫ (τ )G(τ ) exp(− ∫ (s)ds)C(τ J (t) −∞ τ

The analogue holds for q with obvious changes. As for Eqs. (6.42) and (6.41) we observe that the right-hand sides are written by analogy with Fourier’s and Navier-Stokes’ laws. Yet, in the spatial description the 

rate equation, with the rate q, is 

q = −λ(q + κ∇θ). Now, q and ∇θ have different transformation laws, from R to R, q R = J F−1 q, Hence

∇R θ = FT ∇θ.

J F−1 (q + κ∇θ) = q R + κJ C−1 ∇R θ

and the rate equation becomes q˙ R = λ(q R + K ∇R θ),

K = κJ C−1 .

Remarkably, this shows that an isotropic conductivity in the spatial description produces an anisotropic conductivity in the reference configuration, this being produced by the different properties of q and ∇θ. Upon integration we find that t

t

−∞

τ

q R (t) = ∫ κ(τ )J (τ ) exp(− ∫ λ(s)ds)C−1 (τ )∇R θ(τ )dτ whence q(t) =

t t 1 F(t) ∫ κ(τ )J (τ ) exp(− ∫ λ(s)ds)F−1 (τ )∇θ(τ )dτ . J (t) −∞ τ

It is of interest that this solution is independent on the chosen reference configuration. For, let

350

6 Rate-Type Models

˜ 0, F = FF F0 being the deformation gradient of the intermediate configuration R˜ relative to the ˜ Upon substitution it follows that reference configuration R. Let J˜ = det F. q(t) =

t t 1 ˜ F(t) ∫ κ(τ ) J˜(τ ) exp(− ∫ λ(s)ds)F˜ −1 (τ )∇θ(τ ). −∞ τ J˜(t)

Hence q is formally invariant under the change R → R˜ of the reference configuration.

6.2.4 Nonlinear Rate-Type Models of Fluids Based on the classical models of Navier-Stokes and Fourier we show how nonlinear rate-type models of fluids by letting the entropy production σ be an appropriate constitutive function. Hence we apply the standard Clausius-Duhem inequality where σ occurs in the form ˙ + T · D − 1 q · ∇θ = θσ ≥ 0. −ρ(ψ˙ + η θ) θ Remarkable rate-type equations are shown to follow by letting some time derivatives be mutually dependent. Heat-conducting fluids Heat-conducting fluids are modelled by letting the constitutive functions depend on the mass density ρ, the temperature θ, and possibly their derivatives. For formal convenience we use the Jacobian J = det F = ρ R /ρ as an independent variable in place of ρ. Hence we start with models where ˙ ∇θ)  = (J, θ, J˙, θ, is the set of variables and let ψ, η, T, q, σ be functions of . Indeed, to begin with, we let T = − p1, q = −κ∇θ, thus accounting for Pascal’s principle about pressure and Fourier’s law about q. The Clausius-Duhem inequality can be written in the form ˙ − p tr D = θζ, −ρ(ψ˙ + η θ)

ζ = σ − σF ,

σ F :=

κ |∇θ|2 , θ2

6.2 Rate-Type Models of Fluids

351

ζ being the entropy production due to mechanical dissipation. Since ζ = σ when ∇θ = 0 then ζ ≥ 0. Compute ψ˙ and substitute in the Clausius-Duhem inequality to find ˙ = −J θζ/ρ . (∂ J ψ + p/ρ R ) J˙ + (∂θ ψ + η)θ˙ + ∂ J˙ ψ J¨ + ∂θ˙ ψ θ¨ + ∂∇θ ψ · ∇θ R ˙ implies ¨ J¨, and ∇θ The arbitrariness of θ, ∂θ˙ ψ = 0,

∂ J˙ ψ = 0,

∂∇θ ψ = 0.

Consequently ψ = ψ(J, θ) and (∂ J ψ + p/ρ R ) J˙ + (∂θ ψ + η)θ˙ = −J θζ/ρ R .

(6.47)

Heat-conducting thermoelastic fluids We consider thermoelastic fluids characterized by ζ = 0 while p, η, in addition to ψ are independent of ∇θ. Hence ψ(J, θ), η(J, θ), p(J, θ) are required to satisfy the equality (6.48) (∂ J ψ + p/ρ R ) J˙ + (∂θ ψ + η)θ˙ = 0. If J˙ and θ˙ are mutually independent then it follows η = −∂θ ψ,

p = −ρ R ∂ J ψ;

the relation for p is usually referred to as the equation of state of the fluid. By way of example, let ψ = ψ0 (θ) − α(θ) ln J . Hence ρ = ρ R /J and η = −ψ0 (θ) + α (θ) ln J,

p = α(θ)ρ.

If, in particular, ψ0 (θ) = cV θ(1 − ln θ), α(θ) = βθ we find η = cV ln θ + β ln J,

p J = ρ R βθ.

In terms of the specific volume v = 1/ρ = J/ρ R we have5 η = cV ln θ + β ln ρ R v,

pv = βθ,

ε = ψ + θη = cV θ.

If, instead, θ˙ and J˙ are not mutually independent then η and p are no longer given by partial derivatives of ψ. Now, if ∂θ (∂ J ψ + p/ρ R ) = ∂ J (∂θ ψ + η),

i.e. ∂θ p = ρ R ∂ J η,

Equation pv = βθ is the equation of state of an ideal gas, with β = R/M, R being the gas constant and M the molar mass.

5

352

6 Rate-Type Models

it follows that Eq. (6.48) is integrable; there is then a function,  say, such that (θ, J ) is constant thus resulting in a connection between θ and J . As an example we let p = ρ R α(θ)/J η = −ψ0 (θ) + α (θ) ln J, and find  = ψ(θ, J ) − ψ0 (θ) + α(θ) ln J. If p + ρ R ∂ J ψ = 0 or η + ∂θ ψ = 0 we can write the relations ∂θ ψ + η ˙ J˙ = − θ ∂ J ψ + p/ρ R

or

∂ J ψ + p/ρ R ˙ θ˙ = − J. ∂θ ψ + η

Dissipative thermoelastic fluids The dissipative character is related to a nonzero value of ζ > 0. If, e.g., η + ∂θ ψ = 0 then we can write ∂ J ψ + p/ρ R ˙ J θζ . θ˙ = − J− ∂θ ψ + η ∂θ ψ + η For appropriate dependences of ζ this relation allows the description of hysteretic phenomena among θ and J , the pressure p being a parameter. Thermoviscous fluids Stricter dissipative properties and more general models are described by letting an extra stress and an extra heat flux occur so that T = − p 1 + S,

q = −κ∇θ + y.

By means of the decomposition S = S0 + 13 (tr S)1 the Clausius-Duhem inequality becomes ˙ + (− p + 1 tr S)tr D + S0 · D0 − 1 y · ∇θ = θζ, − ρ(ψ˙ + η θ) 3 θ

(6.49)

where ζ = σ − κ|∇θ|2 . Consider the referential quantities T R R = J F−1 TF−T , q R = J F−1 q, E˙ = FT DF, ∇R θ = FT ∇θ and observe ˙ ˙ |∇θ|2 = ∇R θ · C−1 ∇R θ. T · D = J T R R · E, tr D = C−1 · E˙ = tr (C−1 E), Hence we can express the stress tensor, the heat flux, and the Clausius-Duhem inequality in the form

6.2 Rate-Type Models of Fluids

T R R = − p J C−1 + S R R ,

353

q R = −κJ C−1 ∇R θ + y R ,

(6.50)

˙ − p J tr (C−1 E) ˙ + S R R · E˙ − 1 y R · ∇R θ = J θζ, −ρ R (ψ˙ + η θ) θ where ζ = σ − σ F , σ F = κ∇R θ · C−1 ∇R θ/θ2 . Some models are now sketched by having recourse to appropriate sets of variables , free energies ψ, entropy productions σ. The Navier-Stokes-Fourier fluid Heat conduction is modelled by q = −κ∇θ and then y = 0. The fluid is compressible and hence we let ˙ D0 , ∇θ)  = (J, θ, J˙, θ, be the set of variables. Time differentiation of ψ() and substitution in (6.49) yield ˙ = −θζ, ˙ 0 + ρ∂∇θ ψ · ∇θ J −1 (ρ R ∂ J ψ + p − 13 tr S) J˙ + ρ(∂θ ψ + η)θ˙ − S0 · D0 + ρ∂ J˙ ψ J¨ + ρ∂D0 ψ · D

˙ implies that ψ is indepen¨ J¨, D ˙ 0 , and ∇θ where ζ = ζ(). The arbitrariness of θ, ˙ J˙, D0 , and ∇θ. Hence ψ = ψ(J, θ) and the Clausius-Duhem inequality dent of θ, simplifies to J −1 (ρ R ∂ J ψ + p − 13 tr S) J˙ + ρ(∂θ ψ + η)θ˙ − S0 · D0 = −θζ.

(6.51)

The Navier-Stokes-Fourier fluid is characterized by ζ = ζ N S :=

1 2μ (λ + 23 μ)(tr D)2 + |D0 |2 , θ θ

σ = ζN S +

κ |∇θ|2 , θ2

where μ, 3λ + 2μ, κ are non-negative valued functions of J, θ. If θ˙ and J˙ are mutually independent and p is independent of J˙ then it follows from (6.51) that p = −ρ R ∂ J ψ,

η = −∂θ ψ,

1 (tr S)(tr D) 3

+ S0 · D0 = θζ N S .

This in turn implies that tr S = (3λ + 2μ)tr D,

S0 = 2μD0

and then T = (ρ R ∂ J ψ + λ tr D)1 + 2μD. Incompressibility implies ρ = ρ R and hence J = 1, tr D = ∇ · v = 0. Hence the set of variables reduces to ˙ D0 , ∇θ)  = (θ, θ,

354

6 Rate-Type Models

and then ψ = ψ(θ), ζ N S = (2μ/θ)|D0 |2 and p is unknown while T = − p 1 + 2μD,

μ(θ) ≥ 0.

6.3 Rate-Type Models of Solids In this section the memory properties of (viscoelastic) solids are modelled by ratetype equations. We argue within the referential description and let T R R and q R be given by (6.50). To establish constitutive rate equations we insert S˙ R R and y˙ R among the variables so that ˙ E, ˙ S˙ R R , y˙ R , ∇R θ)  R = (θ, E, S R R , y R , θ, is the set of variables. We then let ψ, η, p, and σ be functions of  R . The ClausiusDuhem inequality takes the form ρ R (∂θ ψ + η)θ˙ + [ρ R ∂E ψ + p J (1 + 2E)−1 − S R R ] · E˙ + ρ R ∂S R R ψ · S˙ R R + ρ R ∂y R ψ · y˙ R ¨ + ρR ∂ ˙ +ρ R ∂θ˙ ψ θ¨ + ρ R ∂E˙ · E S

RR

ψ · S¨ R R + ρ R ∂y˙ R ψ · y¨ R ρ R ∂∇R θ ψ · ∇R θ˙ +

1 y R · ∇R θ = −J θζ, θ

˙ θ, ¨ E, ¨ S¨ R R , y¨ R , where ζ = σ − (κ/θ2 )∇R θ · (1 + 2E)−1 ∇R θ. The arbitrariness of ∇R θ, ˙ E, ˙ S˙ R R , y˙ R and η is related to ψ in the and θ˙ implies that ψ is independent of ∇R θ, θ, classical way so that ψ = ψ(θ, E, S R R , y R ),

η = −∂θ ψ.

Though cross-coupling mechanical-thermal effects are thermodynamically allowed, we restrict attention to the case when E˙ and S˙ R R are independent of y˙ R and ∇R θ. Hence it follows [ρ R ∂E ψ + p J (1 + 2E)−1 − S R R ] · E˙ + ρ R ∂S R R ψ · S˙ R R = −J θσmech , 1 κJ ∇R θ · C−1 ∇R θ = −J θσth , ρ R ∂y R ψ · y˙ R + y R · ∇R θ − θ θ

(6.52) (6.53)

with σ = σmech + σth , σmech ≥ 0, σth ≥ 0.

6.3.1 The Kelvin-Voigt-Fourier Solid For definiteness we now establish a class of models where both mechanical dissipation and heat conduction occur. Look at (6.52) and let

6.3 Rate-Type Models of Solids

σmech =

355

ν ˙ n |E| , θJ

ν ≥ 0, n ∈ N,

where ν is possibly dependent on θ and E. As the simplest model assume ∂S R R ψ = 0 and hence, in light of the representation formula with Z = S R R − ρ R ∂E ψ − p J (1 + ˙ E| ˙ and G = 0 it follows from (6.52) that 2E)−1 , N = E/| ˙ S R R = ρ R ∂E ψ + p J (1 + 2E)−1 + ν|E˙ n−2 | E. Upon substitution of S R R in (6.50) we find ˙ ˙ n−2 E. T R R = ρ R ∂E ψ(θ, E) + ν|E| If n = 1 and ρ R ∂E ψ(θ, E) = CE then we have the linear Kelvin-Voigt model ˙ T R R = CE + ν E. If, rather, we let σmech =

1 ˙ ˙ E · NE, θJ

where N is a fourth-order tensor then we have ˙ T R R = CE + NE. We now look for a non-linear model with entropy productions σmech =

ν ˙ n |E| , θJ

σth = σ F +

λ |∇R θ|m , θ2 J

ν, λ ≥ 0, n, m ≥ 2

where ν and λ are possibly dependent on the temperature θ. For definiteness we assume ∂S R R ψ, ∂y R ψ = 0. We then apply the representation formula with ˙ E|, ˙ and G = 0 or Z = S R R − ρ R ∂E ψ − p J (1 + 2E)−1 , N = E/| −1 Z = y R − κJ C ∇R θ, N = ∇R θ/|∇R θ|, and w = 0. It follows from (6.52) and (6.53) that ˙ S R R = ρ R ∂E ψ + p J (1 + 2E)−1 + ν|E˙ n−2 | E, y R = (κJ ∇R θ · C−1 ∇R θ − J θ2 σth )

∇R θ = −λ|∇R θ|m−2 ∇R θ. |∇R θ|2

Upon substitution of S R R and y R we find T R R and q R in the form ˙ ˙ n−2 E, T R R = ρ R ∂E ψ + ν|E|

q R = −κJ C−1 ∇R θ − λ|∇R θ|m−2 ∇R θ.

If n = m = 2 then we have the linear relations

(6.54)

356

6 Rate-Type Models

˙ T R R = ρ R ∂E ψ + ν E,

q R = −κJ C−1 ∇R θ − λ∇R θ.

We may rightly refer to Eqs. (6.54) as the non-linear Kelvin-Voigt-Fourier solid.

6.3.2 Thermo-Viscoelastic Materials To describe thermo-viscoelastic materials we let ˙ T˙ R R , q˙ R , ∇R θ)  R = (θ, E, T R R , q R , E, ˙ T˙ R R , q˙ R , ∇R θ need not be mutually be the set of variables; the single variables E, independent. We consider the Clausius-Duhem inequality. By the standard arguments we ˙ E, ˙ ∇R θ, ¨ T ¨ R R , q¨ R implies observe that the arbitrariness of θ, ψ = ψ(θ, E, T R R , q R ),

η = −∂θ ψ,

and hence the Clausius-Duhem inequality reduces to qR (ρ R ∂E ψ − T R R ) · E˙ + ρ R ∂T R R ψ · T˙ R R + ρ R ∂q R ψ · q˙ R + · ∇R θ = −θσ R ≤ 0, θ where σ R = J σ. Models of thermo-viscoelastic materials are characterized by non-negative (referR = J σmech and σthR = J σth independent of q˙ R , ∇R θ, ential) entropy productions σmech ˙ T˙ R R . Let and E, R ˙ T˙ R R , 0, 0), = σ R (θ, E, T R R , q R , E, σmech

σthR = σ R (θ, E, T R R , q R , 0, 0, q˙ R , ∇R θ)

˙ T˙ R R are independent of q˙ R , ∇R θ. Hence the Clausius-Duhem inequality and assume E, splits into the two inequalities R ≤ 0, (ρ R ∂E ψ − T R R ) · E˙ + ρ R ∂T R R ψ · T˙ R R = −θσmech

ρ R ∂q R ψ · q˙ R +

qR · ∇R θ = −θσthR ≤ 0, θ

(6.55) (6.56)

Moreover assume ∂T R R ψ, ∂q R ψ = 0. Applying the representation formula (3.15) we obtain from (6.55) and (6.56) that T˙ R R = C R R E˙ − P R R + (I − N ⊗ N)G R R , where

q˙ R = −K R ∇R θ − Q R + (1 − n ⊗ n)w R , (6.57)

6.3 Rate-Type Models of Solids

N=

357

R N ∂T R R ψ T R R − ρ R ∂E ψ θσmech , CR R = N ⊗ , PR R = , |∂T R R ψ| ρ R |∂T R R ψ| ρ R |∂T R R ψ|

n=

∂q R ψ , |∂q R ψ|

KR = n ⊗

qR , ρ R θ|∂q R ψ|

QR =

θσthR n , ρ R |∂q R ψ|

G R R being an arbitrary tensor and w R an arbitrary vector. If, instead, ∂T R R ψ, ∂q R ψ = 0 then (6.55) and (6.56) reduce to R , (T R R − ρ R ∂E ψ) · E˙ = θσmech

q R · ∇R θ = −θ2 σthR .

(6.58)

As a possible consequence for T R R let M be a symmetric non-singular fourth-order ˙ ˙ and Z = M−1 [T R R − ρ R ∂E ψ], and G = tensor with ME˙ = 0. Letting N = ME/|M E| 0 by the representation formula we have T R R = ρ R ∂E ψ +

R θσmech ˙ M2 E. ˙ 2 |ME|

As for the second relation in (6.58) we observe that Fourier’s law q R = −κ∇R θ, κ being a positive-definite second-order tensor, is allowed with θ2 σthR = ∇R θ · κ∇R θ. The same result follows by using the representation formula with N = A∇R θ/|A∇R θ|, Z = A−1 q R , and w = 0. We find q R = −θ2 σthR A2 ∇R θ/|A∇R θ|2 . Fourier’s law is recovered by taking κ = A−1 q R and σthR = ∇R θ · κ∇R θ/θ2 . A class of thermo-viscoelastic models is established by the selection of quite a general pair of free energy ψ and entropy dissipation σ. Let M be a non-singular, fully symmetric, fourth-order tensor, M a non-singular, symmetric, second-order tensor and G a smooth function from Sym to Sym. Consider the free energy ψ given by E ρ R ψ = ρ R ψ0 (θ) + ∫ G (u)du + 21 [T R R − G (E)] · M(θ)[T R R − G (E)] + 21 q R · M(θ)q R , 0

where u ∈ Sym. Observe that G  (E)]T M[T R R − G (E)], ρ R ∂q R ψ = Mq R . ρ R ∂T R R ψ = M[T R R − G (E)], ρ R ∂E ψ = G (E) + [G

Moreover we let R σmech =

1 [T R R − G (E)] · M[T R R − G (E)], θτT

Hence we take

R σth =

1 q R · Mq R , θτq

τT , τq > 0.

358

6 Rate-Type Models

N=

M[T R R − G (E)] , |M[T R R − G (E)]|

n=

Mq R |Mq R |

and apply (6.57) to obtain 1 T˙ R R = [M−1 + G  (E)]E˙ − [T R R − G (E)], τT

1 1 q˙ R = − M−1 ∇R θ − q R , θ τq (6.59)

where we have chosen 1 G R R = [M−1 + G  (E)]E˙ − [T R R − G (E)], τT

1 1 w R = − M−1 ∇R θ − q R . θ τq

A linear viscoelastic model If the function G is linear, namely G (E) = G∞ E, it follows that ˙ ˙ R R + 1 (T R R − G∞ E) = G0 E, T τT

q˙ R +

1 q R = −κ∇R θ, τq

(6.60)

where G0 = G∞ + M−1 and κ = M−1 /θ; G0 and G∞ stand for the usual elastic and relaxation moduli while κ stands for the conductivity tensor. If G∞ > 0 the rate equations (6.60) represent the standard linear solid (or Zener) model with MaxwellCattaneo heat equation. If G∞ = 0 we have the Maxwell model for fluids 1 ˙ T˙ R R + T R R = G0 E, τT

q˙ R +

1 q R = −κ∇R θ. τq

The classical Maxwell model is recovered through the linear approximation, T R R  T, E  ε.

6.3.3 Thermo-Viscoplastic Models One-dimensional constitutive models for viscoplasticity, based on spring-dashpotslider elements include the perfectly viscoplastic solid, the elastic viscoplastic solid, and the elasto-viscoplastic hardening solid. As with the rheological models, the elements may be connected in series (additivity of the strain) or in parallel (additivity of the stress). Many of these one-dimensional models can be generalized to three dimensions for the small strain regime. As a relevant example we now examine the following model. The Bingham–Maxwell model

6.4 Higher-Order Rate Models

359

This model describes the behaviour of an elastic—perfectly viscoplastic solid. It is based on an elementary rheological unit where the sliding friction element and the dashpot are arranged in parallel and then connected in series to an elastic spring. The similarity with Maxwell’s and Bingham’s models makes it to be referred to as Bingham-Maxwell model. ˙ E, ˙ T˙ R R , q˙ R , ∇R θ) be the set of variables we can Letting  R = (θ, E, T R R , q R , θ, argue in the standard way and assume that cross-coupling terms, between ∇R θ, q R ˙ T˙ R R are absent; we then get inequality (6.55). Moreover assume ∂E ψ = 0. and E, Inequality (6.55) then reads R . T R R · E˙ − ρ R ∂T R R ψ · T˙ R R = θσmech

˙ N = T R R /|T R R |, G R R = 0 to Hence we apply the representation formula with Z = E, obtain TR R TR R TR R R = (ρ R ∂T R R ψ · T˙ R R + θσmech E˙ = E˙ · ) |T R R | |T R R | |T R R |2 whence TR R R E˙ = H R R T˙ R R + θσmech , |T R R |2

H R R :=

ρR T R R ⊗ ∂T R R ψ. |T R R |2

(6.61)

R The entropy production σmech is taken in the form

 R σmech =

0 if |T R R | ≤ S y , (1/θλn )|T R R |n (|T R R | − S y ) otherwise,

(6.62)

where λ, S y are positive parameters. Equations (6.61)–(6.62), with n > 1, characterize the Bingham-Norton model [228]. If, instead, n = 1 and S y = 0 the model reduces to 1 E˙ = H R R T˙ R R + T R R , λ which may be viewed as a generalized Maxwell-Wiechert model. In any case, the free energy is given by a function ψ(θ, T R R ).

6.4 Higher-Order Rate Models Non-Newtonian phenomena exhibited e.g. by asphalt and some biomaterials show properties that can be associated with different relaxation times or rather to higherorder differential equations.This view is often described by having recourse to the Burgers model [73] or the best known Oldroyd-B fluid model [348].

360

6 Rate-Type Models

6.4.1 Burgers’ Fluid Let T = − p1 + S. In the Burgers model of an incompressible fluid the extra-stress S obeys the secondorder rate equation 





S + τ S +β S = 2μD + ν D .

(6.63)

Letting τ = τ1 + τ2 , β = τ1 τ2 , μ = μ1 + μ2 , ν = 2(μ1 τ2 + μ2 τ1 ), we can split (6.63) into the sum of two polymeric (viscoelastic) parts, 

S = S1 + S2 ,

S1 + τ1 S1 = 2μ1 D,



S2 + τ2 S2 = 2μ2 D,

where μ1 , μ2 and τ1 , τ2 are thought as viscosity coefficients and relaxation times; they are expected to be positive. The thermodynamic consistency of the Burgers model is investigated in the referential version. With T = − p1 + S we associate T R R = −J pC−1 + S R R where S R R satisfies the differential equation ¨ S R R + τ S˙ R R + β S¨ R R = 2μE˙ + ν E.

(6.64)

Owing to incompressibility, ˙ 0 = ∇ · v = C−1 · E,

˙ S R R · E.

Hence the Clausius-Duhem inequality can be written in the (referential) form −ρ R (ψ˙ + η)θ˙ + S R R · E˙ = J θσ ≥ 0. We let the coefficients τ , μ, β, ν be parameterized by the temperature θ and the (constant) Jacobian J . Moreover we look at (6.64) as a rate equation of the form ˙ S˙ R R , E). ¨ S¨ R R = f(θ, J, S R R , E, ˙ S˙ R R , E ¨ be Hence we investigate the thermodynamic consistency by letting θ, S R R , E, the independent variables. We then assume the free energy ψ has the form ˙ S˙ R R , E), ¨ ψ = ψ(θ, S R R , E, and the like for η and σ R = J σ. Upon computation of ψ˙ and substitution of ψ˙ we find that η = −∂θ ψ ∂E¨ ψ = 0,

6.4 Higher-Order Rate Models

361

and ¨ − ρR ∂ ˙ − ρ R ∂S R R ψ · S˙ R R − ρ R ∂E˙ ψ · E S

RR

˙ S˙ R R , E) ¨ + S R R · E˙ = θσ R ≥ 0. ψ · f(θ, J, S R R , E,

(6.65) Inequality (6.65) is the general thermodynamic restriction for the function f. We now take f as given by the Burgers equation (6.64); substitution in (6.65) results in τ



2μρ R ν ¨ ∂ ˙ ψ · E˙ + ρ R ∂S˙ R R ψ − ∂S R R ψ · S˙ R R − ρ R ∂E˙ ψ + ∂S˙ R R ψ · E SR R − β SR R β β 1 +ρ R ∂S˙ R R ψ · S R R = θσ R ≥ 0. β ¨ implies The arbitrariness of E ∂E˙ ψ =

ν ∂˙ ψ β SR R

(6.66)

and



τ 1 2μρ R ∂S˙ ψ · E˙ + ρ R ∂S˙ ψψ − ∂S R R ψ · S˙ R R + ρ R ∂S˙ ψ · S R R = θσ R ≥ 0. SR R − R R R R β β β RR

(6.67)

Further consequences follow once a detailed function ψ is selected. We take ψ in the quadratic form ρ R ψ = ρ R ψ0 (θ) +

α1 α2 ˙ α3 ˙ 2 ˙ |S R R |2 + |S R R |2 + |E| + γ1 S˙ R R · S R R + γ2 S R R · E˙ + γ3 S˙ R R · E. 2 2 2

(6.68)

Upon substitution of ˙ ρ R ∂E˙ ψ = γ2 S R R + γ3 S˙ R R + α3 E,

˙ ρ R ∂S˙ R R ψ = γ1 S R R + α2 S˙ R R + γ3 E,

into (6.66) it follows that ν γ2 = − γ1 , β

ν γ3 = − α2 , β

ν α3 = − γ3 . β

Since also ρ R ∂S R R ψ = α1 S R R + γ1 S˙ R R + γ2 E˙ then the reduced inequality can be written as ˙ 2 + 2 A12 S˙ R R · S R R + 2 A13 S R R · E˙ + 2 A23 S˙ R R · E˙ = θσ R , A11 |S R R |2 + A22 |S˙ R R |2 + A33 |E|

(6.69) where A11 =

γ1 , β

A22 =

τ α2 − γ1 , β

A33 = −

2μγ3 , β

362

2 A12 =

6 Rate-Type Models

τ γ1 α2 + − α1 , β β

2 A13 = 1 +

γ3 2μγ1 − , β β

2 A23 =

τ γ3 2μα2 − γ2 − . β β

Thus σ R is positive definite if the matrix A is positive definite. This is apparently the case if A is a diagonal matrix with positive diagonal entries namely when γ1 > 0, γ3 > 0, α2 >

βγ1 τ γ1 τ γ3 α2 2μα2 , α1 = + , γ3 = 2μγ1 − β, γ2 = − . τ β β β β

Necessary and sufficient conditions for the definiteness of σ R follow by evaluating the principal minors of A, which are required to be positive. Here we state the result. Proposition. The matrix A is positive definite, and hence the Burgers model (6.64) with free energy (6.68) is thermodynamically consistent, if and only if one of the following cases occur, ν = 0, μ > 0, β < 0; ν = 0, μ = 0, τ /ν ≥ 0; ν = 0, μ > 0, ντ ≥ 2μβ. The three thermodynamically-consistent models are then represented as follows, ˙ S R R + τ S˙ R R + β S¨ R R = 2μE,

β < 0, μ, > 0,

¨ S R R + τ S˙ R R + β S¨ R R = ν E, ¨ S R R + τ S˙ R R + β S¨ R R = 2μE˙ + ν E,

τ /ν ≥ 0, μ > 0, ντ > 2μβ.

˙ S˙ R R , E) ¨ compatible More general, possibly non-linear, functions f(θ, J, S R R , E, with thermodynamics follow from (6.65) and the representation formula (A.8). Letting N = ∂S˙ R R ψ/|∂S˙ R R ψ| we find that ¨ − θσ R ) S¨ R R = (S R R · E˙ − ρ R ∂S R R ψ · S˙ R R − ρ R ∂E˙ ψ · E

∂S˙ R R ψ ρ R |∂S˙ R R ψ|2

∂S˙ R R ψ

∂S˙ R R ψ ⊗ + I− G, |∂S˙ R R ψ| |∂S˙ R R ψ|

(6.70) ˙ S˙ R R , E. ¨ In the special case G being any second-order tensor function of θ, J, S R R , E, σ R = 0 and G = 0 we have ¨ S¨ R R = (S R R · E˙ − ρ R ∂S R R ψ · S˙ R R − ρ R ∂E˙ ψ · E)

∂S˙ R R ψ . ρ R |∂S˙ R R ψ|2

As a check of the consistency of the whole approach we now ask whether (6.70) leads to (6.64) as a possible case, provided, of course, ψ and σ R are thermodynamically admissible. To fix ideas we consider the first case where ν = 0 and μ > 0, β < 0; hence the expected equation is ˙ β S¨ R R = −S R R − τ S˙ R R + 2μE. If ν = 0 then ψ and σ R have the form

(6.71)

6.4 Higher-Order Rate Models

ρ R ψ = ρ R ψ0 (θ) + Thus N :=

363

τ β ˙ 1 β ˙ 2 |S R R |2 + S R R · S R R , θσ R = |S R R |2 − |S R R | . 4μ 2μ 2μ 2μ

∂S˙ R R ψ SR R ˆ = =: S, ˙ |S |S R R ψ| RR|

∂S˙ R R ψ 2μ Sˆ = . β |S R R | ρ R |∂ S˙ R R ψ|2

Substitution in (6.70) yields ˆ ˆ E˙ − τ S˙ R R ] − S R R + β(I − Sˆ ⊗ S)G. β S¨ R R = (Sˆ ⊗ S)[2μ Letting βG = 2μE˙ − τ S˙ R R we obtain Eq. (6.71). While the representation formula, here (6.70), is likely to generate a class of nonlinear equations, the (admissible) linear equations of rate-type models appear to be special cases.

6.4.2 Oldroyd-B Fluid The Oldroyd-B model describes an incompressible fluid consisting of a solution of a (viscoelastic) polymer in a viscous solvent. Accordingly the stress T is (taken as) the sum of two terms, T = 2μs D + Y, 2μs D being the viscous stress within the solvent and Y the viscoelastic stress of the polymer. The stress Y is assumed to be given by the rate equation 

Y + τ1 Y= 2μ p D.

(6.72)

If μs is constant then we obtain 







T= 2μs D + Y= 2μs D +

1 (2μ p D − T + 2μs D). τ1

Hence we have the Oldroyd-B fluid model6 



T + τ1 T = 2μ(D + τ2 D)

(6.73)

where μ = μ p + μs and τ2 = μs τ1 /μ. As an aside, by assumption the Oldroyd model applies to incompressible fluids. For compressible fluids the corresponding model should involve the Truesdell rate 6

The name Oldroyd-B is due to the chosen derivative; the alternative model Oldroyd-A involved the Cotter-Rivlin derivative, considered inappropriate.

364

6 Rate-Type Models 



Y rather than the upper convected derivative Y. Formally, the constitutive equation for T is obtained by replacing the time derivative, within the Maxwell model, with the upper convected derivative. That is why we can refer to equation (6.72) as upper convected Maxwell model. Moreover, (6.73) can be viewed as the direct extension of the Jeffreys model where the upper convected derivative replaces the time derivative. Hence the model (6.73) is also referred to as the upper convected Jeffreys model. If we let the scalar μs be a function of, e.g., the temperature θ then 





T= 2μs D +2∂θ μs θ˙ D+ Y and (6.73) has to be replaced by 



˙ + 2μτ2 D . T + τ1 T = 2(μ + τ1 ∂θ μs θ)D Nonlinear models occur of polymer melts that may be viewed as generalizations of (6.72). Doi-Edwards theory for polymer melts results in the differential equation 

T +τ Y+

2τ (D · T )Y − G 1 = 2μD 3G

for the stress Y determined by the stretching D. The time τ , the viscosity μ and the pressure G are three parameters characterizing the model. To investigate the thermodynamic consistency of (6.73) we start from its analog in the reference configuration, ¨ T R R + λT˙ R R = 2μE˙ + ν E.

(6.74)

We assume that the free energy ψ and the entropy η are functions of the set of variables ˙ E, ˙ E). ¨  R = (θ, E, T R R , θ, The Clausius-Duhem inequality reads − ρ R (∂θ ψ + η)θ˙ + (T R R − ρ R ∂E ψ) · E˙ − ρ R ∂T R R ψ · T˙ R R ... ¨ − ρ R ∂E¨ ψ · E = J θσ ≥ 0. −ρ R ∂θ˙ ψ θ¨ − ρ R ∂E˙ ψ · E

(6.75)

... ˙ Hence the arbitrariness of E , θ, ¨ and next of θ˙ implies Assume η is independent of θ. ∂E¨ ψ = 0,

∂θ˙ ψ = 0,

η = −∂θ ψ.

For definiteness we also let ∂E ψ = 0. Hence (6.75) yields ¨ = J θσ ≥ 0. T R R · E˙ − ρ R ∂T R R ψ · T˙ R R − ρ R ∂E˙ ψ · E

(6.76)

6.4 Higher-Order Rate Models

365

For a generic model described by  R Eq. (6.76) is required to hold for pairs of ˙ and σ( R ) ≥ 0. Yet here we investigate the thermodynamic functions ψ(θ, T R R , E) ¨ is not arbitrary and we replace E ¨ from (6.74) consistency of (6.74). Hence, e.g., E to obtain νT R R · E˙ − ρ∂E˙ ψ · T R R − ρ R λ∂E˙ ψ · T˙ R R − ρ R ν∂T R R ψ · T˙ R R + 2μρ R ∂E˙ ψ · E˙ = ν J θσ. We look for the function ψ in the form ˙ 2, ρ R ψ = ρ R ψ0 (θ) + γ|λT − ν E| ˙ to be determined. Since where γ is a parameter, independent of T R R and E, ˙ ρ R ∂T R R ψ = 2γλ(λT − ν E),

˙ ρ R ∂E˙ ψ = −2γν(λT − ν E),

then we have ˙ 2 = J θσ ≥ 0. [1 − γ(2ν + 4μλ)]T R R · E˙ + 2γλ|T R R |2 + 4μγν|E| The inequality holds if γ=

1 , γλ ≥ 0, 2(ν + 2μλ)

μγν ≥ 0.

Consequently ψ = ψ0 (θ) +

1 ˙ 2, |λT R R − ν E| 2(1 + 2μλ)

J θσ =

λ/ν 2μ ˙ 2. |T R R |2 + |E| 1 + 2μλ/ν 1 + 2μλ/ν

The thermodynamic consistency holds if7 μ > 0, λν > 0.

6.4.3 White-Metzner Fluid The White-Metzner fluid [182, 445] is similar to the Oldroyd-B model (6.73). It involves both the rate of the stress and the rate of the stretching. The coefficients μ, λ, ν are assumed to depend on D but λ and ν, viewed as relaxation times, are taken to be equal. Further the objective time derivative, of T and D, is taken to be a combination of the Oldroyd and Cotter-Rivlin derivatives. Here we follow the Eulerian description and consider the model equation in the form ◦ ◦ (6.77) T + λ T= 2μD + ν D, 7

As in other models, the (positive) sign of the relaxation times 1/λ, 1/ν is given by boundedness requirements of the solution; thermodynamics gives the (positive) sign of the product.

366

6 Rate-Type Models

where the objective derivative is the corotational one. Consistent with the literature we let the fluid be incompressible. However the coefficients λ, ν are not assumed to be equal and we let them be given by functions μ(θ, p, D), λ(θ, p), ν(θ, p), the pressure p being a parameter. To guarantee the boundedness of the solution T or D of (6.77) we assume λ, ν > 0. Let ◦  = (θ, T, D, D) ◦

be the set of variables and consider ψ, η, and σ as functions of  while T is given by (6.77). Upon substitution of ψ˙ in the Clausius-Duhem inequality we have ◦

˙ − ρ∂ ◦ ψ · D ˙+ T · D = θσ ≥ 0. −ρ(∂θ ψ + η)θ˙ − ρ∂T ψ · T˙ − ρ∂D ψ · D D



By (6.77) we can write T˙ =T +WT + TWT in the form ◦ ˙ = 1 (2μD + ν D −T) + WT + TWT . T λ

By definition,



(6.78)



¨ − (WD)˙− (DWT )˙. D ˙= D

¨ and θ˙ occur linearly in the Clausius-Duhem inequality; their arbitrariness Hence D implies ∂ ◦ ψ = 0, η = −∂θ ψ. D



˙ − WD − DWT , makes the inequality in Substitution of T˙ from (6.78), with D= D the form 1 ˙ + T · D = θσ ≥ 0. ˙ − νWD − νDW T − T) + WT + TW T ] − ρ∂D ψ · D − ρ∂T ψ · [ (2μD + ν D λ

(6.79)

We now have to find a pair of functions ψ, σ satisfying inequality (6.79). The occurrence of the terms WD and WT indicates that ψ might be a function of a combination of D and T. Accordingly we let ψ = ψ(θ, p, ξ), Hence

∂T ψ = 2ψ  (λ2 T − λνD),

ξ := |λT − νD|2 . ∂D ψ = 2ψ  (ν 2 D − λνT),

where ψ  denotes the derivative with respect to ξ. Inequality (6.78) then becomes

6.4 Higher-Order Rate Models

367

˙ − T) − 2ρψ  (ν 2 D − λνT) · D ˙ +T·D −2ρψ  (λT − νD) · (2μD + ν D +2ρψ  (λT − νD) · [ν(WD + DWT ) − λ(WT + TWT )] = θσ ≥ 0 Observe that, by the symmetry of D and T, D · (WD) = D · (DWT ) = 0, D · (WT) + T · (WD) = (DT + TD) · W = 0, and the like by interchanging D and T. Hence the occurrence of W, induced by the corotational derivative, results in a zero entropy production. Moreover we have ˙ + 2ρψ  (ν 2 D − λνT) · D ˙ = 0. 2ρψ  (λT − νD) · ν D The inequality then reduces to T · D(1 − 4ρμλψ  − 2ρνψ  ) + 2ρλψ  |T|2 + 4ρμνψ  |D|2 = θσ ≥ 0. Hence it follows that ψ  > 0,

 |1 − 4ρμλψ  − 2ρνψ  | ≤ 4ρ 2λμν |ψ  |.

μ > 0,

A sufficient condition is obtained by letting 1 − 4ρμλψ  − 2ρνψ  = 0 whence ψ =

1 , 2ρ(ν + 2μλ)

ψ = ψ0 (θ, p) +

1 |λT − νD|2 . 2ρ(ν + 2μλ)

Consequently the entropy production σ is given by θσ =

λ 2μν |T|2 + |D|2 . ν + 2μλ ν + 2μλ

6.4.4 Wave Features in Higher-Order Rate Models The Burgers model is usually applied in the Eulerian version so that 

T = − p1 + S,





S + λ S +β S = 2μD + ν D .

In the Burgers model the extra-stress S describes an incompressible fluid and obeys the second-order rate equation 





S + λ S +β S = 2μD + ν D .

368

6 Rate-Type Models

Plane longitudinal waves in higher-order rate models are investigated with some approximations. The pressure is assumed to be uniform and hence no pressure contribution occurs in the equation of motion. Plane longitudinal waves denote that the spin W is zero. The equation of motion and the constitutive equation are considered  in the linear approximation which means that T T˙  ∂t T. In addition ρ is constant ˙  ∂t T. Letting x be the coordinate in the direction of propagation we and v˙  ∂t v, T let the stress and the displacement depend on x and t in the form exp[i(kx − ωt)]. With these approximations, the equation of motion and the rate equation allow us to write T + λ∂t T + β∂t2 T = μ∂x v + ν∂x ∂t v. ρ∂t v = ∂x T, Applying ∂x to the second equation and replace ∂x T from the first one we have ρ∂t v + ρλ∂t2 v + ρβ∂t3 v = μ∂x2 v + ν∂x2 ∂t v.

(6.80)

The dependence on exp[i(kx − ωt)] of v in (6.80) implies that the wave number k and the angular frequency ω are subject to iρω − ρλω 2 − iρβω 3 = μk 2 − iνk 2 ω. At high frequencies we have

ρβ k2 . = ω2 ν √ Waves propagate with phase speed ν/ρβ, with νβ > 0. If β = 0 (Oldroyd model) then we have i(νλω − μ/ω) + μλ + ν k2 = −ρ . ω2 μ2 + ν 2 ω 2 The real part is negative or positive according as λ, ν > 0 or λ, ν < 0. Let k = kr + iki and hence k 2 = kr2 − ki2 + 2ikr ki . It follows that k 2 > 0, and hence kr2 > ki2 , if λ, ν < 0 and vice versa. Instead one expects that k 2 > 0 so that kr ki > 0 (the wave amplitude decays while it propagates) and this is true only depending on the frequency range.

6.5 Wave Equation in Hereditary Fluids Nonlinear acoustic wave propagation in viscous fluids with memory has been modelled with a third-order integro-differential equation. Here we outline the main steps leading to the model.

6.6 Further Thermoelastic Models

369

We follow the Navier-Stokes-Fourier model of the fluid and hence state the balance of mass, linear momentum, and energy in the form ∂t  + ∇ · (ρv) = 0, ρ∂t v + ρ(v · ∇)v = −∇P + μv + ζ∇(∇ · v), ρ(∂t e + (v · ∇)e = −∇ · q + 2μD0 · D0 + ζ(tr D)2 , where ζ = λ + 2μ/3). Let p = p0 + P, ρ = ρ0 + , η = η0 + s, ε = ε0 + e, θ = θ0 + ϑ, where p0 , ρ0 , η0 , ε0 , θ0 are the values at equilibrium. We take P in the form P=A

B   + ( )2 + ∂η p s, ρ0 2 ρ0

where A = ρ0 ∂ρ p = ρ0 c2 , B = ρ20 ∂ρ p. Hence we can write P = c2  +

c2 B 2  + ∂η p s. ρ0 2 A

Finally we take the standard Fourier law in the form q = −κ∇ϑ.

6.6 Further Thermoelastic Models There are thermodynamic schemes that lead to the heat conduction inequality q · ∇θ ≤ 0 as the reduced entropy inequality. This inequality in turn is apparently satisfied by Fourier’s law q = −κ∇θ, κ > 0. If, further, the body is undeformable and the internal energy ε depends only on the temperature θ then the balance of energy results in the partial differential equation θ˙ = kθ + g(x, t) where k = κ/ρ c, c = dε/dθ. An initial temperature field θ(x, 0) with compact support is felt at any time t > 0 at every point outside the support; we say that the speed of propagation is infinite. This seems to be paradoxical from a physical point of view and is in contrast with low-temperature experimental observations.8 To overcome this 8

See, e.g., [86] and references therein.

370

6 Rate-Type Models

drawback many schemes have been elaborated. The rate-type models of the previous section can be viewed within this line. Here we pursue this line and emphasize some features of the well-known Maxwell-Cattaneo equation and establish a generalized form. Next we review some uncommon approaches developed in the literature.

6.6.1 Maxwell-Cattaneo Equation; a Generalized Form Physical heuristic arguments indicate [84, 306, 431] that in non-stationary processes Fourier’s law of heat conduction in rigid solids might be replaced by the rate-type equation τ q˙ + q = −κ∇θ, τ > 0. (6.81) Equation (6.81) is named after Maxwell-Cattaneo (for short MC equation); it reduces to Fourier’s law in stationary conditions (q˙ = 0) or, formally, if the relaxation time τ vanishes. The thermodynamic consistency of (6.81) along with the balance of energy, ρε˙ = −∇ · q + ρr, is proved e.g. by letting θ, q, ∇θ be the set of independent variables (see Sect. 7.8.3). While κ and τ are often viewed as constants, a simple improvement is obtained by letting κ, τ be functions of θ. The fit of experimental data or physical aspects might indicate that a rate-type equation would be in order for an appropriate vector, say w, in place of q [331]. Here we follow this idea by paralleling the model in [330]. We look for a generalized MC equation in the form ˙ + w = −m∇θ, w

(6.82)

where w is the sought analogue of q; the scalar m and the tensor  are allowed to depend on θ. Let , N be the free energy and the entropy per unit volume. We assume that , N , q are functions of θ, w, ∇θ while w is subject to (6.82). The Clausius-Duhem inequality takes the form 1 ˙ + ∂∇θ  · ∇ θ˙ + q · ∇θ ≤ 0. (∂θ  + N )θ˙ + ∂w  · w θ ˙ from (6.82) results in Substitution of w 1 (∂θ  + N )θ˙ + ( q − m∂w ) · ∇θ − ∂w  · w + ∂∇θ  · ∇ θ˙ ≤ 0. θ ˙ θ˙ implies The arbitrariness of ∇ θ, ∂∇θ  = 0,

N = −∂θ .

6.6 Further Thermoelastic Models

371

As for q(θ, w, ∇θ) we observe that a dependence of q on ∇θ would result in a Fourier-like equation. Hence we consider q(θ, w); the linearity of ∇θ implies that 1 q = m∂w , θ

∂w  · w ≥ 0.

To make the model operative we have to determine the vector w. We require the consistency between Fourier’s law and the stationary version of (6.82), w = −m−1 ∇θ,

q = −K∇θ,

K being the conductivity tensor. Hence we assume w = m−1 K−1 q. Consequently ∂w  =

1 Kw, θm 2

 = 0 (θ) +

and hence 0 ≤ ∂w  · w =

1 w · Kw 2θm 2

1 w · Kw. 2θm 2

Irrespective of the form of  it follows that K has to be positive definite, as expected. We still need to determine  and m. By definition the energy E (per unit volume) is  K K   w, −θ E = 0 − θ0 + w · 2 2θm 2θm 2 the prime denoting the derivative with respect to θ. The energy then depends only on the temperature if K  K −θ = 0, 2 2θm 2θm 2 which happens if K = θ, 2θm 2  being a constant tensor. Hence  = 2θ2 m 2 K−1 ,

w=

1  −1 q. 2θ2 m

The model is conclusive once the function m is selected.

372

6 Rate-Type Models

6.6.2 Temperature-Rate Dependent Thermoelastic Materials Borrowing from [288] and [52] we allow for the dependence on the temperature rate ˙ Also to account for objectivity, it is convenient to follow the material formulation. θ. Hence we let ψ R , η R , T R R , q R , k R be functions of ˙ C, θ, ∇R θ, θ. We assume that the free energy ψ R and the extra-entropy k R are differentiable while η R , T R R , and q R are continuous. To determine the thermodynamic restrictions we then consider the entropy inequality in the material formulation, ˙ + 1 TR R · C ˙ − 1 q R · ∇R θ + θ∇R · k R ≥ 0. −(ψ˙ R + η R θ) 2 θ Upon computation of ψ˙ R and ∇ R · k R we obtain ˙ − ∂∇R θ ψ R · ∇R θ˙ − ∂θ˙ ψ R θ¨ − 1 q R · ∇R θ −(∂θ ψ R + η R )θ˙ + ( 21 T R R − ∂C ψ R ) · C θ ˙ ≥ 0. +θ(∂C k R · ∇R C + ∂θ k R · ∇R θ + ∂∇R θ k R · ∇R ∇R θ + ∂θ˙ k R · ∇R θ) ¨ ∇R C, ∇R ∇R θ and C, ˙ ∇R θ˙ implies that The arbitrariness of θ, ∂θ˙ ψ R = 0,

∂C k R = 0,

T R R = 2 ∂C ψ R ,

∂∇R θ k R ∈ Skw,

−∂∇R θ ψ R + θ∂θ˙ k R = 0.

The entropy inequality reduces to 1 −(∂θ ψ R + η R )θ˙ − ( q R − ∂θ k R ) · ∇R θ ≥ 0. θ As we see shortly, the qualitative novelty of this model is the dependence of the entropy on the temperature rate. Hence, for simplicity, we let k R = 0, which implies that 1 ψ R = ψ R (C, θ). (∂θ ψ R + η R )θ˙ + q R · ∇R θ ≤ 0, θ Moreover we assume η is independent of ∇R θ and let ˙ η R = η0 (C, θ) + λ(C, θ)θ. It follows that η0 = −∂θ ψ R ,

1 λθ˙2 + q R · ∇R θ ≤ 0. θ

6.6 Further Thermoelastic Models

373

Letting ∇R θ = 0 we find the condition λ ≤ 0. Fourier’s law in the form q R = −K(C, θ)∇R θ, K being positive definite completes a possible, thermodynamically consistent scheme. To sum up, in addition to Fourier’s law we have ψ R = ψ R (C, θ),

˙ η R = −∂θ ψ R + λ(C, θ)θ,

T R R = 2 ∂C ψ R .

Second-order waves We now investigate the compatibility of this scheme with wave propagation at a finite speed. Let  be a time-dependent surface in the reference configuration R and denote by n R the unit normal. The surface  is a second-order wave (or acceleration ˙ ∇R θ are continuous functions of X and t in a neighbourhood N wave) if x, x˙ , F, θ, θ, ¨ ˙ ˙ of , x¨ , F, θ, ∇R θ, ∇R ∇R θ and all higher-order derivatives suffer jump discontinuities across  but are continuous functions of X and t in N \ . As a consequence of the constitutive equations it follows that [[ψ R ]] = 0, [[η R ]] = 0, [[T R R ]] = 0, [[q R ]] = 0 across . Define a := [[

d 2x ]], dn 2R

ϑ := [[

d 2θ ]] dn 2R

where d/dn R = n R · ∇R . Hugoniot’s compatibility conditions imply that ˙ = −U R a ⊗ n R , [[F]]

[[∇R F]] = a n R ⊗ n R , [[∇R ∇ R θ ]] = ϑ n R ⊗ n R ,

[[¨x]] = U R2 a,

[[∇R θ˙ ]] = −U R ϑ n R ,

[[θ¨ ]] = U R2 ϑ.

The discontinuities a, ϑ are subject to the balance equations. We let ρ R and b be continuous across . Hence by the equation of motion ρ R v˙ = ∇R · T R + ρ R b it follows ρ R [[˙v]] = [[∇R · T R ]]. ˙ R ]]n R then Since [[∇R · T R ]] = −[[T ˙ R ]]n R . − U R3 ρ R a = [[T Moreover T R = FT R R = 2F∂C ψ R (C, θ) and then

˙ ˙ + ∂θ ∂C ψ R θ). ˙ R R + 2F(∂C2 ψ R C T˙ R = FT

(6.83)

374

Now,

6 Rate-Type Models

˙ = −U R (n R ⊗ FT a + FT a ⊗ n R ). ˙ = [[F˙ T ]]F + FT [[F]] [[C]]

Let A = ∂C2 ψ R ; by definition A enjoys the major and minor symmetries, A H K M N = A M N H K and A H K M N = A K H M N = A H K N M . Hence we find ˙ R ]]n R = −U R Qa [[T where Q = (n R · T R R n R )1 + n R FAFT n R . Consequently (6.83) implies (Q − ρ R U R2 1)a = 0,

(6.84)

which is the analogue of the propagation condition for acceleration waves in elastic solids. This is so because ∂θ ∂C ψ R θ˙ is continuous across . We now examine the balance of energy. Since ˙ ε R = ψ R + θη R = ψ R − θ∂θ ψ R + θλ θ, where λ = λ(C, θ), we find

[[ε˙R ]] = θλ[[θ¨ ]].

Hence the balance of energy yields ˙ ρ R θλ[[θ¨ ]] = −[[∇R · q R ]] + T R R · [[C]]. Since −[[∇ R · q R ]] =

1 1 ˙ = n R · Kn R ϑ [[q˙ R ]] · n R = − n R · K[[∇R θ]] UR UR

then the balance of energy results in (ρ R θλU R2 − n R · Kn R )ϑ = −2U R (FT R R n R ) · a.

(6.85)

Two possibilities occur. If ρ R U R2 is an eigenvalue of Q, for an appropriate direction n R , then by (6.84) it follows that a propagates with that speed U R . Correspondingly ϑ is determined by (6.85) in the form ϑ=−

2U R (FT R R n R ) · a. ρ R θλU R2 − n R · Kn R

6.6 Further Thermoelastic Models

375

Otherwise, if ρ R θλU R2 − n R · Kn R = 0 then (6.85) implies a = 0. However, λ < 0 and K > 0 do not allow for this condition. ˙ so that λ = 0, allows the propagation Thus the dependence of the entropy on θ, of thermal waves, [[θ¨ ]] = 0, with no contribution from heat conduction (K = 0). If, instead, λ = 0 then only waves occur with speed U R such that ρ R U R2 is an eigenvalue of Q. In this case ϑ is given by ϑ=

2U R (FT R R n R ) · a. n R · Kn R

Remark 6.1 The model with λ = 0 might appear surprising in that it allows wave propagation with finite speed though without the seemingly crucial dependence of ˙ Now, Eq. (6.85), with λ = 0, shows that ϑ is related to a and ϑ = 0 if a = 0. η R on θ. ˙ = 0. Hence the customary paradox Moreover a = 0 if the body is undeformable, C of infinite speed of propagation is based also on the assumption that the body is undeformable.

6.6.3 Thermoelasticity Based on an Integral Variable Two schemes about heat flows have been established by Green and Naghdi [210, 211], within a re-examination of the basic postulates of thermomechanics. Constitutive equations are given especially for the flow of heat, with particular reference to the propagation of thermal waves at finite speed. The schemes involve the so-called thermal displacement α and consider ∇α, instead of ∇θ, among the state variables. Both the absolute temperature and an empirical temperature are considered.9 In essence, though, the content of thermal displacement is provided by restricting attention to a single temperature, the absolute one. That is why here we consider only the absolute temperature. The thermal displacement α on R × R is defined by α˙ = θ. The thermal displacement gradient β is defined by β = ∇R α. It follows that

β˙ = ∇R α˙ = ∇R θ = FT ∇θ.

Thermoelasticity without energy dissipation

9

Similar approaches involving two temperatures are reviewed in [391].

376

6 Rate-Type Models

As the constitutive assumption we assume that the free energy ψ, the entropy η, the Cauchy stress T, and the heat flux q are functions of θ, ∇R α, F; the extra-entropy flux is assumed to be zero. The constitutive equations are required to satisfy the entropy inequality ˙ + T · D − 1 q · ∇θ ≥ 0. −ρ(ψ˙ + η θ) θ Upon computation of ψ˙ and substitution we find 1 −ρ(∂θ ψ + η)θ˙ − ρ∂∇R α ψ · (FT ∇θ) − ρ(∂F ψFT ) · L + T · D − q · ∇θ ≥ 0. θ ˙ W, D, ∇θ. The arbitrariness of their values, indeThe left-hand side is linear in θ, pendent of θ, ∇R α, F, implies that η = −∂θ ψ, T = ρ∂F ψFT ,

∂F ψFT ∈ Sym, q = −ρF∂∇R α ψ.

By objectivity, ψ may depend on F via the invariant tensor C. Hence we let ˜ ∇R α, C). ψ = ψ(θ, Consequently it follows

and then

˜ F C = 2F∂C ψ˜ ∂F ψ = ∂C ψ∂ ˜ T. ∂F ψFT = 2F∂C ψF

The requirement ∂F ψFT ∈ Sym is apparently satisfied and hence ˜ T ∈ Sym, T = 2F∂C ψF as expected. The remaining conditions show that both η and q are derived by the free energy. Moreover, the entropy inequality results in an entropy equality and that is why we can say that this is a model of thermoelasticity without energy dissipation. Of course, the vanishing of dissipation (entropy equality) implies that no inequality holds for the constitutive equations. This seems to be a weak point of the scheme. To fix ideas, let ψ˜ = (θ, C) + γ(θ, C)(∇R θ)2 ;

6.6 Further Thermoelastic Models

377

no restriction follows from thermodynamics on the function γ(θ, C). Second-order waves Again we investigate the compatibility of the constitutive scheme with wave propagation at a finite speed though with a different characterization of second-order waves. Let σ be a time-dependent surface in the current configuration and denote by n the unit normal. The surface σ is a second-order wave if x, x˙ , F, α, α˙ = θ, ∇R α are con˙ ∇R θ, ∇R ∇R α and ˙ α¨ = θ, tinuous functions of x and t in a neighbourhood N of σ, x¨ , F, all higher-order derivatives suffer jump discontinuities across σ but are continuous functions of x and t in N \ σ. First we observe that if [[φ]] = 0 then [[∂t φ]] = −u n [[∂n φ]] and, since [[v]] = 0,

˙ = −U [[∂n φ]], [[φ]]

where ∂n = n · ∇ is the normal derivative, u n is the speed of displacement, and U is the local speed of propagation. Now, since [[F]] = 0 and F˙ = ∇R v = ∇vF then ˙ = [[∇v]]F = [[∂n v]] ⊗ nF, [[F]]

[[˙v]] = −U [[∂n v]],

[[∇R θ]] = [[∇θ]]F = [[∂n θ]]nF,

˙ = −U [[∂n θ]]. [[θ]]

Moreover [[ψ]] = 0,

[[η]] = 0,

[[T]] = 0,

[[q]] = 0.

We let a = [[∂n v]],

ϑ = [[∂n θ]]

be the representatives of the discontinuities at the wave. Assume [[b]] = 0 and [[r ]] = 0. Since [[ρ]] = 0 then the equation of motion results in ρ [[˙v]] = [[∇ · T]]. Now, since [[T]] = 0 then [[∇ · T]] = [[∂n T]]n = − Observe that

1 ˙ + ∂θ T[[θ˙ ]] + ∂∇R α T[[∇R θ]]}n. {∂F T[[F]] U

∂θ T[[θ˙ ]] = −U ∂θ Tϑ.

Moreover, in suffix notation

378

6 Rate-Type Models T ˙ {(∂F T[[F]])n} i = n j ∂ Fk P Ti j FP l n l ak =: Q ik ak = (Qa)i ,

{(∂∇R α T[[∇R θ]])n}i = n j ∂α, P Ti j [[θ,l ]]Fl P = n j ∂α, P Ti j FPTl n l ϑ =: ci ϑ. Hence the equation of motion implies that ρU 2 a = Qa + (c − U ∂θ T)ϑ.

(6.86)

We now investigate the consequences of the balance of energy. We have ρ[[ε]] ˙ = T · [[D]] − [[∇ · q]]. Observe that ˙ + ∂θ ε[[θ˙ ]] + ∂∇R α ε · [[∇R θ]] = (∂F εFT n) · a + (∂∇R α ε · n − U ∂θ ε)ϑ [[ε]] ˙ = ∂F ε · [[F]] and 1 1 ˙ + ∂∇R α q[[∇R θ]]} ˙ + ∂θ q[[θ]] ˙ · n = − {∂F q[[F]] [[q]] U U 1 = {(n∂F qFT n) · a + (n · ∂∇R α qFt n − U n · ∂θ q)ϑ}, U

[[∇ · q]] = [[∂n q]] · n = −

[[L]] = [[∇v]] = a ⊗ n,

T · [[D]] = (Tn) · a.

Hence it follows (ρ∂F εn −

1 1 n∂ qFT n − Tn) · a + [ρ(∂∇R α ε · n − U ∂θ ε) − n · ∂∇R α qFT n − U n · ∂θ q]ϑ = 0. U F U

(6.87)

Equations (6.86)–(6.87) constitute a linear homogeneous system in the unknowns a, ϑ. A very simple scheme arises by letting ψ = ψ1 (θ) + ψ2 (F) − 21 κ(∇R α)2 . This free energy function implies that c, ∂θ T, ∂F ε, ∂∇R α ε, and ∂θ q are zero. In addition let the body be free from stress and gradient of thermal displacement, T = 0, ∇R α = 0. The system simplifies to (ρU 2 1 − Q)a = 0,

(∂θ ε U 2 − κn · FFT n)ϑ = 0.

The system splits into two uncoupled equations. The first one is satisfied by acceleration waves (ϑ = 0) propagating with the speed U such that ρU 2 is an eigenvalue of Q. The second one admits wave solutions if κ > 0, which is not implied by thermodynamics. In this case purely thermal waves (a = 0) occur; they propagate with

6.6 Further Thermoelastic Models

379

local speed U given by U2 =

κ n · FFT n . ∂θ ε

Thermoelastic solid, type III Though within our approach and notation, we now review the scheme of heat conductor denoted as type III in [210]. Let θ, ∇R α, ∇θ, F be the set of variables and let ψ, η, T, q be given by constitutive functions. Differently from the thermoelastic solid without energy dissipation,10 now the dependence on ∇θ is considered. Upon substitution of ψ˙ in the second-law inequality ˙ + T · D − 1 q · ∇θ ≥ 0 −ρ(ψ˙ + η θ) θ we obtain ˙ − ρ(∂ ψFT ) · L + T · D − 1 q · ∇θ ≥ 0. −ρ(∂θ ψ + η)θ˙ − ρ∂∇R α ψ · FT ∇θ − ρ∂∇θ ψ · ∇θ F θ

˙ = ∇ θ˙ − L∇θ implies that The arbitrariness of ∇θ ∂∇θ ψ = 0. Hence the arbitrariness of θ˙ and L = W + D requires that η = −∂θ ψ,

∂F ψFT ∈ Sym,

T = ρ∂F ψFT .

Thus it follows the reduced inequality 1 (ρF∂∇R α ψ + q) · ∇θ ≤ 0. θ Since q depends on ∇θ, while ψ does not, then this inequality implies that ˜ q = −θρ F∂∇R α ψ + q, where q˜ is subject to the heat conduction inequality q˜ · ∇θ ≤ 0.

10

Denoted as type II in [210].

380

6 Rate-Type Models

To fix ideas let ψ = (θ, F) + 21 γ∇R α · ∇R α. and q˜ = − f ∇θ, where γ and f are functions of θ, F. Since ∇R α · ∇R α = ∇α · B∇α, where B = FFT , then q = −γθρB∇α − f ∇θ, and ε = ψ − θ∂θ ψ is a function of θ, ∇R α, F. The energy equation makes ρ∂θ εθ˙ − f θ be equal to first-order terms in α and θ thus making the equation for θ parabolic. The equation becomes hyperbolic if f = 0 that is if the dependence on ∇θ is ruled out, as it happens in the type II model.

Chapter 7

Materials with Memory

Real materials exhibit a continuing strain rate (creep) under persistent constant stress and decay of stress (relaxation) under constant strain. The analogue occurs for electric current or electric polarization and electric field. In many materials these effects are very pronounced while in others they may be negligible, the extent being possibly influenced by the temperature and being related to the time scale adopted. The theory of linear viscoelasticity encompasses those phenomena which can be described by a linear superposition of the effect of stress or strain. Viscoelasticity may be viewed as perhaps the leading model of material with long range, gradually fading memory. Electromagnetism in matter is likewise a subject described by fading memory. After some preliminaries on the mathematical framework for fading memory, this chapter develops relevant models of materials and investigates the thermodynamic consistency and the wave propagation properties. First thermoelastic materials (with fading memory) are modelled through histories and summed histories. Next the wide topic of viscoelasticity is developed; the linear viscoelastic solid with emphasis on thermodynamic restrictions and free energies; viscoelastic solids with unbounded relaxation functions; viscoelastic fluids and in particular Boltzmann-like and Maxwell-like models. Attention is then addressed to electromagnetic materials: dielectrics with memory, magneto-viscoelastic materials. The connection between causality and the Kramers–Kronig relations is established. Finally, the modelling of memory properties via derivatives of fractional order is investigated; with particular reference to heat conductors, conceptual difficulties are shown for the thermodynamic consistency and wave properties.

7.1 Materials with Fading Memory We consider functions on R × R and, to save writing, we usually omit writing R, as the space domain, and the pertinent point X ∈ R. A process is a function, say φ, on R with (scalar, vector, or tensor) values in a Hilbert space W : the symbol “·” denotes © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_7

381

382

7 Materials with Memory

its inner product and | · | the corresponding norm. If φ is a process and t ∈ R then we can define s ∈ R+ φt (s) = φ(t − s), and say that φt is the history of φ up to time t. The restriction of φt to (0, ∞) is called the past history of φ up to time t and denoted by φrt . For any history φt , φt (0) = φ(t) is called the present value of φt . Denote by φ(t0 )† the constant process with value φ(t0 ), t ∈ R, φ(t0 )† (t) = φ(t0 ), so that the constant history, up to time t0 , with value φ(t0 ), is given by φt0 † (s) = φ(t0 )† (t0 − s) = φ(t0 ),

s ∈ R+ .

When t > t0 the constant1 continuation of φt0 up to t, also denoted by φtt0 , is the history defined by φtt0 (s) =

 φt0 (0),

s ∈ [0, t − t0 ), φt0 (s − t + t0 ), s ∈ [t − t0 , ∞).

Alternatively, if τ = t − t0 then the constant continuation up to t0 + τ can be given as  φt0 (0), s ∈ [0, τ ), φtt00 +τ (s) = φt0 (s − τ ), s ∈ [τ , ∞). Similarly, for a given τ > 0 we define the τ –section φt(τ ) of φ t as φt(τ ) (s) = φ t (s + τ ),

s ∈ R+ .

Since, by definition, φ t(τ ) (s) = φ t−τ (s), then φ t(τ ) is the history up to the earlier time t − τ . Let H be the set of all admissible histories. It is required to contain all constant histories. Moreover, if φt ∈ H, then t for any τ > 0 both its constant continuation, φt+τ t , and its τ –section, φ(τ ) , belong to H. General materials with memory are those in which a set of variables, say ψ, at a point X ∈ R and time t, are functionals of the histories of independent variables, say φ, up to time t. The response function ψ takes (scalar, vector, or tensor) values in a Hilbert space V . Obviously, the principle of causality is implied: the response value 1

Not necessarily the statical continuation where φtt0 (s) = 0 as s ∈ [0, t − t0 ) [428].

7.1 Materials with Fading Memory

383

at an instant t, ψ(t), cannot depend on the values of the process φ subsequent to that instant. Hence we let (7.1) ψ(X, t) = (φt (X, ·)), where  : H → V is called response functional. For brevity we write ψ(t) = (φt ), it being understood that the response ψ and the history φt are considered at the same point X ∈ R. For inhomogeneous bodies the constitutive functionals are allowed to depend explicitly on the position X. For definiteness, in deformable non-isothermal bodies the response functionals provide the stress tensor T, the heat flux q, and the internal energy ε in terms of the histories of the deformation gradient F, the temperature θ, and the temperature gradient ∇θ. The study of these materials arises from the pioneering articles of Ludwig Boltzmann [55, 56] and Vito Volterra [436, 437]. However a statement like (7.1) raised some criticism in the scientific community due to the conceptual difficulty to accept the idea of a past history defined on an infinite time interval (when even the age of the universe is finite!). Really, the entire history of a quantity φ in a system can never be known. Then a constitutive relation of the memory type has an operative meaning only if additional assumptions are made. First, based on the model of viscoelasticity, the following property is usually assumed. Relaxation property. The functional  with values in a vector space V is said to have the relaxation property if lim (φtt0 ) = (φt0 † ).

t0 →−∞

In words, if φ is kept constant for a long time (t  t0 ) then the initial history of φ becomes inessential as the response ψ is concerned. Coleman and Mizel [100, 101] introduced the notion of fading memory. Precisely, they considered (7.1) with the further assumption that recent values of φt (s), with s around 0+ , rather than the remote values, s → ∞, have the main effect on the present response value ψ(t). In other words, the memory of the material is fading in time. Incidentally, this also gave the ultimate answer to the philosophical question of a memory of infinite duration. While the idea is quite intuitive, to make it operative, we need a precise mathematical statement. Indeed, the fading memory property can be established by endowing the space H of histories with a suitable norm. In essence, we first make a choice of the norm in the domain of  such that two histories are close if they are close in the recent past. Then we assume the functional to be smooth in this topology, which amounts to letting ψ(t) weakly dependent on remote values of φ. Fading memory property. The material is said to have fading memory if the constitutive functionals have continuous Fréchet derivatives relative to an appropriate norm  · H on a space H of histories φt . Specifically, the functional  : H → V

384

7 Materials with Memory

is n-times Fréchet differentiable, at φt ∈ H, if there exist bounded linear operators t L 1 , ..., L n , where L j : H ⊗ H ⊗ · · · ⊗ H → V , such that for any λ ∈ H j times

(φt + λt ) − (φt ) −

n  j=1

t Lj λ ... ⊗ λt V = o(λt nH ).  ⊗  j times

This mathematical statement is made operative by the choice of the normed space H.

7.1.1 Fading Memory Space of Histories Let k : R+ → R++ , k(0) = 1, be a function which decays to zero for large s in such a way that, k(s) = o(s −α ) as s → ∞ where α > 1. A function satisfying these conditions is called an influence function of order α (or obliviator) and characterizes the rate at which the memory fades. For definiteness, to specify the smoothness of the functional , we introduce a weighted L 2 -norm of φt , with weight k, defined as ∞ 1/2 φt k = ∫ |φt (s)|2 k(s)ds 0. When To determine φ(t the process φ is everywhere differentiable with respect to time, then for any t, h ∈ R, s ∈ R+ we can write ˙ − s)h + o(h). φ(t − s + h) = φ(t − s) + φ(t Let the time dependence of the response be given by ψ(t) = (φ(t), φrt ). To compute the time derivative of ψ we observe

7.1 Materials with Fading Memory

387

˙ ψ(t + h) − ψ(t) = (φ(t) + φ(t)h + o(h), φrt + φ˙ rt h + o(h)) − (φ(t), φrt ) ˙ + h d(φ(t), φt |φ˙ t ) + o(h). = h D(φ(t), φt )φ(t) r

r

r

In view of (7.2), the limit ˙ = lim 1 [(φ(t + h), φt+h ) − (φ(t), φt )] ψ(t) r r h→0 h yields

˙ = D(φ(t), φt ) · φ(t) ˙ + d(φ(t), φt | φ˙ t ). ψ(t) r r r

Then, for practical purposes, we assume that the histories φt are time differentiable and the response functional  is continuously differentiable in H. This is the mathematical content of the fading memory assumption.

7.1.2 Difference and Summed Histories A different approach2 regards the response functional as dependent, separately, on the present value φ(t) and the difference (or relative) history φtd , that is defined on R+ as the difference between the present value and the history values, φtd (s) := φt (0) − φt (s) = φ(t) − φ(t − s). In particular it follows that φtd (0) = 0 and φt = φt† + φtd ,

(7.4)

where φt† is the constant history with value φ(t). As a consequence, any constant history has a null relative history. In this case we can take the inner product and the associated norm in the form ∞

(φt , χt )∗ = φt (0) · χt (0) + φtd , χtd k = φ(t) · χ(t) + ∫ φtd (s) · χtd (s) k(s) ds 0



φt 2∗ = |φt (0)|2 + φtd 2k = |φ(t)|2 + ∫ |φ(t) − φ(t − s)|2 k(s) ds.

(7.5)

0

The Hilbert space H∗ endowed with this norm is the fading memory space of all pairs (φ(t), φtd ) belonging to W × Sk0 where Sk0 = { f ∈ L 2k (R+ , W ) : f (0) = 0}. 2

See, e.g. [102, 112].

388

7 Materials with Memory

The Like space H, H∗ contains all constant histories. Indeed, if φt (s) = φ0 , s ∈ R+ , then φt ∗ = |φ0 |. Nevertheless, it is very different from H. An advantage of this approach is that the history space H = {φt : R+ → W ; (φ(t), φtd ) ∈ H∗ } can be represented by virtue of (7.4) as a direct sum, H = W ⊕ Sk0 , where W is the (Hilbert) space of constant histories corresponding to W through an isometric isomorphism. Conversely, for any (χ0 , χ) ∈ H∗ there exists φt ∈ H such that φt (s) = χ0 + χ(s), s ∈ R+ . A bounded, linear functional L ∗1 : H∗ → R may be given the form of an inner product. For all histories λt ∈ H such that (λt (0), λtd ) ∈ H∗ there exists (g0∗ , g ∗ ) ∈ H∗ satisfying  L ∗1 [(λt (0), λtd )] = (g0∗ , g ∗ ), (λt (0), λtd ) H∗ , namely ∞

L ∗1 [(λt (0), λtd )] = g0∗ · λt (0) + ∫ g ∗ (s) · [λ(t) − λ(t − s)] k(s) ds. 0

Letting



G(s) = g0∗ + ∫ g ∗ (ξ)k(ξ) dξ s

s ∈ (0, +∞),

we can write the linear functional in the form ∞

L ∗1 [λt ] = G∞ · λ(t) − ∫ G (s) · [λ(t) − λt (s)] ds, 0

where G∞ = lims→∞ G(s) = g0∗ and G = −g ∗ k is subject to the condition ∞

∫ |G (s)|2 k −1 (s) ds < ∞. 0

At a fixed point X, the time derivative with respect to t of a relative, or difference, history φtd is defined by ˙ − φ (t − s)∂t (t − s) = φ(t) ˙ + φ (t − s)∂s (t − s) = φ(t) ˙ + ∂s φt (s), φ˙ td (s) := φ(t) d where the prime denotes differentiation with respect to the whole argument. From the assumption ψ(t) = (φtd ) it then follows that

7.1 Materials with Fading Memory

389

 t

t ˙ t† t

˙ ψ(t)=d(φ d | φ )+d φd ∂s φd (s) , ˙ where φ˙ t† is the constant history up to time t with value φ(t). It is easy to check that there exists a functional Jφ˙ such that 

˙ = Jφ˙ (φt )φ(t) ˙ + d φt ∂s φt (s) . ψ(t) d d d There are models where the history is involved through functionals of summed histories.3 Given the process φ on R and t ∈ R, we denote by φ¯ t : R+ → W the function t s s ∈ R+ , φ¯ t (s) := ∫ φ(ξ)dξ = ∫ φt (σ)dσ, t−s

0

and say that φ¯ t (s) is the summed history of φt on [0, s]. Of course, φ¯ t (0) = 0 and d ¯t φ (s) = φt (s). ds

(7.6)

We observe that its time derivative with respect to t is given by the relative history, d t ∫ φ(ξ)dξ = φ(t) − φ(t − s) = φtd (s). φ˙¯ t (s) = dt t−s From the assumption

ψ(t) = (φ¯ t )

it then follows that 

˙ ψ(t) = d(φ¯ t | φtd ) = d(φ¯ t | φt† ) − d(φ¯ t | φt ) = Jφ (φ¯ t )φ(t) − d φ¯ t ∂s φ¯ t (s) .

(7.7)

The fading memory space of all summed histories φ¯ t ∈ Sk0 is an Hilbert space ¯ It contains all constant histories provided that k is an influence function denoted by H. of order α > 2: if φt (s) = φ0 , s ∈ R+ , then φ¯ t† (s) = φ0 s and ∞ 1/2 φ¯ t† k = |φ0 | ∫ k(s)s 2 ds . 0

A bounded, linear functional L¯ 1 : H¯ → R may take the form of an inner product. For all summed histories λ¯ t ∈ Sk0 there exists g ∈ Sk0 satisfying ∞

L¯ 1 λ¯ t = ∫ g(s) · λ¯ t (s) k(s) ds. 0

3

See, e.g. [217].

390

7 Materials with Memory

Letting G = gk, we can write the linear functional in the form ∞



s

0

0

0

L¯ 1 λ¯ t = ∫ G (s) · λ¯ t (s) ds = ∫ G (s) · ∫ λ(t − σ)dσ ds, where G is subject to the condition G (0) = 0 and ∞

∫ |G (s)|2 k −1 (s) ds < ∞. 0

Upon an integration by parts the linear functional L¯ 1 transforms into a linear functional on Sk , namely ∞

ˇ · λ(t − s) ds. L¯ 1 λ¯ t = L 0 λt = − ∫ G(s) 0



ˇ where G(s) = − ∫ g(σ)k(σ) dσ. s

7.1.3 Properties About Histories and Norms Two properties about histories and norm are in order for later developments. We let φt be a history with scalar, vector, or tensor values as appropriate. Proposition 7.1 Given a time t, a history φt and an arbitrary value α, for every > 0 there exist histories φ˜ t such that ˙˜ φ(t) = α,

˜ = φ(t), φ(t) φ˜ t − φt k < ,

φ˙˜ t − φ˙ t k < .

Proof To show this property (see [94]), for any fixed t we consider the process4 ˜ ) = φ(τ ) − f (t − τ )(α − φ(t)), ˙ φ(τ

τ ∈ (−∞, t],

and the corresponding history ˙ φ˜ t (s) = φt (s) − f (s)(α − φ(t)),

s ∈ [0, ∞).

˜ = φ(t). Now, Letting f (0) = 0 we find φ(t) ˙ φ˙˜ t (s) = φ˙ t (s) + f (s)(α − φ(t)). 4

˙ is given by (7.3). Here φ(τ ) = φt (t − τ ) and φ(t)

7.1 Materials with Fading Memory

Hence

391

˙˜ φ(t) = f (0) α

˙˜ and the requirement φ(t) = α is satisfied by letting f (0) = 1. The pertinent norms are given by ∞ ˙ 2 ∫ f 2 (s)k(s)ds, φ˜ t − φt 2 = |α − φ(t)| k

0



˙ 2 ∫ f 2 (s)k(s)ds. φ˙˜ t (s) − φ˙ t (s)2k = |α − φ(t)| 0

We can pick f such that the integrals in the right-hand sides are as small as we please. For definiteness, let f be given by f (s) = b[1 − exp(−s/b)], so that f (0) = 0, f (0) = 1. As a consequence, ∞



0

0

∫ f 2 (s)k(s) ≤ b2 ∫ k(s)ds,

and, by the Cauchy–Schwarz inequality, ∞ 1/2 ∞ 2 1/2 ∞ 1/2 ∫ f 2 (s)k(s)ds ≤ ∫ exp(−4s/b)ds ∫ k (s)ds = b/4 ∫ k 2 (s)ds .

∞ 0

0

0

0

Since k(s) = o(s −1−δ ) then the integrals of k and k 2 are finite. Hence the choice of a suitably small b makes φ˜ t − φt k and φ˙˜ t − φ˙ t k smaller than the given .  Proposition 7.2 Given a time t and a history φt , for every > 0 there exists a history φ˜ t such that φ˜ t − φt k < , whereas d(φ˜t |φ˙˜ t ) takes arbitrary values. Proof Let λ ∈ R and

φ˜ t = φt + λϕt .

A suitably small value of λ makes λϕt k as small as we please. Now, d(φ˜t |φ˙˜ t ) = d(φ˜t |φ˙ t ) + λd(φ˜t |ϕ˙ t ). If ϕt (s) depends on s via s/λ, namely ϕt (s) = f (s/λ), then λd(φ˜t |ϕ˙ t ) = −d(φ˜t | f )

392

7 Materials with Memory

may take arbitrary values depending on f .

 Likewise we can prove that given two histories θ , φ the functionals d(θ , φ |θ˙t ) and d(θt , φt |φ˙ t ) can be regarded as arbitrary and independent of each other. t

t

t

t

7.2 Thermoelastic Materials with Memory A large domain of materials is accomplished by letting the independent variables be the histories Ft , θt , ∇R θt of the deformation gradient F, the temperature θ, and the material temperature gradient ∇R θ. As is well investigated in the literature,5 the dependence on the present value of ∇θ is not compatible with wave propagation at a finite speed. It is then convenient to allow for both possibilities, namely for the set of variables (F(t), θ(t), ∇R θ(t), Frt , θrt , ∇R θrt ) or (F(t), θ(t), Frt , θrt , ∇R θrt ). Not any arbitrary constitutive functional is admissible; the constitutive equations are subject to the principle of objectivity and the entropy inequality. We start to analyse the restrictions due to the principle of objectivity. For definiteness let the Cauchy stress T be objective and denote by Tˆ the corresponding response functional. Let x = χ(X, t) be a motion and ˆ t (X)) T(X, t) = T(F be the corresponding tensor response functional. For a given change of frame, the corresponding motion is given by6 x∗ = χ∗ (X, t ∗ ) = c(t) + Q(t)χ(X, t),

t ∗ = t − a,

where c is an arbitrary vector function, Q an arbitrary orthogonal tensor function, and a an arbitrary real number. Then, the associated tensor response functional is T∗ (X, t ∗ ) = Q(t)T(X, t)QT (t). For formal simplicity let a = 0. Then, t ∗ = t and χ∗ t (X) = ct + Qt χt (X) Objectivity requires that 5 6

See, e.g. [402]. See (1.54).

F∗ t (X) = Qt Ft (X).

7.2 Thermoelastic Materials with Memory

393

ˆ t (X))QT (t) = T(Q ˆ t Ft (X)), Q(t)T(F namely the history Qt Ft , with values Qt (s)Ft (s) = Q(t − s)F(t − s), produces a tensor that is the transform of ˆ t (X)). T(F To save writing, we now omit specifying the obvious dependence on the selected point X. By the polar decomposition theorem we can write F(t − s) = R(t − s)U(t − s), R being orthogonal. By the arbitrariness of Q we let Q(t − s) = R T (t − s). Hence we have T t t ˆ t Ft )Q(t) = R(t)T((R ˆ ˆ t ) = QT (t)T(Q ) F )R T (t) T(F T t t t ˆ = F(t)U−1 (t)T((R ) R U )U−1 (t)FT (t).

Since (R T )t Rt = 1 and C = U2 , letting ˆ t )U−1 (t), T (Ct ) := U−1 (t)T(U we have

ˆ t ) = F(t)T T(Ct )FT (t). T(F

Apart from the occurrence of the present values F(t) and FT (t) that operate on the components, objectivity requires that the functional dependence of T on the deformation history Ft is through Ct . We notice that materials with memory are often modelled within the material (Lagrangian) description. Indeed, the history of a given physical variable is naturally considered on the selected point X; different points may have different histories. A further advantage is that the material gradient ∇R θ(t) is an invariant vector and so is its past history ∇R θrt for any time t. We now evaluate the restrictions on the constitutive functionals placed by the entropy inequality. In the material formulation we have ˙ + 1 TR R · C ˙ − 1 q R · ∇R θ + θ∇R · k R ≥ 0. −(ψ˙ R + η R θ) 2 θ We let ψ R , η R , T R R , q R , k R , at the position X ∈ R and time t, be given by functionals , N , T, Q, K on (t) = (C(t), θ(t), ∇R θ(t), Crt , θrt , ∇R θrt ), or on 0 (t) = (C(t), θ(t), Crt , θrt , ∇R θrt ), at X. Let k on R+ be an influence function. The H-norm of  is taken in the form

394

7 Materials with Memory

∞ 1/2 (t)H = [C · C + θ2 + (∇R θ)2 ]1/2 (t) + ∫ [Ct · Ct + (θt )2 +∇R θt · ∇R θt ](s)k(s)ds 0

or the analogous norm of 0 without the present value (∇R θ)2 (t). Let  and K be continuously differentiable while N , T , Q are continuous. Hence ˙ + ∂θ ()θ˙ + ∂∇R θ () · ∇R θ˙ + d(|C ˙ rt ) + d(|θ˙ rt ) + d(|∇R θ˙ rt ). ψ˙ R = ∂C () · C

Upon substitution the entropy inequality takes the form ˙ − ρ∂∇R θ () · ∇R θ(t) ˙ + ( 1 T () − ∂C ()) · C(t) ˙ −(∂θ () + N ())θ(t) 2 ˙ rt ) − d(|θ˙rt ) − d(|∇R θ˙rt ) − 1 Q () · ∇R θ(t) + θ∇R · K () ≥ 0. −d(|C θ To determine the consequences of this inequality we need to establish the arbitrariness ˙ ∇R θ˙ at time ˙ θ, associated with the histories. By Proposition 7.1, we can say that C, t may take arbitrary values while the present values C, θ, ∇R θ, the past histories Crt , θrt , ∇R θrt and their derivatives affect the functionals as small as desired. The ˙ θ˙ and ∇R θ˙ requires that the inequality holds only if arbitrariness of C, T () = 2∂C (),

N () = −∂θ (),

∂∇R θ () = 0.

(7.8) (7.9)

The mutual independence of the past histories Crt , θrt , ∇R θrt (see Proposition 7.2) and ˙ rt , θ˙rt imply that the remaining inequality holds the linearity and the arbitrariness of C only if ˙ rt ) ≤ 0, d(|θ˙rt ) ≤ 0, (7.10) d(|C 1 ρd(|∇R θ˙rt ) + Q () · ∇R θ(t) − θ∇R · K () ≤ 0. θ

(7.11)

Remark 7.1 If 0 is involved rather than , we simply find the relations (7.8), (7.10), and (7.11). Hence the difference is revealed only in the expression of the functional Q , which depends on the actual value ∇R θ(t) or not. In terms of referential quantities, σ R = η˙ R −

ρR r + ∇R · (q R /θ) + ∇R · k R θ

is the (rate of) entropy production per unit volume in the reference configuration and the content of the second law is that σ R ≥ 0 for every process consistent with the balance equations. By the balance of energy we have

7.2 Thermoelastic Materials with Memory

395

1 ˙ + 1 TR R · C ˙ − 1 q R · ∇R θ + ∇R · k R . σ R = − (ψ˙ R + η R θ) θ 2θ θ2 In view of (7.8) we find σR = γR −

1 q R · ∇R θ, ρθ2

where 1 ˙ rt ) + d(|θ˙rt ) + d(|∇R θ˙rt )] + ∇R · k R (t). γ R (t) = − [d(|C θ The entropy production σ R consists of two terms; the former, γ R , is called the internal dissipation and involves the dependence on the past histories Crt , θrt , ∇R θrt , while the latter, −(1/ρθ2 )q R · ∇R θ, represents the dissipation due to heat conduction. Since σ R ≥ 0, we can write

1 q R · ∇R θ ≤ γ R . θ2

In general, from this inequality we cannot conclude that q R · ∇R θ ≤ 0 whereas it is true that γ R = σ R ≥ 0 if ∇R θ = 0. Hereafter we let k R = 0 and this will prove to be consistent with the local nature of the constitutive equations. By (7.10) and (7.11) it follows that ˙ − 1 q R · ∇R θ. ψ˙ R ≤ −η R θ˙ + 21 T R R · C θ Hence

˙ = 0, θ˙ = 0, ∇R θ = 0 C

=⇒

ψ˙ R ≤ 0.

(7.12)

˙ = 0 and ∇R θ(t) = 0 as t ≥ t0 , ˙ At a fixed point X, if it happens that C(t) = 0, θ(t) then the free energy cannot increase, regardless of the histories Ct0 , θt0 , ∇R θt0 . Let φ be the triplet φ = (C, θ, ∇R θ) and, for any history φt0 , consider the constant continuation φtt00 +d (s) =

 φ(t ), s ∈ [0, d), 0 φ(t0 − s + d), s ∈ [d, ∞),

396

7 Materials with Memory

subject to ∇R θ(s) = 0 as s ∈ [0, d). Hence, for any d > 0, from (7.12) we have (φtt00 +d ) ≤ (φt0 ). Moreover, ∞



d

d

φt0 † − φt0 +d k = ∫ |φ(t0 ) − φ(t0 − s + d)|2 k(s)ds ≤ |φtd0 |2max ∫ k(s)ds, where |φtd0 |max is the maximum of |φ(t0 ) − φ(t0 − τ )| over τ ∈ [0, ∞). Since the integral of k on [0, ∞) is finite, it follows that ∞

lim ∫ k(s)ds = 0.

d→∞ d

Hence φt0 +d approaches φt0 † as d → ∞. By the continuity of the functional  we have (7.13) (φt0 † ) = lim (φt0 +d ) ≤ (φt0 ). d→∞

This result may be phrased as follows. Proposition 7.3 Let t be a given time. Among all histories φt = (Ct , θt , ∇R θt ) ending with the given values C(t) = C, θ(t) = θ and ∇R θ(t) = 0, the history corresponding to those constant values for all times, namely φ† = (C† , θ† , 0† ), has the least free energy. Equations (7.8)–(7.11) give the restrictions placed by the second law on the constitutive functionals. The next sections examine them in connection with models of rigid heat conductors and linear viscoelastic solids. Throughout these sections we will continue to assume that the extra-entropy flux vanishes, k R = 0.

7.3 Rigid Heat Conductors Due to the rigid character of the body we identify the position vector x, of a point, with the reference position vector X and hence we let ∇R = ∇. Then we put ψ R = ρψ, ε R = ρε, η R = ρη, q R = q

7.3.1 Models Dependent on Thermal Histories Consistent with the choice of 0 , as the set of independent variables, and the thermodynamic restrictions (7.8)–(7.11), a simple model of (rigid) heat conductor with

7.3 Rigid Heat Conductors

397

memory is established by restricting the independent variables to the present temperature and the past history of the temperature and the temperature gradient. Hence we let ψ(t) = (θ(t), θrt , ∇ θrt ) and, according to (7.8) and (7.11), η(t) = N (θ(t), θrt , ∇ θrt ) = −∂θ (θ(t), θrt , ∇ θrt ), 1 Q (θ(t), ∇ θrt ) · ∇θ(t) ≤ 0, ρθ (7.14) it being again understood that all quantities are evaluated at the same point X. For definiteness we might look at the model arising from the Maxwell–Cattaneo (MC) equation7 κ 1 q˙ + q = − ∇ θ, τ τ d(θ(t), θrt , ∇ θrt | θ˙rt ) + d(θ(t), ∇ θrt |∇ θ˙rt ) +

where τ and κ are assumed to be constants. By a direct integration we have q(t) = −

κ∞ ∫ exp(−s/τ )∇ θ(t − s)ds, τ 0

whence Q (∇ θrt ) = −

κ∞ ∫ exp(−s/τ )∇ θ(t − s)ds. τ 0

More generally, though still with a linear dependence on ∇θt , we allow for a possible anisotropy and let ∞

Q (∇ θrt ) = − ∫ K(s)∇ θ(t − s)ds.

(7.15)

0

We assume K(s) = K0 β(s), 1/2

where K0 is positive definite so that we can define K0 . In addition, let β(s) ≥ 0, β(0) = 1 and β(s) → 0 as s → ∞. We now have to find the (or a) functional  consistent with (7.14) and (7.15). For simplicity, we disregard the dependence on the temperature past history. First we look for  in a double-integral form; we let

1/2 ∞

2 (θ(t), ∇ θrt ) = ψ0 (θ(t)) + 21 α(θ(t)) K0 ∫ β(s)∇ θ(t − s)ds , 0

7

See [319].

(7.16)

398

7 Materials with Memory

the function α being so far undetermined. Hence ∞



0

0

˙ − ξ)dξ. d(θ(t), ∇ θrt |∇R θ˙rt ) = α(θ(t))K0 ∫ β(s)∇ θ(t − s)ds · ∫ β(ξ)∇ θ(t ˙ − s) = −∂s ∇ θ(t − s) and an integration by parts result The observation that ∇ θ(t in ∞ ∞ d = α(θ(t)) ∫ β(s)∇ θ(t − s)ds · K0 ∇ θ(t) + ∫ β (ξ)∇ θ(t − ξ)dξ . 0

0

By substitution of d and Q in (7.14) we have the inequality ∞ ∞ ∞  1 α− ∇ θ(t) · K0 ∫ β(s)∇ θt (s)ds + α ∫ β(s)∇ θt (s)ds · K0 ∫ β (ξ)∇ θt (ξ)dξ ≤ 0. ρθ 0 0 0

Since, for any given ∇ θ(t), we may take the past history ∇ θrt arbitrarily, it follows that ∞ ∞ 1 ∫ β(s)∇ θ(t − s)ds · K0 ∫ β (ξ)∇ θ(t − ξ)dξ ≤ 0. , α= ρθ 0 0 By the positive definiteness of K0 , the assumptions on β and the arbitrariness of the past history ∇ θrt it follows that the previous inequality is generally satisfied provided that β (s) ≤ −λβ(s), s ∈ R+ , λ > 0. A finite sum of decaying exponentials in time, β(s) =

n

h=1 ch

exp(−λh s), ch > 0, λh > 0.

satisfies this condition by letting λ = minh λh > 0 and observing that   β (s) = − nh=1 ch λh exp(−λh s) ≤ −λ nh=1 ch exp(−λh s) = −λβ(s). According to this choice, ∞

q = −K0 ∫ 0

ψ = ψ0 (θ) +

h ch

exp(−λh s)∇ θ(t − s)ds,

2 1

1/2 ∞  K0 ∫ h ch exp(−λh s)∇ θ(t − s)ds

2ρθ 0

(7.17)

are constitutive functionals consistent with the second law. This in turn implies that the entropy takes the form η = −ψ0 (θ) +

2 1

1/2 ∞  K0 ∫ h ch exp(−λh s)∇ θ(t − s)ds . 2 2ρθ 0

(7.18)

7.3 Rigid Heat Conductors

399

Consistent with the general result (7.13), the free energy has a minimum at constant histories with zero past history of ∇ θ. Without changing the functional (7.15) for q, we might look for a different free energy functional in the single-integral form (θ(t), ∇ θrt ) = ψ0 (θ(t)) −

1 2



ˆ θ(t − s)ds, ∫ β 2 (s)∇ θ(t − s) · K∇ 0

ˆ is to be determined. Since where the tensor K ∞

˙ − s)ds d(θ(t), ∇ θrt |∇ θ˙rt ) = − ∫ ∇ θ(t − s) · β 2 (s)K∗ ∇ θ(t 0

˙ − s) = −∂s ∇ θ(t − s) and an integration by parts then the observation that ∇ θ(t yield ∞ ˆ θt (s) ds d = 21 ∫ β 2 (s)∂s ∇ θt (s) · K∇ 0

=

− 21 ∇



ˆ θ(t) − ∫ β(s)β (s)∇ θt (s) · K∇ ˆ θt (s)ds. θ(t) · K∇ 0

By substitution of d and Q in (7.14) we have the inequality ∞

− 21 ∇θ(t) · K∗ ∇θ(t) − ∫ β(s)β (s)∇ θt (s) · K∗ ∇ θt (s)ds 0



∞ 1 ∇ θ(t) · K0 ∫ β(s)∇ θt (s) ≤ 0. ρθ 0

For any given vector ∇ θ(t) we may take the past history ∇ θrt arbitrarily, so that the ˆ be last two integrals are as small as we please. The inequality then implies that K positive definite. Therefore, if now we take ∇ θ(t) = 0 then we conclude that β(s)β (s) ≥ 0. This implies that β 2 is increasing, contrary to the assumption that β(s) → 0 as s → ∞. Hence only the double-integral representation of the free energy is admissible when models dependent on thermal histories are concerned. Incidentally, as a generalization, we might consider a deformable heat conductor. By letting (C(t), θ(t), θrt , ∇R θrt ) be the set of independent variables then we find the relations (7.8) and the reduced inequality (7.14). We can parallel the previous procedure up to equations (7.17) and obtain the free energy in the form ψ R = (C, θ, ∇R θrt ) = 0 (C, θ) +

2 1

1/2 ∞  K0 ∫ h ch exp(−λh s)∇R θ(t − s)ds . 2ρθ 0

400

7 Materials with Memory

Consistent with the choice of  as independent variables of the rigid heat conductor, we might allow for the dependence on the present value of the temperature gradient, ∇θ(t). As a consequence of (7.9), we would find that  is independent of ∇θ(t) and exploiting the entropy inequality we end up with the previous result (7.16) for . Instead of (7.15), however, we let ∞

˜ θ(t) − K0 ∫ β(s)∇ θ(t − s)ds, q(t) = Q (∇ θ(t), ∇ θrt ) := −K∇

(7.19)

0

where β(s) ≥ 0, β(0) = 1 and β(s) → 0 as s → ∞. Accordingly, by replacing d and Q into (7.14) we obtain the inequality ∞  1 ˜ ∇ θ(t) · K0 ∫ β(s)∇ θt (s)ds − ∇θ(t) · K∇θ(t) α− ρθ 0 ∞



0

0

+α ∫ β(s)∇ θt (s)ds · K0 ∫ β (ξ)∇ θt (ξ)dξ ≤ 0. ˜ are positive definite and which is satisfied provided that K0 and K α=

1 , ρθ

β (s) ≤ −λβ(s), s ∈ R+ , λ > 0.

We finally observe that a free energy functional in the single-integral form, as ∞

˜ θ(t − s)ds, (θ(t), ∇ θrt ) = ψ0 (θ(t)) + α(θ(t)) ∫ β 2 (s)∇ θ(t − s) · K∇ 0

is compatible with (7.19) and the entropy inequality provided that α is properly chosen. Indeed, in this case (7.14) takes the form ∞  1 ˜ ˜ θt (s)ds ∇θ(t) · K∇θ(t) + 2α ∫ β(s)β (s)∇ θt (s) · K∇ α− ρθ 0 ∞ 1 − ∇ θ(t) · K0 ∫ β(s)∇ θt (s) ≤ 0. ρθ 0

Since for any given vector ∇ θ(t) we may take the past history ∇ θrt arbitrarily, the inequality then implies α = 1/ρθ and ∞



0

0

˜ θt (s)ds − ∇ θ(t) · K0 ∫ β(s)∇ θt (s) ≤ 0. 2 ∫ β(s)β (s)∇ θt (s) · K∇ Therefore, taking ∇ θ(t) = 0 we conclude that β(s)β (s) ≤ 0, which is consistent with the assumption that β(s) > 0 and β(s) → 0 as s → ∞.

7.3 Rigid Heat Conductors

401

Temperature-rate waves The next section shows a model of heat conductors in terms of summed histories [217]. Perhaps the main motivation for the introduction of summed histories was the observation that the classical equation of heat conduction, ∂t θ = αθ,

α > 0,

is parabolic and the solution at any point in the body is felt immediately after at every other point (infinite speed of propagation). Really, also the model in terms of histories is adequate to find a finite speed of propagation [319]. Thermal waves are deferred to Sects. 7.8.3 and 7.6.2 where the models involved are developed in strictly analogous schemes of rate-type materials.

7.3.2 Models Dependent on Summed Thermal Histories Following [217] we take as independent variables the present value of temperature θ(t), the summed history of temperature θ¯t and the summed history of the material temperature gradient ∇ θ¯t , t

θ¯t (s) = ∫ θ(ξ)dξ, t−s

t

∇ θ¯t (s) = ∫ ∇θ(ξ)dξ. t−s

Let ψ, η, q, at the position x and time t, be given by functionals , N , Q on θ (t) = (θ(t), θ¯t , ∇ θ¯t ), at x ≡ X. In addition  is assumed to be continuously differentiable while N and Q are continuous. The functionals , N , Q of ψ, η, q are subject to the entropy inequality ˙ − 1 q · ∇θ ≥ 0, −(ψ˙ + η θ) θ where we let ρ = 1 for simplicity. Now, d ¯t θ (s) = θ(t) − θt (s) = θdt (s), dt

d ¯t ∇ θ (s) = ∇θ(t) − ∇θt (s) = ∇θdt (s). dt

Hence, owing to (7.7), t† t t† t ˙ ˙ ψ(t)=∂ θ (θ )θ(t)+d(θ |θ )−d(θ |θ ) + d(θ |∇θ ) − d(θ |∇θ ).

Let Jθ and J∇θ be the functionals such that

402

7 Materials with Memory

d(θ |θt† ) = Jθ (θ )θ(t),

d(θ |∇θt† ) = J∇θ (θ ) · ∇θ(t).

(7.20)

The entropy inequality can be written as  1 t t ˙ −(∂θ +N )θ(t)−J θ θ(t)− J∇θ + Q · ∇θ(t) + d(θ |θ ) + d(θ |∇θ ) ≥ 0. θ For any values of θ(t) and ∇θ(t) we can choose histories θt and ∇θt such that d(θ |θt ) and d(θ |∇θt ) are as small as we please. In addition, the dependence ˙ and ∇θ(t) is linear and arbitrary. Hence the entropy inequality holds if and on θ(t) only if N = −∂θ ,

Q = −θJ∇θ ,

Jθ (θ )θ(t) − d(θ |θt ) − d(θ |∇θt ) ≤ 0.

Hence, while the entropy is given by the classical partial derivative of the free energy, the entropy flux Q /θ is given by the differential of the free energy relative to constant histories of the temperature gradient. Quasilinear approximation Let θ∗ be some given absolute temperature and let u ∈ U = (−∞, 1) denote the dimensionless variable θ − θ∗ u= θ such that u = 0 at θ = θ∗ . By means of the ballistic energy (sometimes referred to as the exergy potential at constant pressure) y = ε − θ∗ η = ψ + (θ − θ∗ )η , the local form of the entropy inequality reads (ρ = 1) y˙ ≤ −∇ · (uq) + ur .

(7.21)

Moreover, exploiting the dependence of y on θ through u ∂θ y = (θ − θ∗ )∂θ η,

∂θ2 y = ∂θ η + (θ − θ∗ )∂θ2 η

Since ∂θ η = c/θ, where c > 0 is the specific heat, y has a local minimum with respect to temperature at θ = θ∗ . By introducing the rescaled free energy [9] ψ∗ = y − u ε =

θ∗ ψ θ

and observing that ε = −∂u ψ∗ , inequality (7.21) can be written as

7.3 Rigid Heat Conductors

403

ψ˙ ∗ + ε u˙ ≤ −q · ∇u . Hereafter we investigate this inequality for small values of u, that is, in a neighbourhood of the temperature θ∗ . More precisely, we assume sup |u(t)| < δ , sup ∇u(t) < δ , t∈R

t∈R

from which the following approximations are obtained θ = θ∗ (1 + u) + O(δ 2 ) , ∇θ =

θ∗ ∇u = θ∗ ∇u + O(δ 2 ) . (1 − u)2

(7.22)

Accordingly, we introduce the summed past histories u¯ t and ∇ u¯ t τ

u¯ t (τ ) = ∫ u(t − s) ds , 0

τ

∇ u¯ t (τ ) = ∫ ∇u(t − s) ds , 0

belonging to the Hilbert spaces ¯ = 0}, H∇u := {∇ u¯ ∈ L 2k2 (R+ , R3 ) : ∇ u(0) ¯ = 0} Hu := {u¯ ∈ L 2k1 (R+ , U ) : u(0) respectively. In this approximation scheme, the thermal variables (θ(t), θ¯t , ∇ θ¯t ), can be replaced by (u(t), u¯ t , ∇ u¯ t ) whose norm in R × Hu × H∇u is of order O(δ), namely (7.23) sup |u(t)|2 + u¯ t 2Hu + ∇ u¯ t 2H∇u ≤ Cδ 2 . t∈R

Within this range, we are looking for thermodynamic restrictions. Letting u (t) = (u(t), u¯ t , ∇ u¯ t ) ˆ η, ˆ ψˆ∗ , qˆ on u , we obtain and assuming that ε, η, ψ∗ , q are given by functionals ε, ψ˙ ∗ (t)=∂u ψˆ ∗ (u )u(t) ˙ + Ju (u ) u(t) + J∇u (u ) · ∇u(t) − d ψˆ ∗ (u |u t ) − d ψˆ ∗ (u |∇u t ),

where Ju and J∇u have the analogue meaning of Jθ and J∇θ in (7.20), respectively. As a consequence, the entropy inequality yields (∂u ψˆ ∗ + ε)u(t) ˙ + (q + J∇u ) · ∇u(t) + Ju u(t) − d ψˆ ∗ (u |u t ) − d ψˆ∗ (u |∇u t ) ≤ 0 , This implies

εˆ = −∂u ψˆ ∗ , qˆ = −J∇u ,

(7.24)

Ju (u )u(t) − d ψˆ∗ (u |u t ) − d ψˆ ∗ (u |∇u t ) ≤ 0.

(7.25)

404

7 Materials with Memory

In order to obtain a quasilinear theory, according to the approximation assumptions (7.23) we consider constitutive relations for ε and q which depend at most linearly on u , ˆ u ) = r u(t) + P [u¯ t ] + Q [∇ u¯ t ] ε( ˆ u ) = c u(t) + F[u¯ t ] + G[∇ u¯ t ], q( where ∞



F[u¯ t ] = ∫ α(s)u¯ t (s) ds , P [u¯ t ] = ∫ κ(s)u¯ t (s) ds , 0



0



G[∇ u¯ ] = ∫ β(s) · ∇ u¯ (s) ds , Q [∇ u¯ ] = ∫ K(s)∇ u¯ t (s) ds , t

t

t

0

0

are linear functionals on Hu and H∇u , respectively. Here K is tensor valued, whereas β and κ are vector valued. In particular, if the material is homogeneous and isotropic we can prove that the general form of linear constitutive relations is ∞



ε(t) = ε0 + c u(t) + ∫ α(s)u¯ t (s) ds,

q(t) = − ∫ K (s)∇ u¯ t (s) ds

0

(7.26)

0

Indeed, by virtue of the isotropy of the material, the constitutive relations are subject to G = 0 , r = 0 , P = 0 , K = −K 1 . In steady-state heat conduction, only constant histories are considered, for instance u t (s) = u t† . Then q(t) = −k ∇u(t) ε(t) = c0 u(t), ∞



0

0

where c0 = c + ∫ α(s)s ds > 0 and k = ∫ K (s)s ds > 0 represent the stationary specific heat and thermal conductivity, respectively. In view of (7.24) and (7.26) the functional ψ∗ can be represented as ∞

ψ∗ (u ) = − 21 c u 2 (t) − u(t) ∫ α(s)u¯ t (s) ds 0



1 2



∫ α (s)[u¯ t (s)]2 ds + 0

1 2



∫ K (s)∇ u¯ t (s) · ∇ u¯ t (s) ds. 0

We now prove that the reduced inequality (7.25) holds. First, in light of (7.6) we observe that ψ∗ takes the form ∞

ψ ∗ (u ) = − 21 c u 2 (t) − ∫ α(s)u¯ t (s)[u(t) − u t (s)] ds + 0

Hence, for any values of u(t) and ∇u(t)

1∞ ∫ K (s)∇ u¯ t (s) · ∇ u¯ t (s) ds. 2 0

7.3 Rigid Heat Conductors

405 ∞



0

0

Jθ (u ) u(t) := dψ∗ (u |u t† ) = −u 2 (t) ∫ α(s) ds + u(t) ∫ α(s)u t (s) ds, ∞

J∇u (u ) · ∇u(t) := dψ∗ (u |∇u t† ) = ∫ K (s)∇ u¯ t (s) ds · ∇u(t). 0

Moreover we have ∞



0

0

d ψˆ∗ (u |u t ) = −u(t) ∫ α(s)u t (s) ds + ∫ α(s)[u t (s)]2 ds and, by (7.6), ∞ ∞ d d ψˆ∗ (u |∇u t ) = ∫ K (s)∇ u¯ t (s) · ∇ u¯ t (s) ds = − 21 ∫ K (s)∇ u¯ t (s) · ∇ u¯ t (s) ds. ds 0 0

Consequently, the reduced inequality (7.25) can be written as ∞

− ∫ α(s)[u(t) − u t (s)]2 ds + 0

1 2



∫ K (s)∇ u¯ t (s) · ∇ u¯ t (s) ds ≤ 0 0

and is satisfied provided that α(s) ≥ 0 and K (s) ≤ 0, s ∈ R+ . Since u¯ t (0) = ∇ u¯ t (0) = 0, an integration by parts transforms (7.26) into linear functionals on the past histories, namely ∞

ε(t) = c u(t) + ∫ m α (s)u t (s) ds, 0

where



m α (s) = ∫ α(σ) dσ, s



q(t) = − ∫ m K (s)∇u t (s) ds,

(7.27)

0



m K (s) = ∫ K (σ) dσ. s

7.3.3 Moore–Gibson–Thompson Temperature Equation Applying (7.27) and assuming that m α (0) and m K (0) are bounded, the linearized energy balance equation, ∂t ε + ∇ · q = 0, gives ∞



0

0

c ∂t u(t) + ∫ m α (s)∂t u t (s) ds = ∫ m K (s)u t (s) ds, where u  (θ − θ∗ )/θ∗ is the dimensionless relative temperature (see (7.22)). Upon some rearrangements this equation can be written as ∞



0

0

c ∂t2 u(t) + m α (0)∂t u(t) + ∫ m α (s)∂t u t (s) ds = m K (0)u(t) + ∫ m K (s)u t (s) ds.

406

7 Materials with Memory

Since m α (0), m K (0) > 0, this equation shows √ a dissipative hyperbolic character and is associated with the constant wave speed m K (0)/c. In the special case where the heat flux and the internal energy kernels are in exponential forms, m α (s) = m α (0) exp(−s/λα ),

m K (s) = m K (0) exp(−s/λ K ),

λα , λ K > 0,

from (7.27) it follows that λα ∂t ε(t) + ε(t) = cλα ∂t u(t) + [λα m α (0) + c]u,

λ K ∂t q(t) + q(t) = −λ K m K (0)∇u(t).

By eliminating ε and q from these equations and the linearized energy balance we find ([242], Eq. (5.7))  1 1 2 1 c 1 c ∂t3 u + m α (0) + c + m K (0)u. ∂t u + m α (0) + ∂t u = m K (0)∂t u + λK λα λK λα λα

This equation is a third-order hyperbolic equation since discontinuities propagate √ with constant speed m K (0)/c. Recently, a similar result has been obtained in [370] by introducing a relaxation parameter in the Green–Naghdi type III model. The resulting constitutive equation of the heat flux vector is τ ∂t2 q(t) + ∂t q(t) = k∗ ∇θ(t) + k∇∂t θ(t) Comparing this relation with the linearized energy balance, with ε = c θ, we have τ c ∂t3 θ + c ∂t2 u = k∂t θ(t) + k∗ θ(t),

(7.28)

√ so that thermal waves propagate with finite speed k/τ c. This equation represents the linear version of a third-order wave equation describing the nonlinear propagation of sound in nonlinear acoustics, a model which is known in the literature as the Jordan–Moore–Gibson–Thompson equation [240, 247]. Hence, (7.28) is referred to as Moore–Gibson–Thompson temperature equation in [370] and its connection with linear viscoelasticity is outlined in [125].

7.4 Linear Viscoelasticity Linear viscoelasticity8 traces back to Boltzmann [55] who gave a linear theory based on the view that, at any point x ∈  ⊂ E at a time t, the stress depends upon the strains which the point has experienced prior to time t. The influence of a previous 8

See, e.g. [158, 276].

7.4 Linear Viscoelasticity

407

strain depends on the time elapsed since the strain occurred and is less for those strains occurring long ago, according to the fading memory principle described above. A superposition of the influence of previous strains holds, which means that the stress–strain relation is linear. This is the essence of the Boltzmann model of linear viscoelasticity. Elastic solids and viscous liquids (or fluids) are two main types of simple materials. Among the mechanical aspects, the elastic solid has a definite shape and deforms under external forces, stores all energy that is obtained from external forces, restores the original shape once the force is removed. The viscous liquid, instead, has no definite shape and flows irreversibly under the action of external forces. Viscoelastic materials combine both liquid- and solid-like features. Accordingly, the theory of linear viscoelasticity can be profitably applied to both liquids and solids. Quite often (see e.g. [50]) linear viscoelasticity is motivated through elementary models in terms of series of springs and dashpots as shown in Sect. 7.1. The MaxwellWiechert model, for instance, leads to (6.9), an integral relation where the dependence of the stress on the history of the strain rate can be viewed as the simplest model of a linear viscoelastic fluid. Furthermore, the standard linear solid—see (6.26)— accounts for both the dependence of the stress on the history of the strain and that of the strain on the history of the stress. In both cases the relaxation modulus turns out to be given by exponentials. For the sake of generality, here linear viscoelasticity is framed within the scheme of materials with fading memory.

7.4.1 Linear Viscoelastic Solids Let E(X, t) be the Green-St.Venant strain at X ∈ R at time t. We denote by Et (X, ·) the strain history experienced by the particle X. For simplicity, hereafter we restrict our attention to homogeneous materials so that the dependence on X is understood and not written. Let k ∈ L 1 (R+ ) and consider the fading memory space ∞

H = {Et : R+ → Sym, ∫ |Et (s)|2 k(s)ds < ∞} 0

endowed with the norm ∞

Et  = |E(t)|2 + ∫ |Et (s)|2 k(s)ds. 0

Once we make the assumption that the constitutive equation for the stress T is given by a bounded, linear functional on H then the Riesz theorem gives the representation ∞

T (Et ) = G0 E(t) + ∫ G (s)E(t − s)ds. 0

408

7 Materials with Memory

The fourth-order tensor G0 is called an instantaneous elastic modulus and governs the response to instantaneous changes in strain, whereas the fourth-order tensorvalued function G on R+ is sometimes called Boltzmann function and is the memory kernel of the linear functional T . By the Cauchy–Schwarz inequality we can write

∞

∞ 1/2  ∞ 1/2

∫ G (s)E(t − s)ds ≤ ∫ |G G (s)|2 k −1 (s)ds ∫ k(s)|E(t − s)|2 ds . 0

0

0

The assumption that the response T be finite for any history Et with a finite H-norm requires that ∞

G (s)|2 k −1 (s)ds < ∞. ∫ |G 0

Moreover,





∫ G (s)E(t − s)ds ≤ sup |Et (s)| ∫ |G G (s)|ds. s∈R+

0

0

Hence, to guarantee that all bounded histories belong to H we assume the Property 1. The function G belongs to L 1 (R+ ; Lin(Sym)). To adhere to the standard notation we let G : R+ → Lin(Sym) be such that G (s) = G (s),

G(0) = G0 ,

the prime denoting the derivative. Consistent with the notation for the Maxwell and linear solid models, the solution s

G(s) = G0 + ∫ G (ξ)dξ 0

is called relaxation function (or relaxation modulus). The limit of G(s), as s → ∞, is supposed to exist and G∞ = lim G(s) s→∞

is called an equilibrium elastic modulus in that the constancy of the history Et , Et (s) = E, s ∈ R+ , gives T(t) = G∞ E. In a solid, a nonzero constant strain is supposed to induce a nonzero stress such that T · E is strictly positive. That is why in viscoelastic solids we let G∞ > 0, whereas in viscoelastic fluids G∞ = 0.

7.4 Linear Viscoelasticity

409

The generic constitutive equation of linear viscoelasticity can be written in the form ∞ T(t) = T (Et ) := G0 E(t) + ∫ G (s)E(t − s)ds. (7.29) 0

Alternatively, we can write (7.29) in the form ∞

T(t) = T ∗ (E(t), Edt ) := G∞ E(t) − ∫ G (s)[E(t) − E(t − s)]ds.

(7.30)

0

where T ∗ is a linear functional on H∗ given by (7.5). If the strain function has compact support, or anyway lims→−∞ E(s) = 0, then an obvious integration by parts yields ∞

˙ − s)ds, T(t) = ∫ G(s)E(t

(7.31)

0

the superposed dot denoting the time derivative with respect to t. We can also write ∞

˙ − s)ds, T(t) = G∞ E(t) − ∫ [G(s) − G∞ ]E(t

(7.32)

0

7.4.2 Thermodynamic Restrictions for the Linear Viscoelastic Solid The entropy inequality ρη˙ + ∇ ·

q ρr − +∇ ·k ≥0 θ θ

and the balance of energy allow us to write η˙ +

1 1 1 1 T · D − ε˙ − 2 q · ∇θ + ∇ · k ≥ 0 ρθ θ ρθ ρ

(7.33)

Let the material undergo a cycle as t ∈ [0, d], which means that the argument of the constitutive functionals, say , satisfies (0) = (d). Hence η((d)) = η((0)) and the like for ε. As a consequence, integration of (7.33) gives

410

7 Materials with Memory



d 0

 1 1 1 1 T · D − ε˙ − 2 q · ∇θ + ∇ · k dt ≥ 0. ρθ θ ρθ ρ

Let the material undergo an isothermal process, which means that θ is constant and uniform. Moreover let k = 0, which has to be shown consistent with the whole inequality. Hence we conclude that, in isothermal processes, for every cycle d

∫[ρ−1 T · D](t)dt ≥ 0. 0

˙ = FT DF we have Since E˙ = 21 C 1 1 ˙ T · D = T R R · E. ρ ρR Thus we can express the power in the form T · E˙ if T stands for the second Piola stress. Otherwise if T is the Cauchy stress then T · E˙ has to be viewed as a linear approximation (where F  1) of the power T · D. Consistent with the scheme of linear viscoelasticity, here we express the stress ˙ Moreover, the constitutive equation (7.29), or power in the linearized form T · E. (7.31), does not involve the temperature. Accordingly, in linear viscoelasticity, the second law requires that d

˙ ∫ T(t) · E(t)dt ≥0

(7.34)

0

hold for any cycle. The requirement (7.34) is a restriction on the relaxation functions compatible with thermodynamics. To find definite results9 we consider oscillatory strain tensor evolutions of the form ˜ E(t) = E1 cos ωt + E2 sin ωt, where ω > 0 and E1 , E2 ∈ Sym. Hence E˜ t = E˜ t+d ,

d = 2π/ω.

Denote by the subscript s and c the half-range sine and cosine transform, respectively ∞

G s (ω) = ∫ G (u) sin ωu du, 0



G c (ω) = ∫ G (u) cos ωu du. 0

We can now derive a set of restrictions placed by the second-law inequality on the relaxation function G. 9

The sequel of this section re-visits Chap. 3 of [158].

7.4 Linear Viscoelasticity

411

˜ ω ∈ (0, ∞), Theorem 7.1 Inequality (7.34) holds for any oscillatory function E, only if T G∞ = G∞ , (7.35) G0 = G0T , G s (ω) ≤ 0,

ω ∈ (0, ∞),

(7.36)

|E1 · (G c (ω) − G c (ω))E2 |2 ≤ 2([E1 · G s (ω)E1 ]2 + [E2 · G s (ω)E2 ]2 ), E1 , E2 ∈ Sym. T

(7.37)

Proof The history E˜ t is periodic with period d = 2π/ω. Upon substitution of E˜ t in (7.29), inequality (7.34) yields d ∫ (−ωE1 sin ωt + ωE2 cos ωt) · G0 (E1 cos ωt + E2 sin ωt) 0

 + ∫ G (s)[E1 (cos ωt cos ωs + sin ωt sin ωs) + E2 (sin ωt cos ωs − cos ωt sin ωs) ds dt ≥ 0. ∞



0

Integration with respect to t on [0, d] results in E1 · [G0T − G0 ]E2 −E1 · G s (ω)E1 −E2 · G s (ω)E2 −E1 · [G c (ω)−G c (ω)]E2 ≥ 0, (7.38) as ω > 0. By Riemann-Lebesgue’s lemma, the limit ω → ∞ makes the integrals vanish. Then the arbitrariness of E1 , E2 ∈ Sym implies that G0 = G0T and we are left with T

E1 · G s (ω)E1 + E2 · G s (ω)E2 + E1 · [G c (ω) − G c (ω)]E2 ≤ 0, ω > 0, (7.39) T

We now evaluate the limit as ω → 0+ . Since G ∈ L 1 (R+ ) then for every > 0 there exists N such that





∫ G (s) sin ωs ds ≤ ∫ |G (s)|ds < . N

N



Now let G = ∫ |G (s)|ds and observe 0

N

N

N

∫ G (s) sin ωs ds ≤ ∫ |G (s)|| sin ωs| ds < ∫ |G (s)|ωs ds. 0

0

Since ω → 0+ , we can choose ω < /G N and hence

N

∫ G (s) sin ωs ds < . 0

0

412

7 Materials with Memory

As a consequence, ∀ ∃δ(= /G N ) such that ω < δ implies ∞

N



∫ G (s) sin ωs ds ≤ ∫ |G (s)|ωsds + ∫ |G (s)|ds < 2 . 0

0

N

Likewise, since ∞





0

0

0

∫ G (s) cos ωs ds = ∫ G (s)ds + ∫ G (s)[cos ωs − 1]ds

and | cos ωs − 1| = |(− sin ωξ)ωs| ≤ ωs, ξ ∈ (0, s),

| cos ωs − 1| ≤ 2, s ∈ R,

then we find that, as ω < /G N , ∞

N



∫ G (s)[cos ωs − 1]ds ≤ ∫ |G (s)|ωs ds + ∫ |G (s)|| cos ωs − 1|ds < 3 . 0

0

N

Consequently lim G s (ω) = 0,

ω→0+



lim G c (ω) = ∫ G (s) ds = G∞ − G0 .

ω→0+

0

Accordingly, as ω → 0+ , inequality (7.39) implies T ]E2 − E1 · [G0 − G0T ]E2 ≤ 0. E1 · [G∞ − G∞

The symmetry of G0 reduces the inequality to T ]E2 ≤ 0. E1 · [G∞ − G∞

According to the arbitrariness of E1 , E2 we first select E2 = E1 and then E2 = −E1 to find the symmetry of G∞ . We now turn to (7.39). Let E1 = E2 = E and observe that, by the skew character T T of G (s) − G (s), it follows E · [G (s) − G (s)]E = 0. Hence (7.39) reduces to E · G s (ω)E ≤ 0, ω > 0,

(7.40)

that is (7.36). Finally, inequality (7.39) reads E1 · G s (ω)E1 + E2 · G s (ω)E2 + E1 · [G c (ω) − G c (ω)]E2 ≤ 0, ω > 0. T

Since G s is negative semi-definite it follows that

7.4 Linear Viscoelasticity

413

G c (ω) − G c (ω) T

is allowed to be nonzero provided that E1 · [G c (ω) − G c (ω)]E2 ≤ |E1 · G s (ω)E1 | + |E2 · G s (ω)E2 |. T

Since (a + b)2 ≤ 2(a 2 + b2 ) then (7.37) holds.



It is worth remarking that the Boltzmann function G need not be symmetric. If we assume T u ∈ R+ , G (u) = G (u), then inequality (7.37) becomes redundant and the thermodynamic restrictions reduce to (7.35) and (7.36). Inequality (7.36) in turn implies some further properties of the relaxation function. Upon extending the definition of G to R− as an odd function, G (−u) = −G (u), we let

u ∈ R+ ,



G F (ω) = ∫ G (u) exp(−iωu) du, −∞

ω ∈ R.

We can then write the half-range sine transform of G as G s (ω) = Hence

1 2



∫ G (u) sin ωu du = − 21 G F (ω),

ω ∈ R.

−∞

G s (ω) = −G s (−ω),

ω ∈ R.

Thus we have the inversion formula G (u) =

1 2π



∫ G F (ω) exp(iωu)dω =

−∞

1 π



∫ G s (ω) sin ωu dω =

−∞

2 π



∫ G s (ω) sin ωu dω. 0

Integration with respect to u gives G(u) − G0 =

2 π



∫ 0

1 − cos ωu Gs (ω)dω. ω

(7.41)

The negative definiteness of G s implies that G(u) − G0 < 0,

u > 0.

The limit of (7.41) as u → ∞ and that of the ratio

(7.42)

414

7 Materials with Memory

G(u) − G0 u as u → 0+ lead to G0 − G∞ > 0,

G 0 := lim+ G (u) ≤ 0. u→0

(7.43)

As a comment on (7.35), the symmetry of the instantaneous elastic modulus G0 was proved by Coleman [94], as a consequence of the second law in the form of the Clausius–Duhem inequality, whereas the symmetry of the equilibrium elastic modulus G∞ was derived by Day [114] via the Clausius inequality. Inequality (7.36) was first derived by Graffi [202] in the case of isotropic materials by requiring that energy be dissipated in a period of a sinusoidal strain function. Inequality (7.42), derived in [157], is in a sense related to a result by Day [114] who showed that, as a consequence of dissipativity, the relaxation function satisfies the condition (7.44) G0 − G∞ ≥ ±[G(s) − G∞ ]. To establish the connection between the two inequalities, observe that the limit as u → ∞ of (7.41) results in G∞ − G0 =

2 π



∫ 0

1 G (ω)dω. ω s

Hence it follows from (7.41) that ∞

G(u) − G∞ = − π2 ∫ 0

cos ωu Gs (ω)dω. ω

Now, for any strain E we have ∞

∫ 0

∞ | cos ωu|

∞ cos ωu

1 |E · G s (ω)E|dω ≥ ∫ |E · G s (ω)E|dω ≥ ∫ [E · G s (ω)E]dω . ω ω ω 0 0

Consequently, using (7.40) we get ∞ ∞



π E · [G − G ]E = E · ∫ 1 [−G (ω)]dω E = ∫ 1 |E · G (ω)E| dω ≥ π E · [G(u) − G ]E , ∞ ∞ 0 s s ω ω 2 2 0 0

whence E · [G0 − G∞ ]E ≥ |E · [G(u) − G∞ ]E|. Inequality (7.44) is a short way of writing this result. As we see in a moment, the restrictions (7.35) and (7.36) are also sufficient for the validity of the second law. In this connection, we prove the following preliminary

7.4 Linear Viscoelasticity

415

Lemma 7.1 If E : R → Sym is a periodic function with period d and K ∈ Lin(Sym) is symmetric, K = KT , then d

˙ ∫ E(t) · KE(t)dt = 0. 0

Proof Observe ˙ ˙ ˙ E(t) · KE(t) = [E(t) · KE(t)]˙− E(t) · KE(t) = [E(t) · KE(t)]˙− E(t) · KT E(t). Since KT = K then

˙ 2E(t) · KE(t) = [E(t) · KE(t)]˙.

Integration over [0, d] gives d

˙ ∫ E(t) · KE(t)dt = 0

1 2

d E(t) · KE(t) 0 = 0.



Theorem 7.2 If G satisfies (7.35) and (7.36) for any ω ∈ [0, ∞) then the second-law inequality (7.34) holds for any periodic strain function. Proof Let E : R → Sym be a periodic function with period d. Represent E through the Fourier series E(t) =

∞

k=0 Ak

cos kωt + Bk sin kωt,

Ak , Bk ∈ Sym,

where ω = 2π/d. The history Et is a cycle on [0, d]. Let d

˙ w := ∫ T (Et ) · E(t)dt, 0

where T (Et ) is the functional (7.29) of linear viscoelasticity. We have to show that w ≥ 0 as a consequence of (7.35) and (7.36). Since E is periodic with period d then the symmetry of G0 and the Lemma imply that d

˙ ∫ E(t) · G0 E(t)dt = 0. 0

Hence

d ∞ ˙ · ∫ G (u)E(t − u)du dt w = ∫ E(t) 0

Upon substitution of

0

416

7 Materials with Memory

Et (u) =

∞

k=0 Ak

cos kω(t − u) + Bk sin kω(t − u),

we have d ∞ ∞ ∞ k=1 h=0 kω[−Ak sin kωt + Bk cos kωt] 0 0 ·G (u)[Ah cos hω(t − u) + Bh sin hω(t

w=∫∫

− u)]du dt.

Term by term integration of the double series shows that the only nonzero terms are those with h = k. Integration in t times kω provides −π. Hence we are left with w = −π

∞ ∞ T k=1 ∫ {Ak · G (u)Ak + Bk · G (u)Bk ] sin kωu + Ak · [G (u) − G (u)]Bk cos ku}du 0

whence w = −π

∞

k=1 {Ak

· G s (kω)Ak + Bk · G s (kω)Bk + Ak · [G c (kω) − G c (kω)]Bk }. T

By (7.37) each k-th term, in braces, of the series is non-positive and hence it follows that w ≥ 0.  The connection between (7.36) and the energy dissipation is emphasized in [276], Sect. 13, where the energy dissipated in one period of the strain function is determined. As a particular case of the previous evaluation of w, if E(t) = E0 sin ωt, d = 2π/ω, we have w = −πE0 · G s (ω)E0 . That is why −G s (ω) is often called the loss modulus.

7.4.3 Free Energies and Minimal States By the entropy inequality (7.33), in the approximation of linear isothermal viscoelasticity we can write (7.45) − ρψ˙ + T · E˙ = θσ R ≥ 0 ρ being regarded as constant. Since T is given by the functional (7.29) we take the free energy ψ as a functional of the present value E(t) and the past history Ert . Hence ˆ ψ(t) = ψ(E(t), Ert ). Upon substitution in (7.45) we have ˆ ˆ ˙ ˙ −ρ∂E ψ(E(t), − ρ d ψ(E(t), Ert | E˙ rt ) + T (E(t), Ert ) · E(t) = θσ R ≥ 0. Ert ) · E(t)

7.4 Linear Viscoelasticity

417

For any history Et , we can take a strain history E t such that E˙ (t) is arbitrary while E (t) = E(t),

E t − Et H < , E

t E˙ − E˙ t H < ,

where is arbitrarily small. Accordingly the arbitrariness of the present value E˙ (t) implies that ˆ ˆ d ψ(E(t), Ert | E˙ rt ) = −θσ R /ρ ≤ 0. T = ρ ∂E ψ, In view of (7.29) we have ∞

ˆ Ert ) = G0 E(t) + ∫ G (u)E(t − u)du ρ ∂E ψ(E(t), 0

whence ∞

ˆ ˜ Et ). (7.46) ρ ψ(E(t), Ert ) = 21 E(t) · G0 E(t) + E(t) · ∫ G (u)E(t − u)du + ρψ( r 0

In addition to ρ ∂E ψˆ = T , the functional ψˆ has to satisfy d ψˆ ≤ 0 and to be such that, within the histories ending with given values of E, the functional corresponding to constant values of E for all times has the least value. This minimum property is consistent with the general statement for functionals of Ft , θt . Yet we can see it very briefly. By (7.45) it follows that ψ is decreasing while E is constant. Let Ett00 +d be the constant continuation of Et0 up to t0 + d. Hence t0 +d

˙ ∫ T (Ett0 ) · E(t)dt = 0, t0

ψ(t0 + d) ≤ ψ(t0 ).

If d → ∞ then we have t0 † t0 ˆ ˆ ψ(E(t 0 ), E ) ≤ ψ(E(t0 ), Er ).

Even if we assume a normalization condition at the null constant history, ˆ 0t† ) = 0, ψ(0,

(7.47)

the free energy functional ψˆ is non-unique10 ; this fact was first shown by Day [115]. For definiteness, after introducing the concept of a minimal state, we now determine some well-known expressions of this functional.

10

See, e.g. [158], Sect. 3.5.

418

7 Materials with Memory

Minimal states We consider the concept of a minimal state in the context of linear viscoelasticity. It might happen that two different initial past histories, E1t0 and E2t0 , lead to the same strain and stress for t ≥ t0 . From the viewpoint of the dynamics, such two different initial past histories are in a fact indistinguishable. This observation suggests that, rather than the past history Et , one should employ an alternative variable to describe the initial state of the system, satisfying the following natural minimum property: two different initial states produce different evolutions for t ≥ 0. From the philosophical side, this means that the knowledge of E(t) for all t ≥ t0 determines in a unique way the initial state at t0 of the problem, the only object that really influences the future dynamics. Definition. According to [124], two strain histories E1t and E2t are equivalent, or in the same minimal state, if and only if E1 (t + s) = E2 (t + s), s ∈ R+ provided that

(7.48)

Jt (s, E1t ) = Jt (s, E2t ), s ∈ R+

where Jt is the linear functional ∞

Jt (s, Et ) = ∫ G (s + u)Et (u)du.

(7.49)

0

A minimal state is therefore given by an equivalence class according to the equivalence relation so defined. The linearity of the functional Jt means that the requirement of the equivalence of E1t and E2t is the same as that E1t − E2t be equivalent to the zero history. Thus, if the minimal state, including the zero history, is singleton (non-singleton) then all minimal states are singleton (non-singleton). If we introduce the extra condition E1 (t) = E2 (t) in the definition of a minimal state, then (7.48) follows from E˙ 1 (t + s) = E˙ 2 (t + s), s ∈ R+ , and, according to [129], we can introduce the minimal state functional as ∞ ∞ It (s, Edt ) = − ∫ G (s + u)Edt (u) du = ∫ G (s + u) Et (u) − E(t) du, 0

0

s ≥ 0.

For any given history Et , the function It (s, ·) fulfils the partial differential equation ˇ E(t), ˙ ∂t It (s) = ∂s It (s) + G(s) Note that the state variables It and Jt are related by

t, s ∈ R+ .

7.4 Linear Viscoelasticity

419

ˇ It (s, Edt ) = Jt (s, Et ) + G(s)E(t),

u ≥ 0,

ˇ where G(s) = G(s) − G∞ . Therefore, the stress–strain relation (7.30) takes the compact forms T(t) = T ∗ (E(t), Edt ) = G∞ E(t) + It (0, Edt ) = G0 E(t) + Jt (0, Et ). In order to define a free energy as a state functional, rather than It , it seems more convenient to consider as a state the new variable [161] ∞

ξ t (s, Edt ) = −∂s It (s, Edt ) = ∫ G (s + u)Edt (u) du, 0

s ≥ 0,

(7.50)

which, in turn, fulfils the linear transport equation ˙ ∂t ξ t (s) = ∂s ξ t (s) − G (s)E(t),

t, s ∈ R+ .

(7.51)

We mention some further references on minimal states. In [161] the history and the state formulations are compared, showing some advantages of states. In [162, 199] the concept of a minimal state is applied to obtain a general representation of free energies in the frequency domain, including explicit expressions for the minimum and maximum free energies. The Graffi-Volterra free energy First we assume G ∈ W 1,1 (R+ ; Lin(Sym)) and G (u) = G (u), T

Letting

G (u) ≤ 0,

G (u) ≥ 0,

u ∈ R+ .



ρψ˜ G (Ert ) = − 21 ∫ E(t − u) · G (u)E(t − u)du, 0

observing that



G∞ = G0 + ∫ G (u)du, 0

and using the symmetry of G , from (7.46) we obtain ρ ψˆ G (E(t), Edt ) = 21 E(t) · G∞ E(t) −

1 2



∫ Edt (u) · G (u)Edt (u) du. 0

where Edt (u) = E(t) − E(t − u). Since G ≤ 0, this functional is well defined in H∗G ⊂ H∗ , the Hilbert space endowed with the inner product and the associated norm in the form

420

7 Materials with Memory ∞

t t (E1t , E2t )H∗G := E1 (t) · E2 (t) − ∫ E1d (s) · G (s)E2d (s) ds, 0



Et 2H∗G := |E(t)|2 − ∫ Edt (s) · G (s)Edt (s) ds. 0

The functional ψˆ G represents a quadratic free energy. Indeed, it is apparent that ψˆ satisfies the minimum property at constant histories; for any history Et0 we have G

∞ ρ ψˆ G (Et0 ) − ψˆ G (Et0 † ) = − 21 ∫ Edt0 (u) · G (u)Edt0 (u) du ≥ 0, 0

and the conclusion follows. To prove that d ψˆ G (E(t), Ert | E˙ rt ) ≤ 0 we observe ∞

˙ − u) · G (u)[E(t) − E(t − u)]du. ρ d ψˆ G (E(t), Ert | E˙ rt ) = ∫ E(t 0

˙ − u) = −∂u E(t − u), upon an integration by parts we obtain Since E(t ∞

ρ d ψˆ G = ∫ ∂u [E(t) − E(t − u)] · G (u)[E(t) − E(t − u)]du 0

= 21 [E(t) − E(−∞)] · G ∞ [E(t) − E(−∞)] −

1 2



∫ Edt (u) · G (u)Edt (u) du. 0

G ∞ = limu→∞ G (u) and the positive semi-definiteness of G imply The vanishing ofG that d ψˆ G is a functional of the difference history Edt only, and d ψˆ G ≤ 0. Moreover, if we assume the stronger condition11 G (s) + λG (s) ≥ 0, s ∈ R+ ,

(7.52)

then it follows that ∞ d ψˆ G ≤ 21 λ ∫ Edt (u) · G (u)Edt (u) du = −λ ψˆ G (E(t), Edt ) − 21 E(t) · G∞ E(t) . 0

(7.53) The free energy functional ψˆ G traces back to Volterra [436–438]. The thermodynamic admissibility was proved by Graffi [203–205]. This free energy is not in general a functional of the minimal state unless all minimal states are singleton [124]. It is however a positive definite functional on H∗G . By virtue of this property and of the inequality (7.53), energy functionals related to ψˆ G have often been used in the literature to obtain interesting stability results in linear viscoelasticity [112, 198].

11

This condition implies the exponential decay of the relaxation function [198].

7.4 Linear Viscoelasticity

421

The work function Let G ∈ W 1,1 (R+ ; Lin(Sym)) satisfy G (u) = G (u), u ∈ R+ , and the thermodynamic condition (7.36). Assuming that u = |s1 − s2 |, s1 , s2 ∈ R+ , we have T

∂s j G(|s1 − s2 |) = (−1) j+1 sgn (s1 − s2 )G (|s1 − s2 |).

(7.54)

Define G12 (|s1 − s2 |) = ∂s1 ∂s2 G(|s1 − s2 |). It follows that G12 has singular delta function behaviour at s1 = s2 . Indeed, G12 (|s1 − s2 |) = −2δ(s1 − s2 )G (|s1 − s2 |) − G (|s1 − s2 |)

(7.55)

and therefore G12 is not bounded. Nevertheless we can define ρψ˜ W (Ert ) =

1 2

∞∞

∫ ∫ E(t − s1 ) · G12 (|s1 − s2 |)E(t − s2 ) ds1 d2 . 0 0

Lemma 7.2 If Et ∈ L 2 (R+ ; Sym) the thermodynamic condition (7.36) implies ρψ˜ W ≥ 0. Proof By (7.55) and the symmetry of G we obtain ∞∞

ρψ˜ W (Ert ) = − ∫ ∫ Et (s1 ) · δ(s1 − s2 )G (|s1 − s2 |)Et (s2 ) ds1 ds2 − =−

0 0 ∞∞ 1 ∫ ∫ Et (s1 ) · G (|s1 − s2 |)Et (s2 ) ds1 ds2 2 0 0 s1 ∞ ∞ ∫ Et (s) · G 0 Et (s) ds − ∫ Et (s1 ) · ∫ G (s1 0 0 0

− s2 )Et (s2 ) ds1 ds2 .

Upon extending the definition of G and Et to R− by letting, G (s) = Et (s) = 0,

s < 0,

we can write ∞ ρψ˜ W (Ert ) = − ∫ Et (s) · [G + G 0 δ ∗ F Et (s) ds, −∞

where ∗ F denotes the Fourier convolution, ∞ [G + G 0 δ ∗ F Et (s) := ∫ G (s − τ ) + G 0 δ(s − τ ) Et (τ ) dτ . −∞

422

7 Materials with Memory

Hence, since G + G 0 δ represents the distributional derivative of the extension of G then by applying the Parseval–Plancherel theorem we have12 ∞

1 ∫ [Et (ω)]∗ · [G + G δ ρψ˜ W = − 2π F 0 −∞





F

1 ∫ [Et (ω)]∗ · iωG (ω)Et (ω) dω, (ω)EtF (ω) dω = − 2π F F F −∞

Now, since G and Et vanish on (−∞, 0), their cosine transforms G c , Ect are even functions while their sine transforms G s , Est are odd functions. Hence, (7.36) implies ∞ 1 ∫ ω Ect (ω) · G s (ω)Ect (ω) + Est (ω) · G s (ω)Est (ω) dω ≥ 0. ρψ˜ W = − 2π −∞



We go back to the classical functional ψˆ in (7.46) and observe that since ∞∞



0 0

0

∫ ∫ G12 (|s1 − s2 |) ds1 ds2 = − ∫ ∂s1 G(|s1 − s2 |) ds1 = G0 − G∞ ,

then it follows ρ ψˆ W (E(t), Edt ) = 21 E(t) · G∞ E(t) +

1 2

∞∞

∫ ∫ Edt (s1 ) · G12 (|s1 − s2 |)Edt (s2 ) ds1 ds2 , 0 0

where the symmetry of G12 has been used. Apparently the functional ψˆ W is quadratic in E and Edt . To compute d ψˆ W we observe ∞∞

˙ − s1 ) · G12 (|s1 − s2 |)[E(t) − E(t − s2 )]ds1 ds2 ρ d ψˆ W = ∫ ∫ E(t 0 0 ∞

∞∞

0

0 0

˙ − s) ds · E(t) + ∫ ∫ E˙ t (s1 ) · G12 (|s1 − s2 |)Et (s2 ) ds1 ds2 . = ∫ G (s)E(t Upon changing the order in the double integral, an integration by parts yields ∞



˙ − s) ds · E(t) − ∫ E˙ t (s1 ) · G (s1 )E(t)ds1 ρ d ψˆ W = ∫ G (s)E(t 0 ∞

0

˙t



˙t

+ ∫ E (s1 ) · ∫ ∂s1 G(|s1 − s2 |)E (s2 ) ds2 ds1 . 0

0

The first two integrals cancel each other while the third integral vanishes as a con˙ This is why ψ W (t) = sequence of (7.54). Hence, d ψˆ W = 0 so that ρψ˙ W = T · E. t W ˆ ψ (E(t), Ed ) is named work function. The minimum property at constant histories can be proven as a consequence of the thermodynamic restriction G s ≤ 0. First we observe that ρψˆ W (Et0 † ) = 21 E(t0 ) · G∞ E(t0 ). 12

As elsewhere, for any complex-valued quantity E, E ∗ denotes the complex conjugate.

7.4 Linear Viscoelasticity

423

Then applying the previous Lemma with Et replaced by Edt0 we obtain ρ[ψˆ W (Et0 ) − ψˆ W (Et0 † )] =

1 2

∞∞

∫ ∫ Edt0 (s1 ) · G12 (|s1 − s2 |)Edt0 (s2 ) ds1 ds2 ≥ 0. 0 0

The work function ψ W is not in general a functional of the minimal state [123]. It is however a positive definite functional on the Hilbert space H∗W endowed with the inner product and the associated norm in the form ∞∞

t t (E1t , E2t )H∗W := E1 (t) · E2 (t) + ∫ ∫ E1d (s1 ) · G12 (|s1 − s2 |)E2d (s2 ) ds1 ds2 , 0 0

∞∞

Et 2H∗W := E(t) · E(t) + ∫ ∫ Edt ((s1 ) · G12 (|s1 − s2 |)Edt (s2 ) ds1 ds2 . 0 0

We finally note that ψ W is sometimes referred to as maximum free energy [159, 162] since it is the maximum free energy for singleton materials. In particular, it can be shown that ψ G (t) ≤ ψ W (t). Free energy as a functional of the minimal state We recall that the Graffi-Volterra free energy holds under the assumptions that G ∈ W 1,1 (R+ ; Lin(Sym)) and G (u) = G (u), T

G (u) < 0,

G (u) ≥ 0,

u ∈ R+ .

With these conditions we now consider the functional ρψˆ F (E(t), Edt ) = 21 E(t) · G∞ E(t) −

1 2



∫ ξ t (s, Edt ) · [G ]−1 (s)ξ t (s, Edt ) ds, 0

which involves the minimal state representation ξ t (s, Edt ) as defined in (7.50). Since G < 0, this functional is well defined in H∗F ⊂ H∗ , the Hilbert space endowed with the inner product and the associated norm in the form ∞

t t (E1t , E2t )H∗F := E1 (t) · E2 (t) − ∫ ξ t (s, E1d ) · [G ]−1 (s)ξ t (s, E2d ) ds, 0



Et 2H∗F := |E(t)|2 − ∫ ξ t (s, Edt ) · [G ]−1 (s)ξ t (τ , Edt ) ds. 0

To prove that ψˆ F is a (quadratic) free energy functional we first observe that

424

7 Materials with Memory ∞

ξ t (s, Edt ) = −G (s)E(t) − ∫ G (s + u)Et (u) du = −G (s)E(t) − ∂s Jt (s, Et ), 0

and then we can write ∞

ρψˆ F = 21 E(t) · G0 E(t)+E(t) · Jt (0, Et )− 21 ∫ ∂s Jt (s, Et ) · [G ]−1 (s)∂s Jt (s, Et ) ds. 0

Hence it follows ρ∂E ψˆ F (E(t), Edt ) = G0 E(t) + Jt (0, Et ) = G∞ E(t) + It (0, Edt ) = T ∗ (E(t), Edt ). Since G < 0 then it is apparent that ψˆ F enjoys the minimum property at constant histories. To evaluate d ψˆ F we first observe ˆ ˙ ˙ = ρψ˙ F (t) − T(t) · E(t). Ert ) · E(t) ρ d ψˆ F (E(t), Ert | E˙ rt ) := ρψ˙ F (t) − ρ∂E ψ(E(t), Then (7.51) allows us to find ∞



0

0

˙ ˙ ρ d ψˆ F = − It (0) · E(t) − ∫ ξ t (s) · [G ]−1 (s)∂s ξt (s) ds − ∫ ∂s It (s) ds · E(t) =−

1 2

∞ 0

∞ ξ t (s) · ∂s [G ]−1 (s)ξ t (s) ds = [G ]−1 ξ t (s) · G (s)[G ]−1 ξ t (s) ds ≥ 0. 0

The property d ψˆ F ≥ 0 completes the set of thermodynamic conditions for ψˆ F to be a free energy functional. Incidentally, it can be shown that13 ψ F (t) ≤ ψ G (t).

7.4.4 Viscoelastic Solids with Unbounded Relaxation Functions There are models in linear viscoelasticity where the kernel is unbounded; this is particularly the case of kernels in a power-law form [448]. Advances in mechanical testing hardware have revealed that many materials possessing a complex microstructure exhibit a characteristic power-law signature in their creep and relaxation. Examples of such materials include biological materials, gels, polymers, concrete, asphalt, ice (see [57] and refs therein). Constitutive properties are assessed by experimental data, possibly involving the stress due to time-harmonic strain. Next questions arise about the selection of the relaxation function on [0, ∞) [90]. 13

See, e.g. [161].

7.4 Linear Viscoelasticity

425

Within this framework, linear viscoelastic solids are modelled by letting the kernel G = G belong to L 1 (R+ ; Lin(Sym)) in time. This condition implies the boundedness of the instantaneous elastic modulus ∞

G0 := lim+ G(s) = G∞ + ∫ G (s) ds s→0

0

and agrees with the fading memory principle (at least in its weak form) as stated by Day [115]. In addition, it is usually understood that G 0 := lims→0+ G (0) is a bounded tensor, too. In the literature various assumptions have been considered for the singularity of the relaxation function. For instance, some papers [128, 161, 187, 198] showed that a singular kernel can yield smoothing effects for the solution to the evolution problem, even if the gain in regularity cannot be derived without specifying the kind of singularity. Here, we consider a viscoelastic solid whose memory kernel G = G may exhibit an initial singularity [80, 197]. Here, we consider a viscoelastic solid whose memory kernel G = G may exhibit an initial singularity. If it is integrable then G0 is bounded, but G 0 does not necessarily have to be. Otherwise, both are unbounded. In general, it is convenient to introduce ˇ G(s) = G(s) − G∞ and to express the stress–strain constitutive relation as in (7.32), namely ∞

ˇ E(t ˙ − s)ds, T(t) := T (Et ) = G∞ E(t) + ∫ G(s)

(7.56)

0

together with the quite mild constitutive assumptions ˇ ∈ L 1 (R+ ; Lin(Sym)), G

G∞ bounded.

(7.57)

Hence we let both G0 and G 0 be unbounded but require that the possible singularity of ˇ at s = 0 be integrable. It is worth mentioning that viscoelastic solids are modelled G by nonlinear functionals of E˙ t in [361]; the linear term in [361] amounts to taking G∞ = 0 in (7.56). For simplicity we restrict attention to the same set of histories H considered in Sect. 7.4.1. If the second law for isothermal processes is taken in the form (7.34), that is the inequality d

˙ dt ≥ 0 ∫ T(t) · E(t) 0

is assumed to hold for any non-constant cycle, then we obtain the following result [197].

426

7 Materials with Memory

Theorem 7.3 The constitutive stress–strain relation (7.56), subject to conditions (7.57), satisfies the second law of thermodynamics only if T = G∞ , G∞



ˇ c (ω) := ∫ G(s) ˇ G cos ωs ds > 0, ∀ω > 0.

(7.58)

0

Proof Let E1 , E2 ∈ Sym and consider the periodic tensor function with period d = 2π/ω ˜ ω > 0. E(t) = E1 cos ωt + E2 sin ωt, For any finite value of ω we have E˜ t ∈ H and E˜ t = E˜ t+d , d = 2π/ω. Substitution in (7.34) and integration with respect to t on (0, d) yields  T ˇ c (ω)E1 + E2 · G ˇ c (ω)E2 + E2 · G ˇ s (ω) − G ˇ T (ω) E1 > 0 )E1 + ω E1 · G E2 · (G∞ − G∞ s

(7.59) for any ω > 0. The limit case ω → 0 and the arbitrariness of E1 , E2 imply the symmetry of G∞ . Accordingly,  ˇ c (ω)E1 + E2 · G ˇ c (ω)E2 + E2 · G ˇ s (ω) − G ˇ T (ω) E1 > 0, ω > 0, ω E1 · G s ˇ s (ω) − G ˇ T (ω), we whence, letting E1 = E2 and exploiting the skew character of G s ˇ c (ω) must be positive definite for any nonzero, finite ω.  conclude that G The proof shows that the conditions (7.58) are necessary for the validity of the second law on cycles. Really, even in the particular case of time-harmonic histories, the second law leads to the inequality (7.59) which is stronger than (7.58) and involves ˇ This difference disappears if we also the half-range Fourier sine transform of G. T ˇ ˇ (u), u ∈ [0, ∞). In general, however, G ˇ need not be symmetric, assume G(u) =G and therefore (7.59) proves to be sufficient for the validity of the second law. Applying the same procedure as in the previous section, a more general result can be established. ˇ satisfies (7.59), and hence (7.58), then the second-law inequality Theorem 7.4 If G (7.34) holds for every periodic history, with period d. ˇ ∈ L 1 (R+ ), and that Compatibility between the constitutive relation (7.56) with G 1 + related to the standard relation (7.29) with G ∈ L (R ) requires that the condition ˇ ∈ W 1,1 (R+ ). This is so by (7.59) be equivalent to (7.38), obviously by allowing G 14 virtue of the following properties. ˇ ∈ W 1,1 (R+ ; Lin(Sym)). Then for all ω ∈ R Lemma 7.3 Let G ˇ c (ω) = −G (ω), ωG s 14

See, for instance, [197].

ˇ s (ω) = G0 − G∞ + G (ω). ωG c

(7.60)

7.4 Linear Viscoelasticity

427

In order to study existence, uniqueness and asymptotic behaviour of the solution to the dynamic problem, in [187] the Laplace transform method is applied when G has a singularity at t = 0 and satisfies (7.57) together with thermodynamic restrictions (7.58). Free energy with an unbounded memory kernel T

ˇ (u) = G(u), ˇ ˇ ∈ L 1 (R+ ) satisfies G u ∈ R+ , and the thermodynamic Assume that G t condition (7.58). Let W (E ) be the work performed by the stress tensor (7.56) in connection with the history Et , namely t t ∞ ˇ E(τ ˙ ) dτ = ∫ G∞ E(τ ) + ∫ G(s) ˙ − s)ds · E(τ ˙ ) dτ . W (Et ) := ∫ T (Eτ ) · E(τ −∞

−∞

0

ˇ to R− by letting Moreover, let limτ →−∞ E(τ ) = 0 and extend the definition of G ˇ G(s) = 0,

s < 0.

ˇ we can write [8] Thanks to the symmetry of G∞ and G t τ ˇ − u)E(u)du ˙ ˙ ) dτ W (Et ) = ∫ G∞ E(τ ) + ∫ G(τ · E(τ −∞

−∞

=

1 2

E(t) · G∞ E(t) +

1 2

=

1 2

E(t) · G∞ E(t) +

1 2

t

t

ˇ ˙ ) · G(|τ ˙ ∫ ∫ E(τ − u|)E(u) du dτ

−∞ −∞ ∞∞

ˇ 1 − s2 |)E(t ˙ − s1 ) · G(|s ˙ − s2 ) ds1 ds2 . ∫ ∫ E(t 0 0

(7.61) This suggests that we consider the work function as an energy functional, namely ρψˆ w (E(t), Ert ) = Consequently,

1 2

E(t) · G∞ E(t) +

1 2

∞∞

ˇ 1 − s2 |)E˙ t (s2 ) ds1 ds2 . ∫ ∫ E˙ t (s1 ) · G(|s 0 0

˙ ρψ˙ w (t) = W˙ (Et ) = T (Et ) · E(t),

so proving that d ψˆ w = 0. Now, we continuously extend the history Et to R− by ˙ ) = 0 for all τ > t. Accordingly, letting Et (s) = Et (0) = E(t), s < 0, so that E(τ (7.61) yields ρψˆ w (E(t), Ert ) = W (Et ) = 21 E(t) · G∞ E(t) + = 21 E(t) · G∞ E(t) +

∞ −∞ ∞ −∞

˙ )· E(τ

∞ −∞

ˇ − u)E(u) ˙ G(τ du dτ

ˇ ∗ F E˙ (u) du ˙ )· G E(τ

428

7 Materials with Memory

and applying the Parseval–Plancherel theorem it follows ρψˆ w = 21 E(t) · G∞ E(t) +

1 2π



ˇ F (ω)iωE F (ω) dω. ∫ (iωE F )∗ (ω) · G

−∞

ˇ vanishes on (−∞, 0), its cosine transform G ˇ c is an even function while Now, since G ˇ s is odd, and applying (7.58) we conclude that the sine transform G ρψˆ w − 21 E(t) · G∞ E(t) =

1 2π

∞ −∞

ˇ c (ω)Ec (ω) + Es (ω) · G ˇ c (ω)Es (ω) dω ≥ 0, ω 2 Ec (ω) · G

so proving the minimum property at constant histories. On the other hand, extending the definition of Et to R− by letting Et (s) = 0, s < 0, and taking into account that E˙ + E(t) δ represents the distributional derivative of the extension of E, we obtain ∞  ∞  ˇ E(τ ˙ − s) ds · E(τ ˙ ) + δ(t − τ )E(t) dτ W (Et ) = 21 E(t) · G∞ E(t) + G(s)

= 21 E(t) · G∞ E(t) + = 21 E(t) · G∞ E(t) +

−∞ 0 ∞  ∞ 

−∞ −∞ ∞ 

ˇ E(τ ˙ ) · G(s) ˙ − s)dudτ + E(τ

∞ 

ˇ E(t ˙ − s)ds · E(t) G(s)

0

ˇ E(t ˙ − s)ds · E(t) + G(s)

0

∞  ∞  −∞ −∞

ˇ − u)E(u)dudτ ˙ ) · G(τ ˙ E(τ .

Since ρψˆ w (E(t), Ert ) = W (Et ) we obtain ∞

ˇ E(t ˙ − s) ds = T (Et ). ρ∂E ψˆ w (E(t), Ert ) = G∞ E(t) + ∫ G(s) 0

ˇ ˇ T (s) This completes the proof that ψˆ w represents a quadratic free energy. If G(s) =G then it is a positive definite functional on the Hilbert space Hw endowed with inner product and norm ∞∞

ˇ 1 − s2 |)Et (s2 ) ds1 ds2 , (E1t , E2t )Hw := E1 (t) · E2 (t) + ∫ ∫ E1t (s1 ) · G(|s 2 0 0

∞∞

ˇ 1 − s2 |)Et (s2 ) ds1 ds2 . Et 2Hw := E(t) · E(t) + ∫ ∫ Et (s1 ) · G(|s 0 0

ˇ ∈ W 2,1 (R+ ) and extending its Finally, assuming a regular symmetric kernel, G − ˇ definition to R by letting G(s) = 0, s < 0, we can prove that ψ w is equivalent to the work function ψ W introduced in Sect. 7.4.3. Indeed, we first rewrite (7.61) as

7.4 Linear Viscoelasticity

429

ρψˆ w = 21 E(t) · G∞ E(t) +

1 2

t

t

ˇ ˙ ) · G(|τ ˙ ∫ ∫ E(τ − u|)E(u) du dτ .

−∞ −∞

Repeated integrations by parts result in ρψˆ w = 21 E(t) · G∞ E(t) + =

1 E(t) 2

1 2

t

t

ˇ ˙ ) · G(|τ ˙ ∫ ∫ E(τ − u|)E(u) du dτ

−∞ −∞

· G∞ E(t) +

1 2

t

t

ˇ ∫ ∂τ E(τ ) · ∫ G(|τ − u|)∂u E(u) du dτ

−∞

−∞

t

ˇ − u)∂u E(u) du = 21 E(t) · G∞ E(t) + 21 E(t) · ∫ G(t −∞



ˇ ˇ ∫ E(τ ) · ∫ ∂τ G(|τ − u|) + G(0)δ(τ − u) ∂u E(u) dτ du

1 2

t

t

−∞

−∞

t

ˇ = 21 E(t) · G∞ E(t) + E(t) · G(0)E(t) + E(t) · ∫ G (t − u)E(u) du −∞

t

t

−∞

−∞

t

ˇ ˇ − ∫ E(τ ) · ∫ ∂τ G(|τ − u|)∂u E(u) dτ du − ∫ E(τ ) · G(0)∂ τ E(τ ) dτ =

1 E(t) 2

· G∞ E(t) +

1 E(t) 2

−∞ ∞

ˇ · G(0)E(t) + E(t) · ∫ G (s)Et (s) ds 0

t

t

t

ˇ − t)E(τ ) dτ + ∫ ∫ E(τ ) · G12 (|τ − u|)E(u) dτ du. − E(t) · ∫ ∂τ G(τ −∞

−∞ −∞

ˇ vanishes on R− , it follows that Since the extension of G t



−∞

0



ˇ − t)E(τ ) dτ = ∫ G ˇ (−s)Et (s) ds = 0, ∫ ∂τ G(τ

and hence we can write ρψˆ w in the form ∞

ρψˆ w = 21 E(t) · G0 E(t) + E(t) · ∫ G (s)Et (s) ds + 0

1 2

∞∞

∫ ∫ Et (s1 ) · G12 (|s1 − s2 |)Et (s2 ) ds1 ds2 . 0 0

7.4.5 Nonlinear Viscoelastic Models Free energies with quadratic memory terms yield constitutive equations with linear memory terms. Here we show that free energies of nonlinear constitutive equations can be generated from known free energies associated with linear memory-dependent response functional T . Let ψ0 (t) be a free energy, at time t,

430

7 Materials with Memory

ψ0 (t) = ψˆ 0 (E(t), Ert ). The functional ψˆ 0 is thermodynamically consistent in that is subject to the minimum property and the dissipation property, ψˆ 0 (E, Er† ) ≤ ψˆ 0 (E, Ert ), d ψˆ 0 (E, Ert |E˙ rt ) ≤ 0,

(7.62)

and is associated with a stress tensor T 0 (E(t), Ert ) in the form ρ∂E ψˆ 0 (E(t), Ert ) = T 0 (E(t), Ert ).

T0 (t) = T 0 (E(t), Ert ),

(7.63)

Correspondingly the work function W0 is defined by t

˙ )dτ . W0 (t) = ∫ T 0 (E(τ ), Erτ ) · E(τ −∞

The pair ψ0 , T 0 is subject to the entropy inequality so that ˙ = W˙ 0 . ρψ˙ 0 (t) ≤ T 0 (E(t), Ert ) · E(t) The following theorem shows how a nonlinear functional can be generated by the pair {ψ0 , T 0 }. Theorem 7.5 Let ψˆ be the functional ˆ ψ(E(t), Ert ) := f (ψ0 (t))

(7.64)

where f (x) : R+ → R+ is subject to f (0) = 0,

f (x) ≥ 0,

(7.65)

and ψ0 satisfies (7.62) and (7.63). Then ψˆ is a free energy for the stress functional T0 (E(t), Ert ). T(t) = T (E(t), Ert ) := f (ψ0 (t))T

(7.66)

ˆ Proof Consider ψ(t) = ψ(E(t), Ert ) given by (7.64). The definition (7.66) of T and the relations (7.63) allow us to write ˙ = f (ψ0 (t)) ρψ˙ 0 (t) ≤ f (ψ0 (t))T ˙ ˙ T0 (E(t), Ert ) · E(t) = T(E(t), Ert ) · E(t). ρψ(t) Moreover, by (7.63), (7.65), and (7.66), ρ∂E ψˆ = ρ f (ψ0 ) ∂E ψˆ 0 (E(t), Ert ) = T (E(t), Ert ). Now, by (7.62) we have

7.4 Linear Viscoelasticity

431

ˆ d ψ(E, Ert |E˙ rt ) = f (ψ0 ) d ψˆ0 (E, Ert |E˙ rt ) ≤ 0. As to the minimum property we observe that f is an increasing function. Hence ψˆ 0 (E, Er† ) ≤ ψˆ 0 (E, Ert ) implies ˆ ˆ Ert ). ψ(E, Er† ) = f (ψˆ 0 (E, Er† )) ≤ f (ψˆ 0 (E, Ert )) = ψ(E,



This result allows us to establish free energies and nonlinear viscoelastic models as a superposition of linear viscoelastic solids [8]. Given a nonlinear smooth function g : R+ → R+ such that g(0) = 0, g ≥ 0,

(7.67)

we can take f in the form f (ψ0 ) = ψ0 + g(ψ0 ). In that case we obtain the nonlinear stress–strain relation in the form T0 (E(t), Ert )+g (ψ0 (t))T T0 (E(t), Ert ), T (E(t), Ert )=T

ψ(t)=ψ0 (t)+g(ψ0 (t)). (7.68)

7.4.6 Some Examples Let memory kernels G be the memory kernel of a Boltzmann-type model, satisfying the thermodynamic condition (7.36), ∞

T(t) = T 0 (E(t), Ert ) = G∞ E(t) + ∫ G (s)Ert (s)ds, 0

with G∞ = lims→∞ G(s), and let ψˆ 0 be any related free energy functional. Thus, we have ˙ T0 (E(t), Ert ) and ρψ˙0 (t) ≤ T0 (t) · E(t). ψˆ 0 (E(t), Ert ) ≥ 0, ρ∂E ψˆ 0 (E(t), Ert )=T With this pair of functionals ψˆ 0 , T 0 we can determine nonlinear functionals once the function f is selected. For definiteness we let ψˆ 0 be Graffi’s free energy and proceed as follows. A nonlinear model based on Graffi’s free energy Let T0 be given by the linear functional (7.30),

432

7 Materials with Memory ∞

T0 (t) = G∞ E(t) − ∫ G (s)Edt (s)ds, 0

where Edt (u) = E(t) − Et (u);

G (u) = G (u), G (u) ≤ 0, G (u) ≥ 0, T

u ∈ R+ .

Correspondingly we choose ψˆ 0 = ψˆ G , Graffi’s free energy functional, ρ ψˆ G (E(t), Edt ) = 21 E(t) · G∞ E(t) −

1 2



∫ Edt (u) · G (u)Edt (u) du. 0

If f (ψ0 ) = ψ0 + αψ02 then we have T0 (Et , Ert ) T (Et , Ert ) = T 0 (Et , Ert ) + 2αψˆ 0 (Et , Ert )T ∞



0

0

= G∞ E(t) − ∫ G (s)Edt (s)ds + 2αψˆ 0 (E(t), Ert )[G∞ E(t) − ∫ G (s)Edt (s)ds] and ∞

ψ(E(t), Ert ) = 21 [E(t) · G∞ E(t) − ∫ Edt (s) · G (s)Edt (s)ds] 0



+α 41 [E(t) · G∞ E(t) − ∫ Edt (s) · G (s)Edt (s)ds]2 . 0

In terms of the product [A ⊗ B]C = (B · C)A for second-order tensors (see Sect. A.1.2) we can write the stress functional ∞ T(t) = G∞ E(t) − ∫ G (s)Edt (s)ds + G∞ E(t) ⊗ G∞ E(t) E(t) 0   ∞ ∞ t − ∫ G (s)Ed (s)ds ⊗ G∞ E(t) E(t) − ∫ G∞ E(t) ⊗ G (u)Edt (u) Edt (u) du 0 ∞∞

0

+ ∫ ∫ G (s)Edt (s) ⊗ G (u)Edt (u) Edt (u) duds.



0 0

Moreover, the corresponding rate of dissipation is   −ρdψ(t) = 21 0∞ Edt (s) · G (s)Ert (s) ds 1 + E(t) · G∞ E(t) − 0∞ Edt (s) · G (s)Edt (s)ds ≥ 0.

7.5 Viscous Fluids with Memory

433

For isotropic viscoelastic materials the kernel G and the relaxation modulus G∞ take the form G (s) = λ (s)1  1 + 2μ (s)I,

G∞ = λ∞ 1  1 + 2μ∞ I,

where λ , μ ∈ C 1 ∩ L 1 (R+ ; R) and λ∞ = lim λ(u), u→∞

u

λ(u) = λ0 + ∫ λ (s) ds, 0

and the like for μ. Moreover, λ , μ < 0 and λ , μ ≥ 0. Taking advantage of the decomposition, T = 13 (tr T)1 + T0 E = 13 (tr E)1 + E0 , where tr stands for the trace and the subscript 0 denotes the deviator of the tensor, we can write the nonlinear stress–strain relation in the form ∞

t T0 (t) = 2μ∞ E0 (t)[1 + 4μ∞ |E0 (t)|2 ] + 2[1 + 4μ∞ |E0 (t)|2 ∫ μ (s)E0r (s)ds ∞



0 ∞

0

0

0

t t t − 8μ∞ E0 (t) ∫ μ (s)|E0r (s)|2 ds − 8 ∫ μ (u)E0r (u)du ∫ μ (s)|E0r (s)|2 ds,



tr T(t) = K ∞ tr E(t)[1+K ∞ |tr E(t)|2 ]+[1+K ∞ |tr E(t)|2 ∫ K (s)tr Ert (s)ds 0







0

0

0

− K ∞ tr E(t) ∫ K (s)|tr Ert (s)|2 ds− ∫ K (u)tr Ert (u)du ∫ K (s)|tr Ert (s)|2 ds, where K (s) = λ(s) + 23 μ(s) and K ∞ = λ∞ + 23 μ∞ denote the bulk elastic kernel and the bulk relaxation modulus, respectively. The related free energy takes the form   t (s)|2 ds ψ(t) = 21 K ∞ |tr E(t)|2 + μ∞ |E0 (t)|2 − 21 0∞ K (s)|tr Ert (s)|2 ds − 0∞ μ (s)|E0r  2   t (s)|2 ds . + 21 K ∞ |tr E(t)|2 + μ∞ |E0 (t)|2 − 21 0∞ K (s)|tr Ert (s)|2 ds − 0∞ μ (s)|E0r

7.5 Viscous Fluids with Memory There are fluids, e.g. polymer solutions, which exhibit two types of behaviour. They are viscous and flow like a Newtonian fluid. In addition, they can store or release energy, like an elastic solid. In modelling, this behaviour can be interpreted as the

434

7 Materials with Memory

constitutive equations having some memory effect of the flow on the stress. The corresponding models are usually called viscoelastic fluids. Models of fluids with memory require two essential features. As with any material with memory the pertinent quantities (position, velocity, ...) must be related to a fixed point X ∈ R. In fluids, though, the Eulerian description is privileged as is the case for the velocity gradient in Navier–Stokes type models. To make these views operative we proceed as follows. Let  ⊂ E be the time-dependent region occupied by the fluid. At the time t the point x ∈  is occupied by the point X ∈ R such that x = χ(X, t). At any previous time τ ≤ t the position of the point X is given by χ(X, τ ). Hence the function χ(X, τ ) = χ(χ−1 (x, t), τ ) denotes the position, at time τ ≤ t, of the point (X) at x ∈  at time t. Given the times τ , t, τ ≤ t, the function ξ(x, t; τ ) := χ(χ−1 (x, t), τ ),

x ∈ ,

models an Eulerian description with the current region  as a reference configuration; in particular ξ(x, t; t) = x. So ∂ξ = ∇ξ is the gradient in (τ ) while ∇ and ∇X are the gradients in (t) and R. The relative displacement vector δ, with respect to the current configuration, is defined by δ(x, t; τ ) = ξ(x, t; τ ) − x = χ(X, τ ) − χ(X, t); it is δ(x, t; t) = 0. The velocity, at time τ ≤ t, of the point X is given by V(X, τ ) := ∂τ χ(X, τ ) = ∂τ ξ(x, t; τ ). Then the representation with respect to the current configuration is υ(x, t; τ ) := ∂τ ξ(x, t; τ ) = ∂τ δ(x, t; τ ),

x ∈ , τ ≤ t.

(7.69)

Hence we have the Eulerian description of the velocity of the fluid in the form v(ξ(x, t; τ ) = υ(x, t; τ ) and we can define the velocity gradient in the form D (x, t; τ ) := sym ∇υ(ξ, t; τ ) while ∂x υ = ∇ξ v ∇ξ

7.5 Viscous Fluids with Memory

435

At τ = t we have υ(x, t; t) = v(x, t),

D (x, t; t) = D(x, t) = sym∇v(x, t).

Alternatively we might compute D by means of the function V(X, τ ), D (x, t; τ ) = sym∇X V(X, τ ). Likewise the acceleration is determined by ∂τ υ(x, t; τ ) = ∂τ v + (v · ∇ξ )v, where v = v(ξ(x, t; τ ), τ ). At τ = t it follows ∂τ υ(x, t; τ )|τ =t = ∂t v(x, t) + (v(x, t) · ∇)v(x, t).

7.5.1 Incompressible Viscoelastic Fluids At any fixed point x ∈  and any time t ≥ 0, the symmetric Cauchy stress tensor T of a linear incompressible viscoelastic fluid is given by T = − p1 + T where p = p(x, t) is the pressure and ∞

T (x, t) = 2 ∫ G μ (s) D (x, t − s)ds.

(7.70)

0

The memory kernel G μ is called relaxation shear modulus. This constitutive equation implies that the stress response at (x, t) depends linearly on the values of the strain rate experienced in the instants preceding t at the point which is at x at time t. The incompressibility constraint is given by tr D (x, τ ) = ∇ · υ(x, t; τ ) = 0,

x ∈ , τ ≤ t.

If D is nearly constant then D (x, τ ) ≈ D (x, t) = D(x, t), τ ≤ t, and we can use the approximation T (t) = 2μ0 D(t),



μ0 = ∫ G μ (s)ds. 0

where μ0 represents the shear viscosity at constant strain rate. Borrowing from linear viscoelasticity (cf. [18]), we consider E (x, τ ) = sym∇δ(x, t; τ ), subject to the linearized incompressibility constraint

436

7 Materials with Memory

tr E (x, t; τ ) = ∇ · δ(x, t; τ ) = 0,

x ∈ , τ ≤ t.

Then also E (x, t; t) = 0 and in view of (7.69) ∂τ E (x, t; τ ) = D (x, t; τ ). A formal integration by parts transforms (7.70) into ∞

T (x, t) = 2 ∫ G μ (s) E (x, t; t − s)ds

(7.71)

0

The system of equations for the dynamics of an incompressible viscoelastic fluid is then obtained in the form ⎧ ⎨∇ · v = ∇ · δ = 0, ∞ ⎩ρ vt + (v · ∇) v (t) + ∫ G μ (s)ut (t − s) ds + ∇ p = ρb, 0

where ρ is the (constant) mass density and the dependence on x is understood and not written. In a linear incompressible viscoelastic fluid of Jeffreys type the stress–strain rate constitutive relation consists of a Newtonian contribution and a viscoelastic contribution,15 ∞

T (x, t) = 2μD(x, t) + 2 ∫ G μ (s) D t (x, t − s)ds,

(7.72)

0

which leads to the differential system ⎧ ⎨∇ · v = ∇ · u = 0, ∞ ⎩ρ vt + (v · ∇) v (t) − μv + ∫ G μ (s)ut (t − s) ds + ∇ p = ρb, 0

Approximation for small Weissenberg numbers To express the equation of motion in terms of a single variable, we need to establish a relationship,16 possibly approximate, between δ and v. We first observe that δ satisfies the integral equation t

δ(x, t; τ ) = ∫ v(x − ut (x, ξ), ξ)dξ, τ

Indeed, for any fixed t and for all τ ≤ t we have 15 16

See [244]. See, for instance, [186].

τ ≤ t.

7.5 Viscous Fluids with Memory

437

t

t

t

τ

τ

τ

∫ v(x + δ(x, t; ζ)dζ = ∫ υ(x, t; ζ)dζ = ∫ ∂ζ δ(x, t; ζ)dζ = −δ(x, t; τ ). Assume the position is in a neighbourhood of the rest position x. We can then make the first approximation t − δ(x, t; τ ) ≈ ∫ v(x, ζ) + (δ(x, t; ζ) · ∇)v(x, ζ) dζ,

τ ≤ t.

τ

(7.73)

Upon a change of variables s = t − τ and y = t − ζ we have s

−δ(x, t; t − s) ≈ ∫[v(x, t − y) + (δ(x, t; t − y) · ∇)v(x, t − y)]dy 0

Unfortunately, this integral relation is not explicit in v → δ. We then look for a further approximation. Let L and U be the characteristic length and velocity of the fluid flow, respectively. The ratio L/U is called kinematic time of the flow. Moreover, a natural time  can be defined in terms of the memory kernel, which represents in some sense the length of the memory. Following [18, p. 249] we assume  as defined in the form ∞ 2 s G μ (s)ds  = 0 ∞ . (7.74) 2 0 sG μ (s)ds Hence, we consider the dimensionless space and time variables xˆ =

x , L

tˆ =

Ut , L

τˆ =

Uτ , L

sˆ =

s 

and introduce the Weissenberg number We = U /L. It follows t − s = (tˆ − We sˆ )L/U. Then, letting We = and removing the hats, by recursion, the integral equation (7.73) yields the following power expansion of δ(x, t; t − s) with respect to s y s −δ(x, t; t − s) ≈ ∫ v(x, t − y)dy − 2 ∫ ∫ v(x, t − ζ)dζ · ∇ v(x, t − y)dy + 3 . . . 0

0

0

It is readily seen that the first-order approximation in the Weissenberg number gives τ

δ(x, t; t − τ ) ≈ ∫ v(x, t − ξ)dξ, 0

τ ≥ 0.

and then ∂τ δ(x, t; t − τ ) ≈ v(x, t − τ ). This allows us to approximate D with D at small Weissenberg numbers, that is when the natural time of the fluid, , is small if compared to the kinematical time D/U of the flow.

438

7 Materials with Memory

7.5.2 Compressible Viscoelastic Fluids Let T = − p1 + T ∈ Sym be the Cauchy stress. Compressible fluids are modelled through two scalar relaxation functions, μ and λ, named shear and bulk viscosity modulus; the stress T is determined in the form ∞



0

0

T (x, t) = 2 ∫ μ(s) D (x, t; t − s)ds + ∫ λ(s) tr D (x, t; t − s) ds.

(7.75)

For simplicity, we assume that D t can be approximated by Dt . Thermodynamic consistency of a Boltzmann-like model Consider a linear model of the viscous fluid in the form ∞



0

0

T = − p(ρ, θ) 1 + 2 ∫ μ(s)Dt (s)ds + ∫ λ(s)(tr Dt )(s)ds; the stress T = T + p1 is a (linear) functional of the history of D. We can then ask about the thermodynamic consistency of this model where the variables are the present values ρ, θ and the history Dt . By means of the deviator D0 we can write ∞



0

0

T = − p(ρ, θ) 1 + 2 ∫ μ(s)Dt0 (s)ds + ∫ κ(s)(tr Dt )(s)ds 1,

(7.76)

where κ = λ + 2μ/3. In view of (7.76) we assume that ψ and η are given by functionals of ρ, θ, Dt0 , tr Dt . It follows from the entropy inequality ˙ −T·D≥0 − ρ(ψ˙ + η θ)

(7.77)

that as θ, ρ are constants and D = 0 in the interval t ∈ [t0 , tˆ] we have ψ(tˆ) − ψ(t0 ) ≤ 0. As tˆ − t0 → ∞ we find that ψ(ρ, θ, 0† ) − ψ(ρ, θ, Dt0 ) ≤ 0; the free energy has a minimum at the zero history of D. Time differentiation of ψ(ρ, θ, Dt0 , tr Dt ) and substitution in the entropy inequality result in

7.5 Viscous Fluids with Memory

439

˙ t , tr Dt ) − ρ dψ(ρ, θ, Dt , tr Dt |tr D ˙ t) −ρ(∂θ ψ + η)θ˙ + (ρ2 ∂ρ ψ − p)∇ · v − ρ dψ(ρ, θ, Dt0 |D 0 0 ∞



0

0

+D0 · 2 ∫ μ(s) Dt0 (s)ds + tr D ∫ κ(s)(tr Dt )(s)ds ≥ 0.

Hence we have the classical relation η = −∂θ ψ. The remaining inequality is a ˙ t0 , tr Dt ) and dψ(ρ, θ, Dt0 , tr requirement on the functional ψ. Since dψ(ρ, θ, Dt0 |D t t t t ˙ ˙ ˙ D |tr D ) are linear functionals on D0 and tr D we can look for ψ in different ways. First, let



2 ∞ 2 ψ = (ρ, θ) + β ∫ μ(s) Dt0 (s)ds + γ ∫ κ(s) tr Dt (s)ds , 0

(7.78)

0

β and γ being constants. Hence we obtain the inequality in the form ∞



˙ t0 (s)ds (ρ2 ∂ρ ψ − p)∇ · v − 2βρ ∫ μ(s)Dt0 (s)ds · ∫ μ(s)D 0

0





0

0 ∞

˙ t (s)ds −2γρ ∫ κ(s)tr Dt (s)ds ∫ κ(s)tr D ∞

+2D0 · ∫ 0

μ(s)Dt0 (s)ds

+ tr D ∫ κ(s)tr Dt (s)ds ≥ 0. 0

Let tr D = ∇ · v = 0, at any time. The inequality simplifies to ∞





0

0

0

˙ t0 (s)ds + D0 · ∫ μ(s)Dt0 (s)ds ≥ 0. −β ∫ μ(s)Dt0 (s)ds · ∫ μ(s)D Upon an integration by parts we find ∞





0

0

0

˙ t0 (s)ds = − ∫ μ(s)∂s Dt0 (s)ds = μ(0)D0 (t) + ∫ μ (s)Dt0 (s)ds. ∫ μ(s)D

Hence we have ∞





0

0

0

−βρ ∫ μ(s)Dt0 (s)ds · ∫ μ (s)Dt0 (s)ds + [1 − βρμ(0)]D0 (t) · ∫ μ(s)Dt0 (s)ds ≥ 0. ˆ t0 , are independent in that, for any value Now, D(t) and Dt , as well as D0 (t) and D t ˆ ˜ t such that D(t) and history D we can find a continuous function D ˜ ˆ D(t) = D(t),

˜ t − Dt  < , D

being as small as we please. This implies that β=

1 > 0, ρμ(0)

μ(s)μ (s) ≤ 0, s ∈ (0, ∞).

440

7 Materials with Memory

Now instead let D0 = 0. We have ∞





0

0

0

˙ t (s)ds + tr D(t) ∫ κ(s)tr Dt (s)ds ≥ 0. (ρ2 ∂ρ ψ − p)∇ · v − 2γρ ∫ κ(s)tr Dt (s)ds ∫ κ(s)tr D

Owing to the arbitrariness of ∇ · v = tr D it follows p = ρ2 ∂ρ ψ. The inequality for tr D is then exploited as the previous one for D0 . As a result, it implies that 1 , κ(s)κ (s) ≤ 0, s ∈ (0, ∞). γ= 2ρκ(0) The inequalities μμ ≤ 0 and κκ ≤ 0 are satisfied if μ (s) = −αμ μ(s) and κ (s) = ακ κ(s), where αμ , ακ > 0, and hence μ(s) = μ(0) exp(−αμ s), κ(s) = κ(0) exp(−ακ s). The free energy ∞



0

0

ψ = (ρ, θ) + 2| ∫ μ(0) exp(−αμ s)Dt0 (s)ds|2 + | ∫ κ(0) exp(−ακ s)tr Dt (s)ds|2 satisfies the minimum property, at Dt = 0, and moreover complies with the reduced inequality −ρdψ + T · D ≥ 0 for all histories Dt0 , tr Dt . An alternative scheme might be looked at with the free energy in the form ∞



0

0

ψ(ρ, θ, Dt0 , tr Dt ) = (ρ, θ) + β ∫ μ(s)|Dt0 (s)|2 ds + γ ∫ κ(s)|tr Dt (s)|2 ds which satisfies the minimum property provided β, γ > 0. However the corresponding entropy production ∞



−ρdψ + T · D = −βρμ(0)|D0 (t)|2 − βρ ∫ μ (s)|Dt0 (s)|2 + D0 (t) · ∫ μ(s)Dt0 (s)ds 0

0





0

0

−γρκ(0)|tr D(t)|2 − γρ ∫ κ (s)|tr Dt (s)|2 ds + tr D(t) ∫ κ(s)tr Dt (s)ds cannot have a positive sign for every history Dt . It is then of interest to investigate the requirements on the kernels μ, κ of the constitutive equation (7.76) without having recourse to the free energy functional.17 Still let ρ, θ, Dt the set of independent variables. Let C be a cycle occurring in the time interval [0, d], so that (ρ(0), θ(0), D0 ) = (ρ(d), θ(d), Dd ) and then ψ(d) − ψ(0) = 17

In this way we determine necessary conditions.

7.5 Viscous Fluids with Memory

441

0. If θ is kept constant then, upon integration on t ∈ [0, d], the Clausius–Duhem inequality (7.77) leads to ∞

∫ T(t) · D(t)dt ≥ 0.

(7.79)

0

Let μc , κc denote the half-range cosine transform of μ, κ ∈ L 1 (R+ ), e.g. ∞

μc (ω) = ∫ μ(s) cos ωs ds. 0

We then have the following Theorem 7.6 The constitutive stress–strain relation (7.76), at constant temperature, is consistent with (7.79) if and only if μc (ω) > 0,

κc (ω) > 0,

∀ω > 0.

(7.80)

Observe that, in isothermal processes, p is a function of ρ and then d

d

0

0

∫( p1 · D)(t)dt = − ∫[ p(ρ(t))ρ(t)/ρ(t)]dt. ˙ Let h(ρ) be a primitive of p(ρ)/ρ. Hence d

d

0

0

˙ ∫[ p(ρ(t))ρ(t)/ρ(t)]dt ˙ = ∫ h(ρ(t))dt = h(ρ(d)) − h(ρ(0)) = 0. Hence (7.79) amounts to ∞

∫ T (t) · D(t)dt ≥ 0, 0





0

0

T (t) = 2 ∫ μ(s)Dt0 (s)ds + ∫ κ(s)(tr Dt )(s)ds 1.

To prove the theorem we select periodic histories in the form D(t) = D1 cos ωt + D2 sin ωt. The proof proceeds as with viscoelastic solids with unbounded kernel, subject to ˇ = 2μI + λ1 ⊗ 1, or directly as in [196] and [158] Sect. 3.7. G∞ = 0 and G  In addition to thermodynamic consistency of the functional T , with a free energy (7.78), we now look for a work-like function. Assume that μc , κc > 0 and let ψ(t) = (ρ(t), θ(t)) + ψ w (Drt ), where ψ w (Drt ) is the work, per unit mass, performed by T (Drt ); we have

442

7 Materials with Memory t

ρψ w (Drt ) := ∫ T(Drτ ) · D(τ )dτ −∞ t





−∞

0

0

= ∫ [2D(τ ) · ∫ μ(s)D0 (τ − s)ds + tr D(τ ) ∫ κ(s)tr D(t − s)ds]dτ For technical convenience we extend the definition of μ, κ to R− by letting μ(s) = 0, κ(s) = 0, s < 0. Hence we can write t

τ

τ

−∞

−∞

ρψ w (Drt ) = ∫ [2D0 (τ ) · ∫ μ(τ − u)D0 (u)du + tr D(τ ) ∫ κ(τ − u)D0 (u)du]dτ −∞ t t

= ∫ ∫ [D0 (τ ) · μ(|τ − u|)D0 (u) + 21 tr D(τ )κ(|τ − u|)tr D(u)]du dτ −∞ −∞ ∞∞ = ∫ ∫ [Dt0 (s1 ) · μ(|s1 − s2 |)Dt0 (s2 ) + 21 tr Dt (s1 )κ(|s1 − s2 |)tr Dt (s2 )]ds1 ds2 . 0 0

By definition, ρψ˙ w = T · D. Thus the Clausius–Duhem inequality holds as an equality for ψ =  + ψ w with η = −∂θ ,

p = ρ2 ∂ρ ,

−ρψ˙ w + T · D = 0.

Since we know the functional ψ w (Drt ) we ask for the minimum property. Extend the history Dt to R− by letting D(τ ) = 0 when τ > t. Hence we have ∞







−∞ ∞

−∞

−∞

−∞

ρψ w (Drt ) = 2 ∫ D0 (τ ) · ∫ μ(τ − u)D0 (u)du dτ + ∫ tr D(τ )· ∫ κ(τ − u)tr D(u)du dτ = ∫ [2D0 (τ ) · (μ ∗ D0 )(τ ) + tr D(τ )(κ ∗ tr D)(τ )]dτ , −∞

where ∗ denotes the Fourier convolution. Consequently we apply the Parseval– Plancherel theorem to find ∞ ρψ w (Drt ) = π1 ∫ {μc (ω)(|D0c (ω)|2 + |D0s (ω)|2 ) + 2κc (ω)(|tr Dc (ω)|2 + tr Ds (ω)|2 }dω. −∞

In view of (7.80) the minimum property at zero histories follows.

7.5.3 Maxwell-Like Viscoelastic Fluids As outlined in this chapter, memory is naturally modelled by means of functionals. More elementary models of materials with memory are possible in terms of rate-type

7.5 Viscous Fluids with Memory

443

equations (see Chap. 6). Here we examine Maxwell’s model as a rate-type scheme of viscoelastic fluids. Maxwell’s model is based on the differential equation τ T˙ + T = 2μD, for the stress T = T + p1; T and D are considered at the same time (and place). Upon integration we find that ∞

T (t) = 2(μ/τ ) ∫ exp(−s/τ )D(t − s)ds 0

and then G(s) = (μ/τ ) exp(−s/τ ) is the relaxation modulus. Hence Maxwell’s model proves to be a special case of Boltzmann’s model. If D is nearly constant then ∞ T (t)  2(μ/τ ) ∫ exp(−s/τ )ds D(t) = 2μD(t), 0

as with a Newtonian fluid. Incidentally, the parameter τ is called the relaxation time and may be a measure of how long a fluid will remember.18 Let t ∗ be a characteristic time scale (the ratio of a characteristic length L and a characteristic speed U ). The quantity We =

τ t∗

is called the Weissenberg number. The bigger is the Weissenberg number (τ  t ∗ ) the more the fluid behaves like an elastic solid (T˙  2(μ/τ )D), the smaller is We T  2μD). the more the fluid behaves like a Newtonian fluid (T Both Boltzmann’s model and Maxwell’s model are generalized to obtain nonlinear properties. Relative to Maxwell’s model we first observe that the (total) time derivative is not objective. An objective time derivative is in order to replace T˙ and !

the Oldroyd derivative T , the Cotter-Rivlin derivative T , and the corotational deriva◦

tive T are of interest (for incompressible materials). Also linear superposition of any of them can be used; in the Johnson–Segalman model [236] the derivative ♦

!

T ζ = (1 − 21 ζ) T + 21 ζ T

is involved. Other non-linearities are inserted by adding objective nonlinear terms T (Pan-Thien-Tanner model). A generic superT 2 (Giesekus model) or κ(tr T )T like κT !

position (of T and T ) can be written in the form

18

This is consistent with the definition of natural time (7.74), in that  = τ .

444

7 Materials with Memory ♦



T D + DT T ) = T˙ − WT T + T W + ν(T T D + DT T ). T =T +ν(T Thermodynamic consistency of some generalized Maxwell models Memory and non-linearity are modelled via the generalized Maxwell model for T involving a generic objective time derivative. For definiteness we consider the Truesdell time derivative, $

T − T LT + (∇ · v)T T T = T˙ − LT and examine the rate equation $

T ) + α T = 2μD + λ(tr D)1. f(T

(7.81)

where f : Sym → Sym and α > 0. To investigate the thermodynamic consistency of (7.81) we restrict attention to non-conducting fluids and write the entropy inequality in the form ˙ + T · D ≥ 0. −ρ(ψ˙ + η θ) We let ρ, θ, T , D be the independent variables. The free energy ψ is (continuously) differentiable while η is continuous. Time differentiation of ψ results in ˙ + T · D ≥ 0. −ρ(∂θ ψ + η)θ˙ + ( p − ρ2 ∂ρ ψ)∇ · v − ρ∂T ψ · T˙ − ρ∂D ψ · D Upon substitution of $

T˙ =T +LT + T LT − (∇ · v)T =

1 [2μD + λ(∇ · v)1 − f(T )] + LT + T LT − (∇ · v)T α

we obtain the inequality ρλ 2ρμ T − tr ∂T ψ + ρ∂T ψ · T ]∇ · v + (T ∂T ψ) · D −ρ(∂θ ψ + η)θ˙ + [ρ2 ∂ρ ψ − p − α α ρ ˙ ≥ 0. T ) − ρ∂T ψ · [(D + W)T T + T (D − W)] − ρ∂D ψ · D + ∂T ψ · f(T α

˙ and θ˙ implies that The arbitrariness of D ∂D ψ = 0,

η = −∂θ ψ.

Observe that T − T W] = ρ[∂T ψT T − T ∂T ψ] · W. ρ∂T ψ · [WT

7.5 Viscous Fluids with Memory

445

Since T − T ∂T ψ ∈ Skw ∂T ψT then the arbitrariness of W ∈ Skw implies that T − T ∂T ψ = 0. ∂T ψT

(7.82)

Since ψ is independent of D, letting D = 0 we find T ) ≥ 0. ∂T ψ · f(T

(7.83)

Now only linear terms in D are left. By using the decomposition D = D0 + 13 (tr D)1 we can write [ρ2 ∂ρ ψ − p −

ρλ 2ρμ T − tr ∂T ψ + ρ∂T ψ · T + 13 tr (T ∂T ψ − 2ρ∂T ψ T )]∇ · v α α 2ρμ T − +[T ∂T ψ − 2ρ∂T ψ T ] · D0 ≥ 0. α

The arbitrariness of ∇ · v and D0 implies ρ2 ∂ρ ψ − p −

ρλ 2ρμ T − tr ∂T ψ + ρ∂T ψ · T + 13 tr (T ∂T ψ − 2ρ∂T ψ T ) = 0, α α (7.84) 2ρμ ∂T ψ − 2ρ∂T ψ T = 0. T − (7.85) α

Hence the admissible free energy functions are subject to (7.82)–(7.85). If p is a function of ρ and θ we may take p = ρ2 ∂ρ ψ. Otherwise we might let (7.84) be the definition of p.



A simpler scheme arises by considering the corotational derivative T and replacing (7.81) by ◦

T ) + α T = 2μD + λ(tr D)1. f(T

(7.86)

By paralleling the previous derivation we find again that ∂D ψ = 0,

η = −∂θ ψ,

T − T ∂T ψ = 0, ∂T ψT

T ) ≥ 0. (7.87) ∂T ψ · f(T

Consequently it follows that ρ2 ∂ρ ψ − p −

ρλ 2ρμ T − tr ∂T ψ + 13 tr (T ∂T ψ) = 0, α α

T − (T

2ρμ ∂T ψ)0 = 0. α

446

7 Materials with Memory

Letting p = ρ2 ∂ρ ψ − we obtain T −

ρλ tr ∂T ψ α

2ρμ ∂T ψ = 0. α

The obvious integration results in ψ(ρ, θ, T ) = (ρ, θ) +

α T ·T . 4ρμ

Moreover, the inequality in (7.87) holds if T ) = β(ρ, θ, T )T T, f(T

β/μ ≥ 0.

An alternative scheme arises by considering the Lagrangian description of a generalized Maxwell model. Since T = − p1 + T then we obtain the second Piola stress in the form T R R = −J pC−1 + J F−1T F−T . For formal convenience let

T R R := J F−1T F−T .

The power per unit volume then becomes ˙ = − 1 J p C−1 · C ˙ + 1 T R R · C. ˙ T · D = 21 T R R · C 2 2 The stress T R R is assumed to satisfy a rate equation ˙ τ T˙ R R + f(θ, C, T R R ) = λ(θ)C.

(7.88)

˙ is a nonlinear description of a viscous material. If τ = 0 then f(θ, C, T R R ) = λ(θ)C ˙ If τ T R R prevails then the equation can be viewed as representative of a thermoelastic material T R R  (C − 1). We let θ, C, T R R be the independent variables, T˙ R R being determined by the rate equation (7.88). Hence we consider the constitutive equation ψ = ψ(θ, C, T R R ) and the like for η and p. We then investigate the restrictions placed by the entropy inequality in the Lagrangian description, ˙ + 1 TR R · C ˙ ≥ 0. −ρ R (ψ˙ + η θ) 2

7.5 Viscous Fluids with Memory

447

Upon evaluation of ψ˙ and substitution of T˙ R R from (7.88) we have −ρ R (∂θ ψ + η)θ˙ + ( 21 T

RR

ρR λ ˙ + ρ R ∂T ψ · f ≥ 0. − 21 J pC−1 − ρ R ∂C ψ − ∂T R R ψ) · C RR τ

˙ implies that The arbitrariness of θ˙ and C η = −∂θ ψ,

1 T 2

RR

− 21 J pC−1 − ρ R ∂C ψ −

ρR λ ∂T R R ψ = 0, τ

ρ R ∂T R R ψ · f ≥ 0.

(7.89) (7.90)

So far the pressure p is undetermined; given p, Eq. (7.89)2 is a relation between ψ and T R R . For definiteness, let ψ depend on C via (det C)1/2 = det F = J. Hence ∂C ψ = ∂ J ψ ∂C J, Consequently we find

∂C J = 21 (det C)−1/2 ∂C det C =

1 (det C)C−1 = 21 J C−1 . 2J

−ρ R ∂C ψ = −ρ R ∂ J ψ 21 J C−1 .

In addition, since J = ρ R /ρ then ∂ J ψ = −∂ρ ψ ρ R /J 2 . Hence we obtain 1 2

J pC−1 = 21 ρ2 J ∂ρ ψ C−1

whence it follows the classical relation p = ρ2 ∂ρ ψ. The function ψ is subject to 1 T 2

RR



ρR λ ∂T R R ψ = 0 τ

and (7.90). In view of (7.91), inequality (7.90) holds if T RR, f = T /λ being a fourth-order positive definite tensor. In particular we may have

(7.91)

448

7 Materials with Memory

 = α1 ⊗ 1. Hence if T˙ R R = 0 then T RR =

λ˙ C, α

˙ ≥ 0, T RR · C

which characterizes a dissipative model. Further, upon integration of (7.91) we find ψ=

τ T RR · T RR. 4ρλ

Equation (7.88) is now considered in the linear case, ˙ T R R = λC, τ T˙ R R + αT τ , α, λ being constants or known functions of time. Upon integration we find that T

t

RR

= ∫

−∞

t α ∞ λ t λ ˙ ˙ − s) exp[− ∫ α (ξ)dξ]ds. (u)C(u) exp[− ∫ (ξ)dξ]du = ∫ (t − s)C(t τ u τ t−s τ 0 τ

Hence it follows that ∞ λ t 1 ˙ − s) exp[− ∫ α (ξ)dξ]ds FT (t). F(t) ∫ (t − s)C(t J (t) t−s τ 0 τ (7.92) ˙ = 2FT DF then we find Since C

T(t) = − p(t)1 +

T(t) = − p(t)1 +

t α 1 ∞λ ∫ (t − s)F(t)FT (t − s)D(t − s)F(t − s)FT (t) exp[− ∫ (ξ)dξ]ds. J (t) 0 τ t−s τ

Upon an integration by parts the integral (7.92) can be written as T(t) = − p(t)1 + −

λ (t)F(t)[C(t) − 1]FT (t) Jτ

∞ λ t α α 1 F(t) ∫ [( )˙+ ](t − s) exp[− ∫ (ξ)dξ](C − 1)(t − s)ds FT (t). J (t) τ t−s τ 0 τ

˙ Equation (7.92) determines the Cauchy stress T in terms of the history of C. Linear approximations follow by letting J = 1 and F = 1. In that case ∞

T(t) = − p(t)1 + 2 ∫ 0

We can then view

t α λ (t − s) exp[− ∫ (ξ)dξ]Dt (s)ds. τ t−s τ

t α λ m(t, s) = 2 (t − s) exp[− ∫ (ξ)dξ] τ t−s τ

7.5 Viscous Fluids with Memory

449

as the memory function and conclude that ∞

T(t) = − p(t)1 + ∫ m(t, s) Dt (s)ds. 0

It is worth mentioning the BKZ constitutive equation [36], t

F −1 ∂E U F −T ](s)ds T(t) = − p(t)1 + ∫ [F −∞

where F is the relative deformation gradient, Fi j (s) = Fi K (s)FK−1j (t), and U is a function of the strain E = 21 (C − 1). No thermodynamic setting seems to have been established for the BKZ equation. The stress is assumed to be a linear functional of the strain history and the strain is measured with the current configuration as reference.

7.5.4 Acceleration Waves in Viscous Fluids with Memory Here the properties of acceleration waves in viscous fluids are investigated. For definiteness and simplicity the fluid is assumed to be at a constant temperature and hence the pertinent balance equations are ρ˙ = −ρ∇ · v,

ρ˙v = −∇ p(ρ) + ∇ · T .

Two constitutive models for the (viscous) Cauchy stress T are considered in two cases according as the fluid is modelled by a rate-type equation or by a Boltzmann-like equation, ◦





0

0

T + α T = 2μD + λ(tr D)1, T = 2 ∫ μ(s)D(t − s)ds + ∫ λ(s)tr D(t − s)ds. (7.93) The pressure p and the scalars α, μ, ν are allowed to be parameterized by the temperature θ. We now investigate the compatibility of this scheme with wave propagation at a finite speed. Let σ be a time-dependent surface, in the current configuration , and denote by n the unit normal. The surface σ is a second-order wave (or acceleration wave) if – v, ρ are continuous functions of x and t in a neighbourhood N of σ, – v˙ , L, ρ, ˙ ∇ρ and all higher-order derivatives suffer jump discontinuities across σ but are continuous functions of x and t in N \ σ.

450

7 Materials with Memory

Since [[v]] = 0 then Poisson’s condition requires that [[ p]] = 0,

T ]] = 0. [[T

The continuity of p is consistent with the assumption p = p(ρ), the continuity of T is also a consequence of Sect. 2.17.6 if the Boltzmann-like equation is chosen. Let  := [[∂n ρ]] a := [[∂n v]], where ∂n = n · ∇ and denote by U = u n − v · n the local speed of propagation. The Hugoniot compatibility conditions and Maxwell’s theorem imply that [[ρ]] ˙ = −U [[∂n ρ]], [[D]] = 21 (a ⊗ n + n ⊗ a),

[[˙v]] = −U [[∂n v]],

[[W]] = 21 (a ⊗ n − n ⊗ a),

[[∇ · v]] = a · n.

Maxwell-like models The discontinuities a,  are subject to the balance equations. From the continuity equation ρ˙ = −ρ∇ · v it follows U  = ρa · n. Now, by Maxwell’s theorem [[∇ p]] = p [[∇ρ]] = p  n. Moreover, [[∇ · T ]] = [[∂n T ]]n = −

1 ˙ [[T ]]n. U

Hence the equation of motion ρ˙v = −∇ p + ∇ · T implies that ρU 2 a = ρ p (a · n)n + [[T˙ ]]n.

(7.94)

To compute [[T˙ ]] we look at the constitutive equation (7.93)1 to obtain λ 2μ T − T [[W]] + [[D]] + [[tr D]]1 [[T˙ ]] = [[W]]T α α whence λ μ [[T˙ ]] = 21 (a ⊗ n − n ⊗ a)T − 21 T (a ⊗ n − n ⊗ a) + (a ⊗ n + n ⊗ a) + (a · n)1. α α

Upon substitution we find the propagation condition

7.5 Viscous Fluids with Memory

451

λ μ T n − 21 T (a − (a · n)n) + (a + (a · n)n) + (a · n)n. ρU 2 a = ρ p (a · n)n + 21 (a ⊗ n − n ⊗ a)T α α

Solutions for the vector a occur depending on T and hence on the state ahead of the wave. Two cases are considered. First let T = β1. Hence T n − T (a − (a · n)n) = 0 (a ⊗ n − n ⊗ a)T and the equation for a reduces to ρU 2 a = ρ p (a · n)n +

λ μ (a + (a · n)n) + (a · n)n. α α

Two solutions occur. Longitudinal waves, where a ∝ n, happen with U 2 = p +

2μ + λ . ρα

Transverse waves, where a · n = 0, happen with U2 =

2μ . ρα

The second case is T = T n ⊗ n, which represents a normal traction along n. Hence T n − T (a − (a · n)n) = T a⊥ , (a ⊗ n − n ⊗ a)T where a⊥ = a − (a · n)n. The propagation equation takes the form ρU 2 a = ρ p (a · n)n + T a⊥ +

λ μ (a + (a · n)n) + (a · n)n. α α

Longitudinal waves still propagate with U 2 = p + (2μ + λ)/ρα whereas transverse waves happen with 2μ + λ T . (7.95) U 2 = p + + ρ ρα Hence Eq. (7.93)1 allows for the propagation of acceleration waves, the parameters μ, λ showing a role analogous to that of Lamé moduli in elasticity. The propagation mode is affected by the stress T ahead of the front σ and hence depends on the chosen derivative. As with (7.95) where T /ρ occurs, the speed U and the polarization a are affected by the type of objective time derivative as it follows with the generic T + T W + ν(DT T + T D). derivative T˙ − WT Boltzmann-like models

452

7 Materials with Memory

T ]] = 0 Let T be given by the Boltzmann-like relation (7.93)2 . We observe that [[T while [[T˙ ]] = 2μ(0)[[D]] + λ(0)[[tr D]]1 = μ(0)(a ⊗ n + n ⊗ a) + λ(0)(a · n)1 and

[[T˙ ]]n = μ(0)[a + (a · n)n] + λ(0)(a · n)n.

Substitution in (7.94) yields ρU 2 a = ρ p (a · n)n + μ(0)[a + (a · n)n] + λ(0)(a · n)n. Longitudinal waves, a ∝ n, happen with U 2 = p +

2μ(0) + λ(0) ρ

while transverse waves, a · n = 0, happen with U2 =

μ(0) . ρ

Measurements of wave propagation speeds allow us to determine the initial values μ(0), λ(0) of the relaxation functions.

7.6 Electromagnetic Materials with Memory We consider polarizable and magnetizable undeformable solids acted upon by electromagnetic fields. For simplicity the solid is taken to be macroscopically rigid so that we can select the reference frame at rest with the body. The constitutive properties are expressed by letting the polarization P, the magnetization M, and the density current J depend on the whole history of the electric field E and the magnetic field H. Moreover, thermal properties are modelled by letting the constitutive functions depend on the histories θt and ∇θt . For the sake of generality we let  = (θ, E, H, θrt , Ert , Hrt , ∇θrt ) be the set of variables for the constitutive functionals of the free energy ψ, the entropy η, the polarization P, the magnetization M, the electric current J, the heat flux q, and the extra-entropy flux k. The constitutive functionals are required to be consistent with the second-law inequality (2.85), that is

7.6 Electromagnetic Materials with Memory

453

˙ + E · P˙ + μ0 H · M ˙ + E · J − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) θ For technical convenience we consider the free energy density19 φ given by ρφ = ρψ − E · P − μ0 H · M. Hence, since ρ is constant then we obtain the inequality in the form ˙ − P · E˙ − μ0 M · H ˙ + E · J − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(φ˙ + η θ) θ We now derive some thermodynamic restrictions placed on the constitutive functionals φ, η, P, M, J, q, k. We let φ be a differentiable functional of ; the entropy inequality can be written in the form ˙ + E · J − 1 q · ∇θ + θ∇ · k −ρ(∂θ φ + η)θ˙ − (ρ∂E φ + P) · E˙ − (ρ∂H φ + μ0 M) · H θ ˙ rt ) − ρdφ(|∇ θ˙ rt ) ≥ 0. −ρdφ(|θ˙ rt ) − ρdφ(|E˙ rt ) − ρdφ(|H

Formally, the extra-entropy flux k might depend on θ and θrt so that ∇ · k = ∇ · k(θ, θrt ); other dependences on  would be ruled out by the entropy inequality in view of the arbitrariness of ∇E, ∇H, ∇Ert , ∇Hrt . Yet the dependence of the vector k on the scalars θ and θrt is ruled out unless anisotropic material vectors occur. Hence we let k = 0. As for the degree of arbitrariness of the electromagnetic fields we observe that Maxwell’s equations are constraints on the admissible fields. Now ∇ × H = J + 0 ∂t E + ∂t P,

∇ × E = −∂t B

˙ are allowed provided ∇ × H show that arbitrary values of ∂t E = E˙ and ∂t H = H and ∇ × E satisfy the two equations. We can then assume that at a point x and at a time t the vectors ∇ × H and ∇ × E make Maxwell’s equations to hold for arbitrarily ˙ Furthermore, the set of variables  does not involve gradients of E selected E˙ and B. ˙ H ˙ is consistent and H. As a consequence the selection of arbitrary (vector) values E, with Maxwell’s equations and meanwhile does not affect the variables in . ˙ E, ˙ H ˙ implies that The arbitrariness of θ, η = −∂θ φ,

P = −ρ∂E φ,

μ0 M = −ρ∂H φ.

This function is considered also in [97, 98] where it is denoted by ζ and called free enthalpy density.

19

454

7 Materials with Memory

Hence it follows the dissipation inequality 1 E · J − q · ∇θ − ρdφ(|θ˙rt ) − ρdφ(|E˙ rt ) − ρdφ(|B˙ rt ) − ρdφ(|∇ θ˙rt ) ≥ 0. θ For definiteness particular models are now investigated.

7.6.1 Dielectrics with Memory A dielectric is a continuum that can be polarized by an applied electric field. When a dielectric is placed in an electric field, electric charges do not flow through the material as they do in an electrical conductor but only slightly shift from their average equilibrium positions causing dielectric polarization. We let J = 0 as with electrical insulators.20 It is assumed that M = 0, q = 0, and that the constitutive dependence on the field H and the temperature gradient ∇θ is negligible. Hence  = (θ, E, θrt , Ert ) is the set of independent variables and the constitutive properties are subject to −ρφ˙ − ρη θ˙ − P · E˙ ≥ 0. Consequently we find that η = −∂θ φ,

P = −ρ∂E φ,

dφ(|θ˙rt ) + dφ(|E˙ rt ) ≤ 0.

Moreover, given two histories θt0 , Et0 consider the constant continuations θt , Et , as θt (s) =

 θ(t ), s ∈ [0, t − t0 ), 0 θt0 (s − t + t0 ), s ∈ [t − t0 , ∞),

and the like for Et . Hence ˙ ) ≤ 0, τ ∈ (t0 , t); φ(τ

φ(t) ≤ φ(t0 ).

As t − t0 → ∞ we have φ(θ† (t0 ), E† (t0 )) ≤ φ(θt0 , Et0 ). Among the histories ending with the values θ(t0 ), E(t0 ) the constant histories make φ minimum. In the simplest model, (linear) dielectrics are characterized by the constitutive equation P = 0 χE 20

Faraday introduced the term dielectric as synonymous with insulator.

7.6 Electromagnetic Materials with Memory

455

where 0 is the permittivity of free space and χ is the dielectric susceptibility. Hence D = 0 r E, where r = 1 + χ is the relative permittivity. In anisotropic dielectrics the permittivity is a second-order tensor. To fix ideas we model dielectrics as linear materials with memory in the form ∞

P(t) =  0 E(t) + ∫  (s)E(t − s)ds,

(7.96)

0

where  0 and  are allowed to depend on the (present) temperature θ(t). Since −ρ∂E ∂E φ =  0 then it follows that  0 ∈ Sym. We assume that  ∈ Sym too and that lim  (s) = 0.

s→∞

We might observe that, by (7.96), ∞

ˆ t) ρφ = 21 E ·  0 E + E · ∫ E(t − s)ds + ρφ(E 0

and determine φˆ so that φ is consistent with thermodynamics. Instead, more conveniently, let ∞

 ∞ =  0 + ∫  (s)ds ∈ Sym 0

and write (7.96) in the form ∞

P(t) =  ∞ E(t) − ∫  (s)[E(t) − E(t − s)]ds. 0

Then ρφ can be taken in the form ρφ = − 21 E ·  ∞ E +

1 2



∫ [E(t) − E(t − s)] ·  (s)[E(t) − E(t − s)]ds + ρφ˜ 0

where ρφ˜ depends on θ. The functional φ is required to satisfy the minimum property at constant histories and (7.97) dφ(|E˙ rt ) ≤ 0. The minimum property holds if and only if

456

7 Materials with Memory

 ≥ 0. Now notice that ∞

˙ − s) ·  (s)[E(t) − E(t − s)]ds. ρdφ(|E˙ rt ) = − ∫ E(t 0

An integration by parts and the observation that ∞ [E(t) − E(t − s)] ·  (s)[E(t) − E(t − s)] 0 = 0 result in ∞

ρdφ = − ∫ ∂s [E(t) − E(t − s)] ·  (s)[E(t) − E(t − s)]ds 0

=

1 2



∫ [E(t) − E(t − s)] ·  (s)[E(t) − E(t − s)]ds. 0

In view of (7.97) it follows that

 ≤ 0.

Hence, by s

 (s) =  (0) + ∫  (ξ)dξ 0

it follows that

s

 (0) = − ∫  (ξ)dξ > 0 0

and  is a non-negative, decreasing tensor. Furthermore s

(s) = (0) + ∫  (ξ)dξ 0

implies that ∞ > 0. It is worth remarking the analogy and a difference between viscoelasticity and dielectrics. A strictly formal analogy holds between stress–strain (7.29) and polarizationelectric field (7.96) with the correspondence G ↔ . Nevertheless we have G∞ < G0 ,

∞ > 0.

These different inequalities follow from the terms ˙ T · E,

˙ −P · E,

7.6 Electromagnetic Materials with Memory

457

and hence with different signs, in the entropy inequality.

7.6.2 Conductors with Memory Both electric and thermal conduction are considered and the conductor is modelled by letting  = (θ, E, ∇θ, Ert , ∇θrt ) be the independent variables so that21 ψ, η, J, q are given by functionals of . The second-law inequality becomes 1 −ρ(∂θ ψ + η)θ˙ − ρ∂E ψ · E˙ − ρ∂∇θ ψ · ∇ θ˙ + E · J − q · ∇θ − ρdψ(|E˙ rt ) − ρdψ(|∇ θ˙ rt ) ≥ 0. θ

˙ and θ˙ implies ˙ ∇ θ, The arbitrariness of E, ∂E ψ = 0,

∂∇θ ψ = 0,

η = −∂θ ψ.

Hence it follows the dissipation inequality 1 E · J − q · ∇θ − ρdψ(|E˙ rt ) − ρdψ(|∇ θ˙rt ) ≥ 0. θ

(7.98)

We remark that the occurrence of E and ∇θ, instead of time derivatives, does not allow ψ to enjoy the minimum property at constant histories. We now look for possible functionals associated with J, q, and ψ. First observe that ψ cannot depend on the present values E, ∇θ. Moreover functionals in the form of a quadratic integral (in Et ) as ργ(Et ) =

1 2



∫ Et (s) · (s)Et (s)ds, 0

(∞) = 0,

and the like for q, would result in the inequality E · (J − 21 (0)E) −

1 2



∫ Et (s) ·  (s)Et (s)ds ≥ 0 0

which holds if J = 21 (0)E,

 (s) ≤ 0,

possibly with (0) ≥ 0. Though this is thermodynamically consistent, the constitutive equation for J does not model a material with memory. 21

Since P = 0 then φ = ψ.

458

7 Materials with Memory

We then let  0 , K0 ∈ Sym be positive definite and look for linear functionals of Et and ∇θt ,

1/2 ∞

2

1/2 ∞

2 ρψ = ρψ0 (θ) + 21 α(θ)  0 ∫ λ(s)Et (s)ds + 21 β(θ) K0 ∫ ν(s)∇θt (s)ds , 0

0

(7.99) where λ, ν are differentiable, λ(∞), ν(∞) = 0 while α and β are positive-valued. Let ρdψ stand for ρdψ(|E˙ rt ) + ρdψ(|∇ θ˙rt ). Upon integrations by parts we find ∞



0

0





ρdψ = α ∫ λ(u)E˙ t (u)du ·  0 ∫ λ(s)Et (s)ds + β ∫ ν(u)∇ θ˙ t (u)du · K0 ∫ ν(s)∇θt (s)ds 0



0



= α{λ(0)E + ∫ λ (u)Et (u)du} ·  0 ∫ λ(u)Et (u)du 0 ∞

0 ∞ t +β{ν(0)∇θ + ∫ ν (u)∇ θ˙ (u)du} · K0 ∫ ν(s)∇θt (s)ds. 0 0

Hence we can write the dissipation inequality (7.98) in the form ∞



E · {J − αλ(0) 0 ∫ λ(s)Et (s)ds} − ∇θ · {θ−1 q + βν(0)K0 ∫ ν(s)∇θt (s)ds}

0 0 ∞ ∞ ∞ t t t −α ∫ λ (s)E (s)ds ·  0 ∫ λ(s)E (s)ds − β ∫ ν (s)∇θ (s)ds ·  0 ∫ ν(s)∇θt (s)ds ≥ 0. 0 0 0 0 ∞

This inequality holds if and only if ∞

J = αλ(0) 0 ∫ λ(s)Et (s)ds,

(7.100)

0



q = −θβν(0)K0 ∫ ν(s)∇θt (s)ds,

(7.101)

0

λ λ ≤ 0,

ν ν ≤ 0.

(7.102)

Inequalities (7.102) imply that (λ2 ) ≤ 0,

(ν 2 ) ≤ 0,

and hence that λ and ν do not change sign and that |λ|, |ν| are decreasing functions. This in turn implies that λ(0)λ(s) > 0,

ν(0)ν(s) > 0.

When E and ∇θ are slowly variable in time we have

7.6 Electromagnetic Materials with Memory

459

J  a(θ) 0 E,

q  −b(θ)K0 ∇θ,

where ∞



a(θ) = α(θ)λ(0) ∫ λ(s)ds > 0,

b(θ) = θβ(θ)ν(0) ∫ ν(s)ds > 0.

0

0

Nonlinear electric conductors Borrowing from the previous section we now look for nonlinear models; for simplicity we restrict attention to electric conductors (and hence q = 0). Let ∞

1/2 w =  0 ∫ λ(s)Et (s)ds, 0

where Sym (  0 > 0,

λ(s) → 0 as s → ∞.

We look for the free energy in terms of w as an appropriate polynomial form, viz. ρψ = ρψ0 (θ) +

n k=1

αk (θ)

1 (w · w)k , 2k

αk ≥ 0.

Hence ψ = ψ(θ, Ert ) and the entropy inequality is −ρ(∂θ ψ + η)θ˙ + E · J −

n k=1

˙ ≥ 0. αk (θ)|w|2(k−1) w · w

An integration by parts yields ∞ ∞ ∞ 1/2 ˙ − s)ds =  1/2 ˙ =  0 ∫ E(t w 0 {− λ(s)E(t − s) 0 + ∫ λ (s)E(t − s)ds} 0

0 ∞

1/2 1/2 =  0 λ(0)E(t) +  0 ∫ λ (s)E(t − s)ds. 0

Hence we can write the entropy inequality in the form  1/2 −ρ(∂θ ψ + η)θ˙ + E · (J − nk=1 αk (θ)λ(0)|w|2(k−1) w 0 ) · E ∞ ∞  − nk=1 αk (θ)|w|2(k−1) ∫ λ(s)Et (s)ds ·  0 ∫ λ (u)Et (u)du ≥ 0 0

0

The arbitrariness of θ˙ and E implies η = −∂θ ψ,

J=

n

k=1 αk (θ)λ(0)|w|

2(k−1)



 0 ∫ λ(s)Et (s)ds, 0

460

7 Materials with Memory

n k=1





0

0

αk (θ)|w|2(k−1) ∫ λ(s)Et (s)ds ·  0 ∫ λ (u)Et (u)du ≤ 0.

The inequality holds if λλ ≤ 0. The density current J then shows a dependence on the history of E in the form J=

n

k=1 αk (θ)λ(0)|







0

0

0

∫ λ(s)Et (s)ds ·  0 ∫ λ(u)Et (u)du|k−1  0 ∫ λ(v)Et (v)dv. (7.103)

If n = 1 then (7.103) simplifies to (7.100). Waves in rigid conductors Evolution equations in rigid conductors comprise Maxwell’s equations, in the form ∇ · E = q/ 0 , ∇ · H = 0,

∇ × H = J + 0 ∂t E, ∇ × E + μ0 ∂t H = 0,

and the balance of energy ρε˙ = E · J − ∇ · q + ρr. With the given functional (7.99) for the free energy ψ we find

1/2 ∞

2 ρε = ρ(ψ + θη) = ρ(ψ0 − θψ0 ) + 21 (α − θα )  0 ∫ λ(s)Et (s)ds

0

+ 21 (β

1/2 ∞

2 − θβ ) K0 ∫ ν(s)∇θt (s)ds .

0

Hence it follows

2

2

1/2 ∞

1/2 ∞ ρε˙ = −ρθψ0 θ˙ − 21 θα θ˙  0 ∫ λ(s)Et (s)ds − 21 θβ θ˙ K0 ∫ ν(s)∇θt (s)ds

0

0





+(α − θα ) ∫ λ(u)E˙ t (u)du ·  0 ∫ λ(s)Et (s)ds 0





0

˙t



+(β − θβ ) ∫ ν(u)∇ θ (u)du · K0 ∫ ν(s)∇θt (s)ds. 0

0

We now look at solutions in the form of discontinuity waves characterized as follows. Let σ be a time-dependent surface in the region  and denote by n the unit normal. The surface σ is a discontinuity wave such that E, H, θ are continuous functions of x and t in a neighbourhood N of σ, ˙ ∇E, ∇H, ∇θ and all higher-order derivatives suffer jump discontinuities ˙ H, ˙ θ, E, across σ but are continuous functions of x and t in N \ σ.

7.6 Electromagnetic Materials with Memory

461

Consistent with the model of conductors we let q = 0. Maxwell’s equations and the geometrical conditions of compatibility imply n · [[∂n E]] = 0,

˙ n × [[∂n H]] = μ0 [[J]] + μ0 0 [[E]],

(7.104)

˙ = 0. n × [[∂n E]] + μ0 [[H]]

(7.105)

n · [[∂n H]] = 0,

We assume that [[r ]] = 0 and hence the balance of energy implies ρ[[ε]] ˙ = E · [[J]] − [[∇ · q]].

(7.106)

By (7.100), the continuity of E and θ across σ implies the continuity of J. Though ∇θ suffers a jump discontinuity the integral in (7.101) is continuous and hence q is t continuous too. Moreover, despite the jump of ∇θ the integral ∫∞ 0 ν(s)∇θ (s)ds is continuous. Hence, upon an integration by parts we obtain ∞

[[ ∫ ν(s)∇ θ˙t (s)ds]] = ν(0)[[∇θ]]. 0

Now, by (7.101) it follows that ∞



0

0

t ˙ ˙t q˙ = −(β + θβ )θν(0)K 0 ∫ ν(s)∇θ (s)ds − θβν(0)K0 ∫ ν(s)∇ θ (s)ds.

Thus we obtain

˙ − K[[∇θ]], ˙ = −h[[θ]] [[q]]

where



h = (β + θβ )ν(0)K0 ∫ ν(s)∇θt (s)ds, 0

K = θβν 2 (0)K0 .

Likewise, from the expression of ρε˙ it follows ˙ + γ · [[∇θ]], ρ[[ε]] ˙ = ξ[[θ]] where

2

2

1/2 ∞

1/2 ∞ ξ := −ρθψ0 − 21 θα  0 ∫ λ(s)Et (s)ds − 21 θβ K0 ∫ ν(s)∇θt (s)ds , 0

0



γ = (β − θβ )ν(0)K0 ∫ ν(s)∇θt (s)ds. 0

We now apply the kinematical conditions of compatibility, [[ f˙]] = −U [[∂n f ]],

˙ · n = −U [[∇ · w]] [[w]]

462

7 Materials with Memory

if f and w are continuous across σ, to obtain the equations for, e.g. [[∂n H]] and [[∂n θ]]. As a consequence of (7.104) and (7.105) we have ˙ = −U 0 n × [[∂n E]] = −U 2 μ0 0 [[∂n H]], n × (n × [[∂n H]]) = −[[∂n H]] = 0 n × [[E]] whence (1 − U 2 μ0 0 )[[∂n H]] = 0. √ A nonzero discontinuity [[∂n H]] can propagate at the speed U = 1/ μ0 0 . Thus, the propagation condition for the electromagnetic discontinuities is not affected by the thermal conduction. To investigate the existence of temperature-rate discontinuities, we consider Eq. (7.106) and apply the geometrical and kinematical conditions of compatibility to obtain ˙ + n · K[[∇θ]]) ˙ + γ · [[∇θ]] = −[[∇ · q]] = U −1 [[q]] ˙ · n = −U −1 (h · n[[θ]] ξ[[θ]] whence (ξU 2 + 2p · n U − n · Kn)[[∂n θ]] = 0, where



p = 21 (h − γ) = θβ ν(0)K0 ∫ ν(s)∇θt (s)ds. 0

Since K is positive definite, nonzero discontinuities [[∂n θ]] can propagate at any of the two (positive) speeds U=

1 ( n · Kn + (p · n)2 ± p · n). ξ

The past history ∇θrt affects the value of U via p · n. The vector p is zero if the region ahead is kept at a uniform temperature up to the arrival of the wave or if β is constant. In these cases only one speed is allowed for the temperature-rate wave. The electric field affects the value of U via the value of ξ. If α and β are constants then the result for U coincides with that for the model in Sect. 7.3.1.

7.6.3 Polarizable Conductors with Memory Simple physical pictures of polarization and magnetization may serve as an introduction to the modelling of electromagnetic materials with memory. Following the Lorentz model of polarization, while the electronic cloud of an atom can be regarded as centred at the atomic nucleus, as a consequence of an electric field E the mean position of the cloud is displaced at d relative to the nucleus. Let m be the mass of

7.6 Electromagnetic Materials with Memory

463

an electron. The restoring force is denoted by −mω02 d and −mγ d˙ is the frictional force. Hence the equation of motion becomes m d¨ + mγ d˙ + mω02 d = −qE By Fourier transform, f (t)





f F (ω) = ∫ f (t) exp(−iωt)dt, −∞

we obtain m(−ω 2 + iωγ + ω02 )d(ω) = −qE(ω) and hence d(ω) = −

E(ω) q , 2 m ω0 − ω 2 + iωγ

where d(ω) and E(ω) stand for the Fourier transforms. Hence we get the dipole moment q2 E(ω) p (ω) = −q d(ω) = . m ω02 − ω 2 + iωγ Let α(ω) be the polarizability of a single atom so that p (ω) = α(ω)E(ω),

α(ω) =

q2 1 . 2 m ω0 − ω 2 + iωγ

Letting P(ω) = npp(ω), n being the number density of atoms, we have P(ω) =

1 q2 E(ω). 2 m ω0 − ω 2 + iωγ

Hence the relative permittivity is a function of the frequency ω. Drude’s model involves a non-interacting electron gas that flows in a metal with motionless positive ions. The equation of motion of electrons is m v˙ + mγv = −qE. Hence v(ω) = −

q E(ω) m iω + γ

464

7 Materials with Memory

and the conduction current J reads J(ω) = −nqv =

nq2 1 E(ω) m iω + γ

thus showing a frequency-dependent conductivity. In a non-polarizable material the displacement current Jd is given by Jd = ∂t D = 0 ∂t E. Hence the total current is expressed by (J + Jd )(ω) = [iω 0 +

nq2 1 ]E(ω). m iω + γ

The general feature of these models, as well as of more involved ones, is that the physical properties like permittivity and conductivity are frequency dependent. This feature holds in materials with memory too and then it is of interest to examine connections between materials with memory and frequency-dependent constitutive properties. As an example, let ∞

J(t) = ∫ σ(s)E(t − s)ds. 0

By Fourier transform we have ∞





−∞

−∞

0

∫ J(t) exp(−iωt)dt = ∫ dt exp(−iωt) ∫ σ(s)E(t − s)ds = σ F (ω)E F (ω),

where



σ F (ω) = ∫ σ(s) exp(−iωs)ds. 0

The kernel σ on R+ is associated with a frequency-dependent transform σ F : R → C. We consider a rigid body. Hence the mass density ρ is constant and the balance of energy simplifies to ˙ · H − ∇ · q + ρr. ρε˙ = E · J + P˙ · E + μ0 M From the standard entropy inequality ρη˙ ≥ −∇ · (q/θ + k) + ρr/θ it follows ˙ · H + E · J − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(ε˙ + θη) ˙ + P˙ · E + μ0 M θ In terms of the free energy φ = ε − θη − (P · E + μ0 M · H)/ρ the entropy inequality can be written in the form

7.6 Electromagnetic Materials with Memory

465

˙ − P · E˙ − μ0 M · H ˙ + E · J − 1 q · ∇θ + θ∇ · k ≥ 0. − ρ(φ˙ + η θ) θ

(7.107)

Denote by  the region occupied by the body. Let  be the 4-tuple  = (E, H, θ, ∇θ) whose values are functions of x ∈  and t ∈ R. Denote by H the vector space whose elements are the pairs of present values and past histories ((t), rt ), for any t ∈ R, the dependence of  on x ∈  being understood. For ease in writing we suppress the suffix r and let  t denote the past history. The vector space H is endowed with the L 2 norm ∞  1/2 , ((t),  t ) =  2 (t) + ∫ [ t (s)]2 h(s)ds 0

where  2 = E2 + H2 + θ2 + (∇θ)2 and the like for [ t (s)]2 while h is positive and integrable on (0, ∞). As the constitutive assumption we let φ, η, P, M, J, q, k, at x ∈  at time t ∈ R, be given by functionals of ((t),  t ) = (E(t), H(t), θ(t), ∇θ(t), Et , Ht , θt , ∇θt ), at the same point x. Hence we say that, e.g. φ(t) is given by ˆ φ(t) = φ(E(t), H(t), θ(t), ∇θ(t), Et , Ht , θt , ∇θt ), the dependence on x being understood. In words, φ, η, P, M, J, q, k depend on the present values and on the past histories of E, H, θ, ∇θ. We assume the functionals ˆ η, ˆ M, ˆ J, ˆ q, ˆ on H, are continuous. Moreover we assume φˆ is differentiable ˆ k, φ, ˆ P, with respect to  and Fréchet differentiable with respect to rt so that ˆ ˆ ˆ φ((t) + (t),  t + t ) = φ((t),  t |t ) + o(((t), t )),  t ) + ∂ φˆ · (t) + d φ((t), t ˆ where d φ(·| ) is the Fréchet differential induced by the change of the past history. t If (t) and  are differentiable, with respect to t, then

ˆ ˙ = ∂ φˆ · (t) ˙ + d φ((t),  t |˙ t ) φ(t) or, more explicitly,

466

7 Materials with Memory ˙ ˙ + ∂∇θ φˆ · ∇ θ(t) ˙ + d φ((t), ˆ ˙ ˙ ˙ t , θ˙ t , ∇ θ˙ t ). φ(t) = ∂E φˆ · E(t) + ∂H φˆ · H(t) + ∂θ φˆ θ(t)  t |E˙ t , H

To derive the thermodynamic restrictions we now examine the entropy inequality (7.107). Substitution of φ˙ results in ˙ − ρ∂∇θ φˆ · ∇ θ˙ −ρ(∂θ φˆ + η)θ˙ − (ρ∂E φˆ + P) · E˙ − (ρ∂H φˆ + μ0 M) · H 1 ˆ ˙ t , θ˙t , ∇ θ˙t ) + θ∇ · k ≥ 0,  t |E˙ t , H +E · J − q · ∇θ − ρd φ((t), θ ˙ E, ˙ H, ˙ and ∇ θ˙ occur linearly. Now, each term being evaluated at time t. The values θ, at any x ∈  and t ∈ R, these values can be chosen arbitrarily. This is so because, as is usually the case, θ˙ enters the balance of energy and the equation holds with a corresponding value of r . Likewise ∇ θ˙ can be chosen arbitrarily and the balance of ˙ can be chosen energy holds with a corresponding value of ∇r . Moreover, E˙ and H arbitrarily since Maxwell’s equations hold with appropriate values of ∇ × H and ∇ × E, which do not enter the inequality. Hence the inequality has to hold with ˙ E, ˙ This happens if and only if ˙ H, ˙ and ∇ θ. arbitrary values of θ, ∂∇θ φˆ = 0, ˆ η = −∂θ φ,

ˆ P = −ρ∂E φ,

ˆ μ0 M = −ρ∂H φ,

(7.108)

and 1 ˆ ˙ rt , θ˙rt , ∇ θ˙rt ) + θ∇ · k ≥ 0. rt |E˙ rt , H E · J − q · ∇θ − ρd φ((t), θ

(7.109)

Further restrictions follow from the reduced inequality (7.109) if particular functionals are considered. Two remarkable models hold. First, let φ = φ0 (θ, E, H) +

∞ 2 2 1 ∞ 1 ∫ σ(s)E(t − s)ds + ∫ κ(s)∇θ(t − s)ds , 2ρσ(0) 0 2ρκ(0)θ 0

subject to σ(∞) = 0, κ(∞) = 0. Hence it follows η = −∂θ φ0 +

∞ 2 1 ∫ κ(s)∇θ(t − s)ds , 2ρκ(0)θ2 0

P = −ρ∂E φ0 ,

μ0 M = −ρ∂H φ0

and ∞ 1 ∞ 1 ˙ − s)ds ∫ σ(s)E(t − s)ds · ∫ σ(s)E(t E · J − q · ∇θ + θ∇ · k − θ σ(0) 0 0 ∞ 1 ∞ ˙ − s)ds ≥ 0. ∫ κ(s)∇θ(t − s)ds · ∫ κ(s)∇ θ(t − κ(0) 0 0

˙ − s) = −∂s E(t − s) and integrate by parts to obtain Notice that E(t

7.6 Electromagnetic Materials with Memory

467





0

0

˙ − s)ds = σ(0)E(t) − ∫ σ (s)E(t − s)ds ∫ σ(s)E(t

˙ The inequality can be written as and the like for the integral with ∇ θ. ∞ ∞  q E(t) · J(t) − ∫ σ(s)E(t − s)ds − ∇θ(t) · (t) + ∫ κ(s)∇θ(t − s)ds θ 0 0 ∞ 1 ∞ ∫ σ(s)E(t − s)ds · ∫ σ (s)E(t − s)ds +θ∇ · k − σ(0) 0 0 ∞ 1 ∞ ∫ κ(s)∇θ(t − s)ds · ∫ κ (s)∇θ(t − s)ds ≥ 0. − κ(0)θ 0 0

This inequality holds if k = 0 and σ(s) = σ(0) exp(−αs), α > 0, σ(0) > 0, ∞

J(t) = ∫ σ(s)E(t − s)ds, 0

κ(s) = κ(0) exp(−βs), β > 0, κ(0) > 0 ∞

q(t) = − ∫ κ(s)∇θ(t − s)ds. 0

Let (0, ) be the support of σ and κ and

σ˜ = ∫ σ(s)ds, 0



κ˜ = ∫ κ(s)ds. 0

The limit as → 0, that is the passage from fading memory to instantaneous response, yields J(t) → σ˜ E(t), q(t) → −κ˜ ∇θ(t), thus recovering Ohm’s law and Fourier’s law. Another model allows for memory effects also for the polarization P and the magnetization M. The purpose is to obtain the constitutive relations ∞

P(t)=G0 E(t)+ ∫ G (s)E(t − s)ds, 0



μ0 M(t) = A0 H(t) + ∫ A (s)H(t − s)ds, 0

G0 , G , A0 , A being symmetric tensors. Let ∞

G∞ = G0 + ∫ G (s)ds, 0



A∞ = A0 + ∫ A (s)ds. 0

Borrowing from the previous scheme and that of linear viscoelasticity we consider

468

7 Materials with Memory ∞

ρφ = ρ f (θ) + 21 {E(t) · G∞ E(t) − ∫ [E(t) − E(t − s)] · G (s)[E(t) − E(t − s)]ds} 0 ∞

+ 21 {H(t) · A∞ H(t) − ∫ [H(t) − H(t − s)] · A (s)[H(t) − H(t − s)]ds} 0

− 21  −1 (0)

∞ 2 ∞ 2 1 ∫ (s)E(t − s)ds − K−1 (0) ∫ K(s)∇θ(t − s)ds . 2θ 0 0

By (7.108) and (7.109) it follows that ∞



0

0





0

0

P(t) = G∞ E(t) − ∫ G (s)[E(t) − E(t − s)]ds = G0 E(t) + ∫ G (s)E(t − s)ds, μ0 M(t)=A∞ H(t)− ∫ A (s)[H(t)−H(t − s)]ds = A0 H(t) + ∫ A (s)H(t − s)ds, and ∞



˙ − s)ds − ∫ [H(t) − H(t − s)] · A (s)H(t ˙ − s)ds − ∫ [E(t) − E(t − s)] · G (s)E(t 0

0

∞ ∞ 1 ˙ − s)ds +E · J − q · ∇θ −  −1 (0) ∫ (s)E(t − s)ds · ∫ (s)E(t θ 0 0 ∞ ∞ 1 ˙ − s)ds + θ∇ · k ≥ 0. − K−1 (0) ∫ K(s)∇θ(t − s)ds · ∫ K(s)∇ θ(t θ 0 0

Since

˙ − s) = ∂s [E(t) − E(t − s)] E(t

then an integration by parts yields ∞

˙ − s)ds = − ∫ [E(t) − E(t − s)] · G (s)E(t 0

1 2



∫ [E(t) − E(t − s)] · G (s)[E(t) − E(t − s)]ds, 0

and likewise for the second integral. Further, ∞



0

0



˙ − s)ds = −E(t) · ∫ (s)E(t − s)ds − −1 (0) ∫ (s)E(t − s)ds · ∫ (s)E(t 0

−1





0

0

− (0) ∫ (s)E(t − s)ds · ∫  (s)E(t − s)ds, and likewise for the integrals on ∇θ. The inequality is then written as

7.6 Electromagnetic Materials with Memory 1 2



∫ Ed (t, s) · G (s)Ed (t, s)ds + 0

469 1 2



∫ ∇θd (t, s) · A (s)∇θd (t, s)ds 0





0

0

− −1 (0) ∫ (s)E(t − s)ds · ∫  (s)E(t − s)ds ∞ ∞ 1 − K−1 (0) ∫ K(s)∇θ(t − s)ds · ∫ K (s)∇θ(t − s)ds θ 0 0 ∞ ∞ 1 +E · (J − ∫ (s)E(t − s)ds) − (q + ∫ K(s)∇θ(t − s)ds) + θ∇ · k ≥ 0, θ 0 0

where Ed (t, s) = E(t) − E(t − s), ∇θd (t, s) = ∇θ(t) − ∇θ(t − s). The whole scheme holds if A (s) ≥ 0, G (s) ≥ 0,  (s) = −α(s), α > 0, (s) > 0, K (s) = −βK(s), β > 0, K(s) > 0 and



J = ∫ (s)E(t − s)ds, 0



q = − ∫ K(s)∇θ(t − s)ds. 0

The positive definiteness of the electrical conductivity tensor function  and of the thermal conductivity tensor function K generalize to memory functionals the wellknown features of Ohm’s law and Fourier’s law. If further G and A have the form of exponentials, G(s) = G0 exp(−s/τ E ),

A(s) = A0 exp(−s/τ B ),

τ E , τ B > 0,

then G (s) ≥ 0 and A (s) ≥ 0 imply G(s) ≥ 0 and A(s) ≥ 0.

7.6.4 Magneto-Viscoelasticity There are materials (magnetorheological elastomers) that change their mechanical behaviour in response to the application of a magnetic field. Depending on the preparation method, the resulting material may be isotropic or anisotropic. The response of magnetization to an applied magnetic field is not instantaneous and hence the magnetoelastic properties are to be generalized by allowing for the viscoelastic nature. The time delay in response is essential in a proper modelling. Sometimes this delay effect is realized by superposing a viscous overstress on elastic deformation. A description is realized by allowing for an intermediate configuration; a viscous motion produces the intermediate configuration and next the elastic deformation leads to the current configuration. Both the deformation and the mag-

470

7 Materials with Memory

netic variables are the sum of an elastic and a viscous term, the viscous terms being determined by nonlinear rate-type equations [385]. Here we avoid the introduction of an intermediate configuration and let the viscoelastic scheme represent the elastic part via the instantaneous term and the dissipative (delayed) term via the memory integral. Look at a magnetisable, deformable, and heat-conducting body. The balance of energy is written in the form ˙ + T · L − ∇ · q + ρr ρε˙ = μ0 ρH · m while the balance of angular momentum results in skwT = skw(μ0 M ⊗ H). To allow for the interaction between the deformation and temperature fields with magnetization, in a scheme where both instantaneous response and delayed effects occur, we let the material properties be described by θ, F, H, ∇θ, both via the present values and the past histories. Indeed, we consider  = (θ, F, H, ∇θ, Ft , Ht , ∇θt ) be the set of independent variables. To guarantee the invariance we let φ depend on F, H via the invariants E = 21 (FT F − 1), H = FT H; observe that H = H R is the magnetic field in the reference configuration and is Euclidean invariant, (FT H)∗ = (QF)T QH = FT H. To make it apparent that the reference configuration is stress-free we use the strain E = 21 (FT F − 1) as the deformation variable, in place of F. The second-law inequality and the balance of energy yield ˙ + T · L − μ0 M · H ˙ − 1 q · ∇θ ≥ 0, −ρ(φ˙ + η θ) θ where φ = ε − θη − μ0 m · H; we restrict attention to models with zero extra-entropy flux. We let φ, η, T, q depend on . The free energy φ is required to satisfy a minimum property. Let θ, F, and H be defined on R. Constant continuations of θt0 , Ft0 , Ht0 are defined by  θ(t ) = θt0 (0), u ∈ [0, t − t ), 0 ˜ = θ˜ t (0), θ˜ t (u) = t0 0 θ(t) u ∈ [t − t0 , ∞), θ (u − t0 ),

and the like for Ft and Ht . In addition, we let ∇θ = 0 on [t0 , t]. Then it follows from the entropy inequality that ˙ ) ≤ 0, φ(τ Hence we have

τ ∈ (t0 , t).

7.6 Electromagnetic Materials with Memory

471

˜ ˜ ˜ ˜ t , ∇θt ) ≤ φ(θ(t0 ), F(t0 ), H(t0 ), Ft0 , Ht0 , 0). φ(θ(t), F(t), H(t), 0, F˜ t , H By the continuity of the functional, as t − t0 → ∞ ˜ ˜ t , ∇θt ) → φ(θ(t0 ), F(t0 ), H(t0 ), F0† , H0† , 0† ), ˜ ˜ φ(θ(t), F(t), H(t), 0, F˜ t , H ˜ 0 ) = F0 , H0† (u) = H(t ˜ 0) = where F0† and H0† are the constant histories F0† (u) = F(t H0 , up to time t, and u ∈ [0, ∞). Hence φ(θ(t0 ), F(t0 ), H(t0 ), 0, F0† , H0† , 0† ) ≤ φ(θ(t0 ), F(t0 ), H(t0 ), 0, Ft0 , Ht0 , ∇θt0 ). Hence, among all histories θt , Ft , Ht , ∇θt with given present values θ0 , F0 , H0 and ∇θ(t0 ) = 0, none yields a smaller value of the free energy than that corresponding to the constant histories θ0† , F0† , H0† and zero temperature gradient. The free energy φ depends on , through scalar invariants, and is supposed to be continuously differentiable. If φ depends on F and H via E , H then the inequality becomes ˙ +T·L−μ M·H ˙ − 1 q · ∇θ −ρ(∂θ φ + η)θ˙ − ρ∂E φ · E˙ − ρ∂H φ · H˙ − ρ∂∇θ φ · ∇θ 0 θ t t t ˙ ˙ ˙ −ρdφ(|E ) − ρdφ(|H ) − ρdφ(|∇θ ) ≥ 0.

˙ implies ˙ ∇θ The arbitrariness of θ, ∂∇θ φ = 0,

η = −∂θ φ.

˙ = FT D F and H˙ = FT LT H + FT H ˙ we can write the inequality in the Since E˙ = 21 C form ˙ (T − ρF∂E φFT + ρF∂H φ ⊗ H) · D + (T + ρF∂H φ ⊗ H) · W − (μ0 M + ρF∂H φ) · H t 1 t t ˙ ) ≥ 0. − q · ∇θ − ρdφ(|E˙ ) − ρdφ(|H˙ ) − ρdφ(|∇θ θ

˙ implies The arbitrariness of D , W, H T = ρF∂E φFT − ρF∂H φ ⊗ H,

μ0 M = −ρF∂H φ.

Since F∂E φFT ∈ Sym then skwT = skw(μ0 M ⊗ H), as is required by the balance of angular momentum. The inequality then reduces to 1 t t ˙ t ) ≥ 0. − q · ∇θ − ρdφ(|E˙ ) − ρdφ(|H˙ ) − ρdφ(|∇θ θ Restrictions placed by the reduced inequality are determined by considering some particular cases. First we suppose the temperature is uniform, ∇θt = 0. Nonlinear

472

7 Materials with Memory

constitutive equations for T and M are established by looking for the partial derivatives ∞

E|)E E + g(θ) ∫ G (u)E E(t − u)du, ∂E φ = G0 (θ, |E

G0 , G ∈ Lin,

0



H|)H H − a(θ) ∫ A (u)H H(t − u)du, ∂H φ = −A0 (θ, |H 0

A0 , A ∈ Lin.

E|) ∈ Lin, with fully symmetric values, we have Now, given G (θ, |E ∂E 21 E · G E = G E +

1 E · ∂|E E. (E E|G E )E E| 2|E

H|). Hence we obtain The analogue follows for A (θ, |H G0 = G +

1 E · ∂|E (E E| G E )I, E| 2|E

A0 = A +

1 H · ∂|H (H H| AH )I, H| 2|H

I being the identity operator in Lin. The functional φ(θ, E , H , E t , H t ) then takes the form ∞



0

0

E · ∫ G (u)E E(t − u)du − 21 H · AH − a(θ)H H · ∫ A (u)H H(t − u)du. φ = (θ, E t , H t ) + 21 E · G E + g(θ)E

Among the possible free energies consider ∞



0

0

Et (u)du − 21 a(θ) ∫ H t (u) · A (u)H Ht (u)du. (θ, E t , Ht ) = φ0 (θ) + 21 g(θ) ∫ E t (u) · G (u)E

The whole functional φ(θ, E , H , E t , H t ) is required to satisfy t t dφ(|E˙ ) + dφ(|H˙ ) ≤ 0.

(7.110)

To this end observe that ∞

E(t) − E (t − u)] · G (u)[E E(t) − E (t − u)]du φ = φ0 (θ) + 21 E · G ∞E − 21 g(θ) ∫ [E 0



H(t) − H (t − u)] · A (u)[H H(t) − H (t − u)]du =: φm . − 21 H · A ∞H + 21 a(θ) ∫ [H 0

(7.111) It follows that

7.6 Electromagnetic Materials with Memory

473



t E(t) − E (t − u)] · G (u)E˙ (t − u)]du dφ(|E˙ ) = − 21 g(θ) ∫ [E 0

E(t) − E (t − u)] · G (u)[E E(t) − E (t − u)]∞ = 21 g(θ){[E 0 ∞

E(t) − E (t − u)] · G (u)[E E(t) − E (t − u)]du}. − ∫ [E 0

The assumption G (∞) = 0 implies ∞

t E(t) − E (t − u)] · G (u)[E E(t) − E (t − u)]du. dφ(|E˙ ) = − 21 g(θ) ∫ [E 0

Likewise, letting A (∞) = 0 we find ∞

t H(t) − H (t − u)] · G (u)[H H(t) − H (t − u)]du. dφ(|H˙ ) = 21 a(θ) ∫ [H 0

Hence (7.110) holds if and only if g(θ)G (u) ≥ 0,

a(θ)A (u) ≤ 0

for all u ∈ [0, ∞). The minimum property of φ at constant histories E † , H † holds if and only if g(θ)G (u) ≤ 0,

a(θ)A (u) ≥ 0.

Consequently the functional φ of (7.111) is thermodynamically consistent if G and G , as well as A and A , have opposite types of definiteness. Now we let ∇θ )= 0. Let q and φ depend on ∇θt via the referential gradient ∇R θt = ∇θt Ft . For simplicity we look for models where the reduced inequality splits into two separate inequalities, which is anyway a sufficient condition for the validity of the second-law inequality. Hence we require that 1 ρ θ dφ(|∇R θ˙t ) + q · ∇θ ≥ 0. θ Multiply by J = det F and observe that, by means of the referential heat flux Q = J F−T q, we can write (7.112) ρ R θdφ(|∇R θ˙t ) + Q · ∇R θ ≤ 0. For definiteness we look for the thermodynamic consistency of the constitutive equation ∞

Q (θ(t), ∇R θt ) = α(θ)K0 ∫ β(u)∇R θ(t − u)du, 0

474

7 Materials with Memory

where K0 is supposed to be positive definite while α and β are so far undetermined; we let β(∞) = 0. We let φ = φm + φc where φc is taken in the form

1/2 ∞

2 φc = α(θ) K0 ∫ β(u)∇R θ(t − u)du . 0

The minimum property of φ at ∇R θ ≡ 0 implies that α > 0. Inequality (7.112) can be written as ∞





0

0

0

˙ − u)du + (K0 ∫ β(u)∇R θ(t − u)du) · ∇R θ ≤ 0. ρ R θ(K0 ∫ β(u)∇R θ(t − u)du) · ∫ β(u)∇R θ(t

By integration by parts we have ∞



˙ − u)du = − ∫ β(u)∂u ∇R θ(t − u)du ∫ β(u)∇R θ(t 0

= −[β(u)∇R θ(t

− u)]∞ 0

0







+ ∫ β (u)∇R θ(t − u)du = β(0)∇R θ(t) + ∫ β(u)∇R θ(t − u)du. 0

0

Hence the inequality can be written in the form ∞

(ρ R θβ(0) + 1)(K0 ∫ β(u)∇R θ(t − u)du) · ∇R θ(t) 0





0

0

+ρ R θ(K0 ∫ β(u)∇R θ(t − u)du) · ∫ β (u)∇R θ(t − u)du ≤ 0. The arbitrariness of ∇R θ(t) implies β(0) = −

1 , ρR θ

β(u)β (u) ≤ 0.

Since β(0) < 0 and (β 2 ) ≤ 0 then β(u) ≤ 0 for all u ∈ R+ . Consequently α(θ)K0 β(u) ≤ 0 as expected. Let

β(u) ˆ ≥ 0. β(u) = β(0)

Hence Q=−

α(θ) ∞ ˆ K0 ∫ β(u)∇R θ(t − u)du, ρR θ 0

At the limit of short memory it follows

7.6 Electromagnetic Materials with Memory

Q=−

475

∞ α(θ) ˆ K0 ( ∫ β(u)du)∇ R θ. ρR θ 0

In the spatial description we have q=−

∞ α(θ) FK0 ∫ β(u)(∇θF)(t − u)du. J ρR θ 0

7.6.5 Rate Equations in the Eulerian Description It is a crucial point of magnetoelasticity, as well as of magneto-viscoelasticity, that the stress tensor need not be symmetric. Hence the mechanical power T · L need not equal T · D and moreover22 T · L = J −1 (FT R R FT ) · L = J −1 T R R · (FT D F) + J −1 T R R · (FT WF). Since E˙ = FT D F then T · L = J −1 T R R · E˙ + J −1 T R R · (FT WF). For simplicity we now ignore heat conduction. Hence in the Eulerian description we write the Clausius–Duhem inequality in the form ˙ + T · L − μ0 M · H ˙ = θσ ≥ 0. −ρ(ψ˙ + η θ) Since we look for rate equations, objectivity indicates that the independent variables are invariant so that their time derivatives be invariant too. Hence we assume that ψ = ψ(θ, T , E, H), where T is a stress-like, invariant, variable to be identified. Consequently the inequality can be written as ˙ ≥ 0. −ρ(∂θ ψ + η)θ˙ − ρ∂τ ψ · T˙ − ρ∂E ψ · E˙ − ρ∂H ψ · H˙ + T · L − μ0 M · H ˙ then we have Since H˙ = FT LT H + FT H −ρ(∂θ ψ + η)θ˙ − ρ∂τ ψ · T˙ − ρ∂E ψ · E˙ + (T − ρF∂E ψFT − ρH ⊗ F∂H ψ) · (F−T E˙ F−1 ) ˙ ≥ 0. +(T − ρH ⊗ F∂H ψ) · W − (ρF∂H ψ + μ0 M) · H

22

In this section, D is the stretching tensor, E is the Green-St Venant tensor and H = FT H.

476

7 Materials with Memory

˙ H, ˙ and W implies The arbitrariness of θ, η = −∂θ ψ,

μ0 M = −ρF∂H ψ,

skw(T + μ0 H ⊗ M) = 0.

(7.113) (7.114)

Thus we replace −ρF∂H ψ with μ0 M and write the remaining inequality in the form −ρ∂τ ψ · T˙ + (−ρ∂E ψ + J −1 T R R + μ0 F−1 H ⊗ F−1 M) · E˙ ≥ 0. This result suggests that we define T := T R R + J μ0 F−1 H ⊗ F−1 M so that we have T − ρ∂E ψ) · E˙ = J θσ ≥ 0. − ρ R ∂τ ψ · T˙ + (T

(7.115)

Observe that F−1 H and F−1 M are invariant vectors. This follows by a direct check, (F−1 H)∗ = (QF)−1 QH = F−1 Q−1 QH = F−1 H, and the like for F−1 M. This in turn proves the invariance of T . Moreover letting M = J F−1 M we can write (7.113)2 in the form μ0M = −ρ R ∂H ψ.

(7.116)

If T˙ and E˙ are independent and σ = 0 then it follows from (7.115) that ∂T ψ = 0,

T = ρ R ∂E ψ;

(7.117)

Equations (7.117) are said to characterize hyper-magnetoelastic materials. The results (7.116) and (7.117) allow us to write the incremental relations

If, again, σ = 0 but

then we have

˙ = −ρ R ∂E ∂H ψ E˙ − ρ R ∂H ∂H ψ H˙ , μ0M

(7.118)

T˙ = ρ R ∂E ∂E ψ E˙ + ρ R ∂H ∂E ψ H˙ .

(7.119)

∂τ ψ )= 0,

∂E ψ )= 0

T − ρ R ∂E ψ) · E˙ = 0; − ρ R ∂T ψ · T˙ + (T

(7.120)

7.6 Electromagnetic Materials with Memory

477

Equation (7.120) is said to characterize hypo-magnetoelastic materials. We can then apply the representation formula to T˙ , T˙ = (T˙ · N)N + (I − N ⊗ N)G, where G is any second-order tensor. Since ∂T ψ )= 0 then we let N = ∂T ψ/|∂T ψ|. By (7.120) we have T − ρ R ∂E ψ) · E˙ (T ∂T ψ ∂T ψ T˙ = ⊗ )G ∂T ψ + (I − 2 ρ R |∂T ψ| |∂T ψ| |∂T ψ| or T˙ =

1 ∂T ψ ∂T ψ T − ρ R ∂E ψ)]E˙ + (I − [∂T ψ ⊗ (T ⊗ )G ρ R |∂T ψ|2 |∂T ψ| |∂T ψ|

If, further, σ )= 0 then the representation of T˙ generalizes to T − ρ R ∂E ψ) · E˙ − J θσ ∂T ψ (T ∂T ψ ∂T ψ + (I − T˙ = ⊗ )G. 2 ρ R |∂T ψ| |∂T ψ| |∂T ψ|

(7.121)

Definite forms of (7.121) are now established by having in mind fluid or solid behaviours. Fluids We assume ψ and σ in the forms T |2 − 21 ξ|H H |2 , ρ R ψ = ρ R ψ0 (θ) + 21 α|T H |2 , T |2 + γ|H J θσ = β|T

α, ξ > 0,

β, γ > 0.

The dependence of ψ on H results in the magnetization relation H. μ0M = ξH T then (7.121) can be written in the form Since ∂E ψ = 0 and ρ R ∂T ψ = αT H |2 γ|H T T ⊗T ˙ T β T − ⊗ )G. T˙ = T + (I − E − T |2 T |2 T | |T T| α|T α α|T |T If, for definiteness, we let G = E˙ /α then it follows H |2 1 γ|H β T˙ = E˙ − T − T. T |2 α α α|T

478

7 Materials with Memory

T |2 then we obtain If, in particular, we let γ = γ0 |T T, T˙ = aE˙ − bT

H|2 /α > 0; a = 1/α > 0, b = β/α + γ0 |H

(7.122)

equation (7.122) is a generalization of the Maxwell (fluid) model, the parameter b being affected by the magnetic field H . In this connection we observe that, given H| is assumed to be known (7.122) can be solved on (0, t) to obtain T 0 = T (0), if |H t

t

t

0

0

τ

T (t) = T 0 exp[− ∫ b(λ)dλ] + ∫ a(τ ) exp[− ∫ b(λ)dλ]E˙ (τ )dτ . 0

If b ≥ b0 > 0 then T (−y) exp[− ∫ b(λ)dλ vanishes as y → ∞. Hence we let 0

−y

0

˙ T 0 = ∫ a(τ ) exp[− ∫ b(λ)dλ]E˜ (τ )dτ , −∞

τ

where E (τ ) = E˜ (τ ), τ ≤ 0. Substituting the value of T 0 we find t

t



s

−∞

τ

0

0

T (t) = ∫ a(τ ) exp[− ∫ b(λ)dλ]E˙ (τ )dτ = ∫ a(t − s) exp[− ∫ b(ξ)dξ)]E˙ (t − s)ds.

(7.123) As a comment we observe that the memory kernel of (7.123) has the form t

G(τ , t − τ ) = a(τ ) exp[− ∫ b(λ)dλ]; τ

it describes an aging effect in that the function G changes in time as a consequence of the factor a(τ ). We get a standard memory kernel t

G(t − τ ) = a0 exp[− ∫ b(λ)dλ] τ

if a = a0 is constant. Another model is obtained by letting σ be a viscous term in the form J θσ = ν|E˙ |2 + ζ|H˙ |2 ,

ν, ζ > 0.

It follows that T ⊗T ˙ T ν|E˙ |2 + ζ|H˙ |2 T T˙ = T + (I − E − ⊗ )G. 2 2 T| T| T | |T T| α|T α|T |T If G = E˙ /α then we have ν|E˙ |2 + ζ|H˙ |2 1 T. T˙ = E˙ − T |2 α α|T

(7.124)

7.6 Electromagnetic Materials with Memory

479

As to the influence of the magnetic field through H˙ we observe that ◦

˙ = FT (H +DH), H˙ = FT (LT H + H) ◦



˙ − WH. Hence we have H denoting the corotational derivative, H= H ◦



|H˙ |2 = (H +DH) · FFT (H +DH) and the representation (7.124) can be written in the form ◦



T ⊗T ˙ T ζ(H +DH) · FFT (H +DH) T ν|E˙ |2 T˙ = ⊗ )G. T − T + (I − E − 2 2 2 T| T| T| T | |T T| α|T α|T α|T |T

7.6.6 Solids Solids are characterized by a stress dependence such that, asymptotically, T = G∞E . Hence we let formally replace T of the fluid model with T − G∞E . Define T − G∞E ) · A(T T − G∞E ) − 21 H · H H ρ R ψ = ρ R ψ0 (θ) + 21 E G∞E + 21 (T and H)A(T T − G∞E )], T − G∞E ) · [(β + H · H J θσ = (T

β > 0,

where ,  ∈ Sym+ and A, G∞ are fourth-order positive definite tensors. Observe that H, ρ R ∂H ψ = −H

ρ R ∂T ψ = A(T − G∞E ),

T ρ R ∂E ψ = G∞E − G∞ A(T − G∞E ).

H. Moreover, by (7.116) it follows M = J F−1 M = μ−1 0 H We now apply the representation (7.121) by letting N=

T − G∞E ) A(T ∂T ψ = . T − G∞E )| |∂T ψ| |A(T

To within (I − N ⊗ N)G, the representation formula yields T H]A(T − G∞E ) A(T − G∞E )]E˙ − (T − G∞E ) · [β + H H [T − G∞E + G∞ N |A(T − G∞E )| H](T − G∞E ) A(T − G∞E ) · [A−1 + G∞E )]E˙ − A(T − G∞E ) · [β + H H N. = |A(T − G∞E )|

T˙ 

480

7 Materials with Memory

Hence we have H](T − G∞E ) + (I − N ⊗ N)G. T˙ = (N ⊗ N)[A−1 + G∞ ]E˙ − (N ⊗ N)[β + H · H

Choosing, e.g., T − G∞E ) G = [A−1 + G∞ ]E˙ − [β + H H](T we find

whence

H](T T − G∞E ), T˙ = [A−1 + G∞ ]E˙ − [β + H · H H](T T − G∞E ) = A−1E˙ . T˙ − G∞E˙ + [β + H · H

(7.125)

Equation (7.125) shows that T − G∞E evolves with a relaxation time τ=

1 . H β + H · H

Moreover, if E˙ = 0 then asymptotically we have T = G∞E as we expect for a solid model.

7.7 Causality and Kramers–Kronig Relations Consider a generic response function of materials with memory, ∞

R(t) = G 0 F(t) + ∫ G(u)F(t − u)du, 0

G(u) → 0 as u → 0.

The restriction of the kernel G to R+ , or rather its vanishing for u < 0, means that at time t only values of the cause F prior to that time enter in determining the response R(t). If F is time harmonic, F(t) = F0 exp(−iωt), then R(t) = G(ω)F(t),



G(ω) := G 0 + ∫ G(u) exp(iωu)du.

Let ω ∈ C. Letting ω = ωr + iωi we have

0

7.7 Causality and Kramers–Kronig Relations

481





0

0

ˆ G(−ω) = G 0 + ∫ G(u) exp(−iωu)du = G 0 + ∫ G(u) exp(−iωr u + ωi u)du ∞

= [G 0 + ∫ G(u) exp(iω ∗ u)du]∗ 0

whence

ˆ G(−ω) = Gˆ ∗ (ω ∗ ).

Let z = x + i y. The function ∞



0

0

ˆ G(z) := ∫ G(u) exp(i zu)du = ∫ G(u) exp(i xu) exp(−yu)du ˆ is bounded as y ≥ 0. In addition, we assume G(z) → 0 as |z| → ∞. Hence ∫ C

ˆ G(z) dz = 0 z−ω

if z = ω is outside the contour C. We then choose the contour to trace the real axis, with a hump over the point z = ω ∈ R, and a semicircle in the upper half plane. By Cauchy’s residue theorem we then find that ∞

0= ∫

−∞

where

ˆ G(x) ˆ d x − iπ G(ω), x −ω







−∞



∫ = lim { ∫ + ∫ }

−∞

→0

is also viewed as the principal value. Hence it follows ˆ G(ω) =

1 iπ





−∞

ˆ G(x) d x. x −ω

This in turn implies that the real and imaginary parts Gˆ r , Gˆ i satisfy Gˆ r (ω) =

1 π





−∞

Gˆ i (x) d x, x −ω

ˆ r (x) ∞ G Gˆ i (ω) = − π1 ∫ d x. −∞ x − ω

For any function f on R the Hilbert transform f H (ω) is defined by f H (ω) =

1 π





−∞

f (x) d x, ω−x

(7.126)

482

7 Materials with Memory

the integral being understood in the principal value sense. Thus, by (7.126) we can say that Gˆ i is the Hilbert transform of Gˆ r while Gˆ r is the Hilbert transform of −Gˆ i . By definition ∞





0

0

0

ˆ G(x) = ∫ G(u) exp(i xu)du, Gˆ r (x) = ∫ G(u) cos(xu)du, Gˆ i (x) = ∫ G(u) sin(xu)du,

and hence

Gˆ r (x) = Gˆ r (−x), Gˆ i (x) = −Gˆ i (−x),

x ∈ R;

Gˆ r (x) is even, Gˆ i (x) is odd. Consider the relations in (7.126) and multiply the numerator and denominator of the integrand by x + ω. In view of the even and odd properties we obtain Gˆ r (ω) =

2 π



∫ 0

x Gˆ i (x) d x, x 2 − ω2

ˆ r (x) ∞ ωG Gˆ i (ω) = − π2 ∫ 2 d x. 2 0 x −ω

Since Gˆ r = Gr − G 0 then we have Gr (ω) − G 0 =

2 π



∫ 0

xGi (x) d x, x 2 − ω2



Gi (ω) = − π2 ∫ 0

ω[Gr (x) − G 0 ] d x. x 2 − ω2

(7.127)

Equations (7.127) are commonly referred to as Kramers–Kronig relations of the complex-valued response function G.

7.8 Memory Models via Fractional Derivatives In a letter dated September 30th, 1695, L’Hospital wrote to Leibniz asking him about a particular notation he had used for the nth derivative, D n /Dx n . L’Hospital posed the question to Leibniz, what would the result be if n = 1/2. Leibniz answered “An apparent paradox, from which one day useful consequences will be drawn”. In these words fractional calculus is regarded to be born. Among the mathematicians who first investigated fractional calculus, along with the mathematical properties, are Fourier, Euler and Laplace. Many definitions have been given for a non-integer-order derivative. Perhaps the most popular one is that of Riemann–Liouville. Most of the mathematical theory of fractional calculus was developed prior to the turn of the twentieth century. Yet it is in the past 100 years that the most intriguing laps in engineering and scientific application have been found.

7.8 Memory Models via Fractional Derivatives

483

7.8.1 Preliminaries on Fractional Derivatives Understanding of definitions and use of fractional calculus become more immediate by a preliminary review on the gamma function, the beta function, and the MittagLeffler function. For *z > 0 the gamma function (z) is defined by Euler’s formula ∞

(z) = ∫ exp(−t)t z−1 dt. 0

Differentiation with respect to z can be carried out and it follows that ∞

 (z) = ∫ exp(−t)t z−1 ln tdt. 0

Integration by parts of (7.128) yields ∞

∞ tz ∫ exp(−t)t z−1 dt = ∫ exp(−t) dt z 0 0

whence (z + 1) = z(z). Since (1) = 1 then

(n + 1) = n!

so that  is a generalized factorial function. The beta function B( p, q) is defined by 1

B( p, q) = ∫ t p−1 (1 − t)q−1 dt, 0

* p > 0, *q > 0.

Replacing t by 1 − τ we have 1

1

0

0

∫ t p−1 (1 − t)q−1 dt = ∫(1 − τ ) p−1 (τ )q−1 dτ and hence B( p, q) = B(q, p). By the change of variable t → τ : t = τ /(1 + τ ), τ ∈ [0, ∞), we find ∞

B( p, q) = ∫ 0

τ p−1 dτ . (1 + τ ) p+q

(7.128)

484

7 Materials with Memory

It is possible to express the beta function in terms of gamma functions. To this end, letting * p > 0, *q > 0 we have ∞ ∞ ∞  ∞ ( p)(q) = ∫ exp(−t)t p−1 dt ∫ exp(−τ )τ q−1 dτ = ∫ exp(−t)t p−1 dt t q ∫ exp(−t x)x q−1 d x 0

0 0 0 ∞ ∞ ∞ x q−1 q−1 p+q−1 ∫ exp(−τ )τ p+q−1 dτ . = ∫x d x ∫ exp(−t (x + 1)) t dt = ∫ p+q d x 0 0 0 (1 + x) 0 ∞

Hence we can write B( p, q) =

( p)(q) . ( p + q)

In terms of the gamma function, the Mittag-Leffler functions are defined by E α (z) = E α,β (z) =

∞

∞ k=0

k=0

zk , (αk + 1)

zk , (αk + β)

α ∈ C, *α > 0,

α, β ∈ C, *α > 0, *β > 0.

7.8.2 Fractional Derivatives We start from Cauchy’s formula on iterated integrals, x x1

xn−1

a a

a

x 1 ∫(x − y)n−1 f (y)dy, (n − 1)! a

∫ ∫ · · · ∫ f (xn )d xn ...d x2 d x1 =

where a < x and n ∈ N. The proof is established by induction. If n = 1 this formula holds trivially as x

x

a

a

∫ f (y)dy = ∫ f (y)dy. Let

xn−1

F(xn−1 ) = ∫ f (xn )d xn . a

Hence the validity for n − 1 is expressed in the form x x1

xn−1

x x1

xn−2

a a

a

a a

a

∫ ∫ · · · ∫ f (xn )d xn ...d x2 d x1 = ∫ ∫ · · · ∫ F(xn−1 )d xn−1 ...d x2 d x1 x 1 ∫(x − y)n−2 F(y)dy. = (n − 2)! a

7.8 Memory Models via Fractional Derivatives

485

An integration by parts yields x x 1 1 ∫(x − y)n−2 F(y)dy = ∫(x − y)n−1 f (y)dy, (n − 2)! a (n − 1)! a

which completes the proof. By Cauchy’s formula, the right-hand side is the n-fold integral of f . Also, since (n) = (n − 1)!, we can define the (Riemann–Liouville) integral of order α > 0 as (I α f )(x) =

1 x ∫(x − y)α−1 f (y)dy. (α) a

The Riemann–Liouville fractional derivative of order α ∈ (0, 1) is defined for absolutely-integrable functions such that (D αR L f )(x) =

d 1−α (I f )(x). dx

For α > 1 the definition is extended as (D αR L f )(x) =

d n n−α (I f )(x), dxn

n = [α] + 1,

n then being the smallest integer larger than α. Hence we have (D αR L f )(x) =

x 1 dn ∫(x − y)n−α−1 f (y)dy, n d x (n − α) a

α ∈ [n − 1, n).

For integer-order derivatives we find the customary derivative. Letting α = n − 1 we have dn 1 x ∫ f (y)dy = f (n−1) (x). (D n−1 f )(x) = d x n (1) a The Caputo fractional derivative is defined in the form (DCα f )(x) = (I n−α

x dn f 1 ∫(x − y)n−α−1 f (n) (y)dy. )(x) = n dx (n − α) a

Direct differentiation and an integration by parts lead to x d x ∫(x − y)n−α−1 f (y)dy = (x − a)n−α−1 f (a) + ∫(x − y)n−α−1 f (y)dy. dx a a

Consequently

D αR L f = DCα f

for functions vanishing at the initial point, f (a) = 0.

486

7 Materials with Memory

The Marchaud fractional derivative of order α is defined as (D Mα f )(x) =

x f (x) − f (y) f (x) α ∫ + dy. α (1 − α)(x − a) (1 − α) a (x − y)α+1

For a = −∞ the derivative reduces to (D Mα f )(x) =

x f (x) − f (y) α ∫ dy. (1 − α) −∞ (x − y)α+1

The change of variable, y → τ = x − y, results in (D Mα f )(x) =

∞ f (x) − f (x − τ ) α ∫ dτ . (1 − α) 0 τ α+1

The idea of Marchaud’s derivative arises by the purpose of extending the Riemann–Liouville integral from α > 0 to α < 0. Now, if −α ∈ [−1, 0) then formally x 1 ∫(x − y)−α−1 f (y)dy. (I −α f )(x) = (−α) a If we let a → −∞, by a change of variable we have (I −α f )(x) =

1 ∞ −α−1 ∫y f (x − y)dy. (−α) 0

However the integral diverges because of 1/y α+1 as y → 0. The idea is to subtract the divergent part and let (D Mα f )(x) =

∞ f (x) − f (x − τ ) α ∫ dτ . (1 − α) 0 τ α+1

Denote by -·. the floor function so that -α. = max{m ∈ Z : m ≤ α}. Since n = -α. + 1 it is convenient to write (DCα f )(x) =

1 x ∫(x − y)-α.−α f (-α.+1) (y)dy, () a

where  = 1 + -α. − α ∈ (0, 1]. It is worth remarking that -α. − α ∈ (−1, 0], which is of interest in that the integral is bounded if f (-α.+1) is bounded as y → x. Let a → −∞. By a change of variable

7.8 Memory Models via Fractional Derivatives

we have (DCα f )(x) =

487

1 ∞ -α.−α (-α.+1) ∫y f (x − y)dy. () 0

We now check whether the fractional derivative and the ordinary derivative commute, (D DCα f )(x) = (DCα D f )(x). By definition, (D DCα f )(x) =

1 ∞ -α.−α ∫y D D -α.+1 f (x − y)dy, () 0

(DCα D f )(x) =

1 ∞ -α.−α -α.+1 ∫y D D f (x − y)dy. () 0

The commutation property then holds. We now compute the Laplace transform of the fractional derivative. In this regard we restrict attention to functions vanishing on (−∞, 0) and then (DCα f )(x) =

1 x -α.−α (-α.+1) 1 x ∫y ∫(x − y)-α.−α f (-α.+1) (y)dy. f (x − y)dy = () 0 () 0

For any s ∈ R++ we consider the Laplace transform of DCα f , ∞

∫ (DCα f )(x) exp(−sx)d x = 0

1 ∞x ∫ ∫(x − y)-α.−α f (-α.+1) (y) exp(−sx)d y d x. () 0 0

Replace x by x = y + (x − y) to obtain ∞

∫ (DCα f )(x) exp(−sx)d x= 0

∞ 1 ∞ -α.−α ∫ξ exp(−sξ)dξ ∫ f (-α.+1) (y) exp(−sy)dy. () 0 0

The change of variable z = s ξ allows us to write ∞



0

0

∫ ξ [α]−α exp(−sξ)dξ = s − ∫ z −1 exp(−z)dz = ()s − .

Hence we have ∞



0

0

∫ (DCα f )(x) exp(−sx)d x = s − ∫ f -α.+1 (y) exp(−sy)dy.

The integral on the right-hand side is the Laplace transform of the (-α. + 1)-th derivative of f . Denote by the subscript L the Laplace transform and recall the known property ( f L( p) (s) = s p f L (s) −

 p−1

k=0 s

p−1−k

f (k) (0+ ),

p ∈ N.

488

7 Materials with Memory

Letting p = -α. + 1 we obtain (DCα f )L (s) = s α f L (s) −

-α.

k=0 s

α−1−k

f (k) (0+ ).

7.8.3 Heat Conduction via Fractional Derivatives The constitutive equation ∞

q(t) = − ∫ κ(s)∇θ(t − s)ds

(7.129)

0

models linear heat conductors with memory. By a direct integration, it follows that the heat flux satisfying the MC equation (6.81) can be written q(t) = −

k∞ ∫ exp(−s/τ )∇θ(t − s)ds, τ 0

which amounts to letting κ(s) =

k exp(−s/τ ) τ

in (7.129). In the literature fractional derivatives are widely applied to model memory effects in connection with equations similar to (6.81) mainly by identifying q with the diffusion flux and replacing ∇θ with the density gradient [131, 226]. Applications are widely developed in viscoelasticity [292] often by referring to the rheological models. Here, for definiteness, we consider a model of heat conduction involving a fractional (time) derivative. Assume the heat flux is characterized by the fractional differential equation [156] q + τα ∂tα q = −k∇θ,

(7.130)

which becomes the MC equation when α = 1. A vector wα arises naturally by the fractional derivative of q. In this regard we let α ∈ [0, 1] so that the relation with the MC equation is immediate (α = 1). By definition we take the α-th derivative in the form ∂tα q =

1 ∞ −α+-α. -α.+1 ∫s ∂t q(t − s)ds, (ν) 0

Letting wα (t) :=

ν = -α. + 1 − α.

1 ∞ −α+-α. -α. ∫s ∂t q(t − s)ds (ν) 0

(7.131)

(7.132)

7.8 Memory Models via Fractional Derivatives

we have

489

∂tα q = ∂t wα .

(7.133)

Hence it follows that (7.130) can be written in the form ˙ α + q = −k∇θ. τα w

(7.134)

As α = 1 Eq. (7.134) becomes the MC equation τ q˙ + q = −k∇θ.

(7.135)

The parameters τα , τ , k are allowed to depend on the temperature θ. We now investigate the compatibility of (7.134) and (7.135); while the MC equation (7.135) proves at once to be thermodynamically consistent the same is not true for (7.134). Thermodynamic consistency The entropy inequality is taken in the standard form ˙ − 1 q · ∇θ ≥ 0. −ρ(ψ˙ + η θ) θ As to (7.135) let θ, q, ∇θ be the independent variables. Hence it follows 1 ρ(∂θ ψ + η)θ˙ + ρ∂q ψ · q˙ + ρ∂∇θ ψ · ∇ θ˙ + q · ∇θ ≤ 0 θ Substitution of q˙ from (7.135) results in ρ kρ 1 ρ(∂θ ψ + η)θ˙ + ρ∂∇θ ψ · ∇ θ˙ − ( ∂q ψ − q) · ∇θ − ∂q ψ · q ≤ 0 τ θ τ ˙ θ, ˙ ∇θ implies The arbitrariness of ∇ θ, ∂∇θ ψ = 0,

η = −∂θ ψ,

∂q ψ =

τ q kρθ

and hence ∂q ψ · q ≥ 0. By the obvious integration we have ψ = (θ) +

τ q2 2kρθ

(7.136)

490

7 Materials with Memory

and the reduced inequality implies k > 0. As to (7.134), let θ, wα , ∇θ be the independent variables. Hence 1 ˙ α − ρ∂∇θ ψ · ∇ θ˙ − q · ∇θ ≥ 0. −ρ(∂θ ψ + η)θ˙ − ρ∂wα ψ · w θ ˙ α from (7.134) and use of the arbitrariness lead to Substitution of w ∂wα ψ =

ρ ∂w ψ · q ≥ 0. τα α

τα q, ρkθ

By integration we find ψ = (θ) +

τα wα · q 2ρkθ

and k > 0. Yet q is not a variable and hence the dependence of ψ on q is contradictory. If, rather, we start with q as a variable then we might conclude ∂q ψ = 0 and this would lead again to the contradiction. The same problem arise if we replace q with wα in (7.134). Compatibility is investigated in [156] by setting the model within equations with summed histories.

7.8.4 Wave Propagation Properties Similarities and differences between the models (7.135) and (7.134) are now investigated. We first examine the system of equations τ q˙ + q = −k∇θ.

ε˙ = −∇ · q + r, By (7.136) we have

(7.137)

ε = (θ) − θ (θ) + λ(θ)q2 ,

where λ=−

θ  τ (θ) . 2ρ θk(θ)

We look for the possibility of discontinuity wave propagation. Let θ, q, r , and then ˙ ∇θ, and ε, ε(θ, q), τ (θ), and k(θ), be continuous whereas θ, ˙ q˙ suffer jump discontinuities across the wave front. The continuity of q and θ, [[q]] = 0,

[[θ]] = 0,

implies that time and space derivatives are related by

7.8 Memory Models via Fractional Derivatives

[[∇ · q]] = −

491

1 ˙ · n, [[q]] U

[[∇θ]] = −

1 ˙ [[θ]]n. U

Since ∂θ ε, ∂q ε, τ , k are continuous across the wave the system (7.137) yields ˙ + ∂q ε · [[q]] ˙ = ∂θ ε[[θ]] ˙ + [[q]] = τ [[q]] Hence we have

1 ˙ · n, [[q]] U

k ˙ [[θ]]n. U

˙ = 0. (τ ∂θ ε U 2 + k∂q ε · n U − k)[[θ]]

˙ )= 0, occur at the speed Nontrivial waves, where [[θ]] U=

  √ k/τ ∂θ ε 1 + k(∂q ε · n)2 /4τ ∂θ ε − 21 k/τ ∂θ ε ∂q ε · n .

Since ∂q ε = 2λq then the speed U depends on the direction of propagation through q · n. For waves propagating in a region at equilibrium, i.e. q = 0, then U=

k/τ ∂θ ε.

We now investigate the system of equations within the scheme of fractional derivatives, (7.138) ε˙ = −∇ · q + r, τα ∂tα q + q = −k∇θ. Again we let θ, q, and r be continuous whereas their derivatives suffer jump discontinuities at x at time t. Let α ∈ R+ \ N. To compute the discontinuity of [[∂tα q]] we observe that, by (7.131), ∂tα q =

1 (ν)



∞ 0

-α.+1

s −β ∂t

q(t − s)ds, β = α − -α. ∈ (0, 1), ν = 1 − β ∈ (0, 1). -α.+1

It is apparent that ∂tα q is well-defined23 for any bounded function ∂t q(t − s). We -α.+1 α now need to compute the possible jump of ∂t q associated with the jump of ∂t q. For generality consider the integral ∞

∫ s −β h(t − s)ds 0

and let h suffer a jump discontinuity [[h]] at time t0 [156]. Hence, as t > t0 , we represent h in the form 23

Moreover, we see that we cannot integrate by parts because s −β is unbounded as s → 0+ .

492

7 Materials with Memory

˜ − s) + [[h]]H (t − s − t0 ), h(t − s) = h(t where h˜ is continuous in a neighbourhood of t0 and H is the Heaviside step function. Consequently ∞



t−t0

0

0

0

˜ − s)ds + [[h]] ∫ s −β ds. ∫ s −β h(t − s)ds = ∫ s −β h(t

Since β ∈ (0, 1) then t−t0

∫ s −β ds = 0

(t − t0 )1−β → 0 as t → t0+ . 1−β

Notice that, for any function g(x, t), [[g]](x, t0 ) = g(x, t0+ ) − g(x, t0− ). Hence it follows that ∞ ∞ ˜ − s)ds]] = 0, [[ ∫ s −β h(t − s)ds]] = [[ ∫ s −β h(t 0

0

though [[h]] )= 0. The crucial point for this result is that β ∈ (0, 1). This implies that -α.+1 [[∂tα h]] = 0 for any order α ∈ R+ \ N though ∂t h has a jump discontinuity. Back to Eqs. (7.138) we observe that [[∂tα q]] = 0 -α.+1

though [[∂t

q]] )= 0. Consequently it follows that [[∇θ]] = 0.

The occurrence of the fractional derivative in (7.138) hinders the appearance of a discontinuity of ∇θ.

7.9 Viscoelastic Models of Fractional Order A simple stress–strain relation of fractional order might be considered in the form24 T R R (t) = Mv ∂tα E(t), where Mv ∈ Lin(Sym, Sym), Mv > 0. For formal convenience let ˜ M(s) = 24

1 Mv s -α.−α . (-α. + 1 − α)

For formal convenience, in this section ∂tα and ∂t denote Lagrangian time derivatives.

7.9 Viscoelastic Models of Fractional Order

493

Hence the constitutive equation might be written in the form ∞

-α.+1 ˜ T R R (t) = ∫ M(s)∂ E(t − s)ds. t 0

Yet a stress–strain relation for solids should allow also for a instantaneous elastic response. In this regard we recall the Kelvin-Voigt model ˙ T R R (t) = M∞ E(t) + Mv E(t),

M∞ , Mv > 0.

At any constant deformation process, E(t) = E0 , as t ≤ t0 , it follows T R R (t) = M∞ E0 . This may suggest that we take the stress–strain relation in the form T R R (t) = M∞ E(t) + Mv ∂tα E(t),

(7.139)

Hence we have ∞

-α.+1 t ˜ T R R (t) := T (E(t), Ert ) = M∞ E(t) + ∫ M(s)∂ E (s)ds. t 0

Letting α ∈ (n − 1, n) and β := α − -α. ∈ (0, 1) we consider T R R in the form ∞

n ˜ T R R (t) = M∞ E(t) + ∫ M(s)∂ t E(t − s)ds, 0

˜ M(s) = [(1 − β)]−1 Mv s −β .

(7.140) Restrict attention to n = 1 and hence α ∈ (0, 1). The constitutive equation (7.140) ˜ ∈ L 1 (R+ ) rather than L 1 (R+ ). By analogy with the looks like (7.56) but here M loc procedure for (7.56) we obtain the following result. Theorem 7.7 As n = 1, the constitutive equation (7.140) is consistent with the second law of thermodynamics only if T . Mv > 0, M∞ = M∞

(7.141)

−β cos ωs ds > 0, ∀ω > 0. while25 ∫∞ 0 s

We now let T , Mv = MvT , M∞ = M∞

lim E(τ ) = 0,

τ →−∞

and look for the whole thermodynamic consistency of (7.140) as n = 1. The work performed by the stress tensor in connection with the history Et is t t ∞ τ ˜ W (Et ) = ∫ T (Eτ ) · ∂τ E(τ )dτ = ∫ M∞ E(τ ) + ∫ M(s)∂ τ E (s)ds · ∂τ E(τ )dτ . −∞

25

−∞

−β cos ωsds = πω β−1 /2(β) cos(βπ/2). ∫∞ 0 s

0

494

7 Materials with Memory

˜ to R− by letting For technical convenience we extend M ˜ M(s) = 0 as s < 0. A change of variables yields t

τ

−∞

−∞

˜ − u)∂u E(u)du] · ∂τ E(τ )dτ W (Et ) = ∫ [M∞ E(τ ) + ∫ M(τ = 21 E(t) · M∞ E(t) +

1 2

= 21 E(t) · M∞ E(t) +

1 2

t

t

˜ ∫ ∫ ∂τ E(τ ) · M(|τ − u|)∂u E(u)du dτ (7.142)

−∞ −∞ t

t

˜ 1 − s2 |)∂t Et (s2 )ds1 ds2 . ∫ ∫ ∂t Et (s1 ) · M(|s

−∞ −∞

This suggests that we consider the work function as an energy functional, namely ρψˆ w (E(t), Ert ) = 21 E(t) · M∞ E(t) +

1 2

t

˜ 1 − s2 |)∂t Et (s2 )ds1 ds2 . ∫ ∂t Et (s1 ) · M(|s

−∞

In order to prove that ρ∂E ψˆ w (E(t), Ert ) = T(E(t), Ert ) we extend the definition of to R− by letting Et (s) = 0 as s < 0. Since E˙ + Eδ(t) is the distributional derivative of the extension of Et we obtain ∞ ∞ τ ˜ ρψˆ w (t) = W (t) = 21 E(t) · M∞ E(t) + ∫ ∫ M(s)∂ τ E (s)ds · [∂τ E(τ ) + δ(t − τ )E(t)]dτ

−∞ 0 ∞ ∞ ∞ τ t 1 ˜ ˜ = 2 E(t) · M∞ E(t) + ∫ ∫ ∂τ E(τ ) · M(s)∂ τ E (s)ds dτ + E(t) · ∫ M(s)∂ t E (s)ds −∞ −∞ 0 ∞ ∞ ∞ t ˜ − u)∂u E(u)du dτ . ˜ = 21 E(t) · M∞ E(t) + E(t) · ∫ M(s)∂ t E (s)ds + ∫ ∫ ∂τ E(τ ) · M(τ −∞ −∞ 0

The last term is independent of the present value E(t) and the sought result ∞

t ˜ ρ∂E φˆ w = M∞ E(t) + ∫ M(s)∂ t E (s)ds 0

follows. Moreover we find ˙ ˙ 0 = W˙ (t) − T R R (t) · E(t) = ρψ˙ w (t) − T R R (t) · E(t) ˙ ˙ + ρd ψˆ w (E(t), Ert |E˙ rt ) − T R R (t) · E(t) = ρd ψˆ w (E(t), Ert |E˙ rt ) = ρ∂E ψˆ w (E(t), Ert ) · E(t)

and hence d ψˆ w = 0. To check the minimum property at constant histories we extend by continuity the ˙ ) = 0 as τ > t. history Et to R− by letting Et (s) = Et (0) = E(t) as s < 0 so that E(τ Accordingly by Eq. (7.142) we have

7.9 Viscoelastic Models of Fractional Order

495 ∞

˙ )du dτ ρψˆ w (E(t), Ert ) = W (t) = 21 E(t) · M∞ E(t) + ∫ E(τ −∞ ∞

˜ ∗ E]τ ˙ ) · [M ˙ )dτ , = 21 E(t) · M∞ E(t) + ∫ E(τ −∞

where ∗ denotes the Fourier convolution. Accordingly by Parseval–Plancherel theorem we obtain ρψˆ w = 21 E(t) · M∞ E(t) +

1 2π



˜ F (ω)iωE F (ω)dω, ∫ (iωE F )∗ (ω)M

−∞

whence ρψˆ w = 21 E(t) · M∞ E(t) +

1 2π



˜ c (ω)Ec (ω) + Ec (ω)M ˜ c (ω)Ec (ω)]dω. ∫ ω 2 [Ec (ω)M

−∞

˜ c > 0 and Ec , Es = 0 for constant histories then the minimum property holds. Since M The thermodynamic consistency of (7.140) with n > 1 seems to be an open problem [333]. The present proof does not apply any longer. To look for a functional of ˜ of finite extent, M(s) ˜ the Graffi-Volterra type we might take M = 0 as s ≥ d. In this case, letting u

˜ ˇ M(u) = ∫ M(s)ds, 0

d

˜ Md = ∫ M(s)ds, 0

we can integrate by parts to obtain ∞

d

d

0

0

0

n t n t n+1 t ˜ ˜ ˇ ∫ M(s)∂ E (s)ds, t E (s)ds = ∫ M(s)∂t E (s)ds = ∫ M(s)∂t

thus avoiding the unboundedness of the kernel. Hence the condition T R R = ρ∂E ψˆ (E(t), Ert ) holds with d

n+1 t ˇ ˆ E (s)ds + ρ(θ, Ert ). ψ(E(t), Ert ) = 21 E(t) · M∞ E(t) + E(t) · ∫ M(s)∂ t 0

ˇ is bounded, so far a functional  with n > 1 seems to be unavailable. Though M

Chapter 8

Aging and Higher-Order Grade Materials

This chapter investigates some aspects of non-simple materials. First, aging materials are examined. Aging is a general problem of the modelling of materials. Mathematically the problem is made formal by letting constitutive parameters depend on time. It is pointed out that the second law of thermodynamics might select physically admissible evolutions of the parameters. Next constitutive equations are considered which involve higher-order derivatives, in time or in space (materials of differential type). The dependence on time derivatives is an alternative to the functionals of histories when memory effects are of interest. Yet the time derivative is not objective and hence recourse is needed to objective time derivatives. The dependence on higher-order space derivatives is a way of modelling a non-simple behaviour where the response of the material, at a point, is influenced by appropriate fields in a neighbourhood of the point, and this is modelled by some space derivatives, often of the deformation gradient. In this framework materials of the Korteweg type are investigated. Materials of higher-order grade are often associated with an hyperstress tensor thus generalizing the form of the mechanical power. The thermodynamic consistency of a scheme with a balance of energy involving an hyperstress is investigated.

8.1 Aging In general, aging of materials consists of any change in the constitutive properties. In essence there are two types of aging: a physical aging, which is due to thermodynamic processes, and a chemical aging, which is determined by chemical reactions. Although their origins differ, the macroscopic manifestations of physical and chemical aging may be modelled by a variation of the constitutive parameters. In materials with memory, the response of a non-aging substance to an external stimulus changes with the passage of time in a way, described e.g. by the relaxation © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_8

497

498

8 Aging and Higher-Order Grade Materials

modulus or the creep compliance, that is independent of the time at which the experiment is started. Instead, the properties of aging materials change with the passage of time even in the absence of external agents. Hence two time scales are required to unambiguously describe the constitutive properties of an aging material. One time scale is needed to keep track of the time since the manufacturing of the material and the other time scale is needed to keep track of the time the material is under an external stimulus. Aging is a time-dependent process. Aging degradation of mechanical structures reduce the strength of structures or their components. The aging degradation of the strength of a structure can be expressed by R(t) = R0 g(t), where R0 is the component capacity in the initial state (say at t = 0) and g(t) is time dependent degradation function. The literature shows many examples of functions g. For instance aging degradation for steel is assumed to be given by g(t) = 1 −

1 , 1 + a exp(−bt)

(8.1)

where a and b are constant parameters [69]. Another example is related to the aging of polymers; a classical model of degradation is expressed by g(t) = exp[−(t/τ )β ],

(8.2)

where β is a shape parameter and τ is the relaxation time [257]. It seems natural to view aging as a process which is related to the constitutive properties of the material. This suggests that we incorporate aging within the set of constitutive properties. Hence admissible aging processes are required to comply with the second law of thermodynamics. This view is embodied within a thermodynamic theory by comprising the time t among the independent variables. Hence if α is the set of physical variables we let ψ = ψ(α, t) and the like for the constitutive equations of, e.g., η, T, q. The entropy inequality then becomes 1 ˙ + ∂t ψ) + T · D − q · ∇θ ≥ 0 −ρ(∂α ψ · α θ where ∂t ψ is the time derivative given by the explicit dependence on t that models the aging property. We now examine how aging effects may be modelled within the schemes of thermoelastic solids, rate-type materials, and thermo-viscoelastic materials.

8.2 Aging of Thermoelastic Solids

499

8.2 Aging of Thermoelastic Solids A thermoelastic solid is characterized by the constitutive equation T R R = CE + ϑM,

(8.3)

possibly with T in place of T R R if the linear approximation is considered. For a nonaging material C and M are constant tensors. For an aging material it is natural to take C and M as dependent on time. This in turn suggests that the constitutive functions depend on E, θ, ∇θ, t, the dependence on t being thought as representing the changing properties of the material. Hence we let ψ(E, θ, ∇θ, t) be the constitutive function for the free energy ψ and the like for η, T and q. The entropy inequality becomes 1 −ρ R (∂θ ψ + η)θ˙ − ρ R ∂∇R θ ψ · ∇R˙θ + (T R R − ρ R ∂E ψ) · E˙ − q R · ∇R θ − ρ R ∂t ψ ≥ 0. θ ˙ E˙ implies that The arbitrariness of ∇R˙θ, θ, ∂∇R θ ψ = 0, and

η = −∂θ ψ,

T R R = ρ R ∂E ψ,

1 − q R · ∇R θ − ρ R ∂t ψ ≥ 0. θ

Since ∂∇R θ ψ = 0 then ∂t ψ remains unaltered by letting ∇R θ = 0, which implies that ∂t ψ ≤ 0. Restrict attention to the linear stress relation (8.3). To account for aging we let C and M depend on time. Since ρ R ψ = (θ) + 21 E · CE + ϑM · E then we have

˙ + ϑM ˙ · E. ρ R ∂t ψ = 21 E · CE

For definiteness we look at isotropic solids so that M = m1,

CE = 2μE + λ(tr E)1 = 2μE0 + κ(tr E)1,

500

8 Aging and Higher-Order Grade Materials

μ, λ being the Lamé moduli and κ = λ + 2μ/3, and T R R = 2μE0 + κ(tr E)1 + m ϑ 1. In stress-free conditions we have E0 = 0,

0 = κ(tr E) + mϑ.

Since tr E  ∇R · u and ∇R · u is the relative variation of the volume then −m/κ is the coefficient of thermal expansion (in R). We assume that m < 0 so that, since κ > 0, the body expands when the temperature increases. For isotropic solids we have ρ R ψ = (θ) + μE0 · E0 + κ(tr E)2 + mϑtr E and hence ˙ 0 · E0 + κ(tr ˙ E)2 + mϑtr ˙ E. ρ R ∂t ψ = μE In stress-free conditions ϑ = −κ tr E/m. Consequently ˙ 0 · E0 + (κ˙ − κm/m)(tr ˙ E)2 . ρ R ∂t ψ = μE This implies that μ˙ ≤ 0,

κ˙ −

κm˙ ≤ 0. m

Let κ > 0. The second inequality can be given the form κ˙ m˙ + ≤ 0. κ |m| In a thermoelastic material, aging produces a decrease of μ. A joint decrease of μ, κ, and m is consistent with thermodynamics. Yet, since m < 0 then an increase of m is more realistic, m˙ ≥ 0. This is consistent provided that m˙ κ˙ ≤− , |m| κ

κ˙ ≤ 0.

The coefficient of thermal expansion α = −m/3κ satisfies α˙ = −

κ˙  m  m˙ − 3κ m κ

and hence α˙ ≥ 0.

8.3 Aging of Rate-Type Materials

501

As a consequence of aging the ratio tr E ϑ increases; aging solids expand more and more per increment of temperature.

8.3 Aging of Rate-Type Materials For definiteness we set up a model of a rate-type material by starting with a rheological model; next we establish a model of material within the continuum framework by the standard analogies stress-force and strain-elongation. We consider the rheological scalar model called standard viscoelastic solid. It consists of a Maxwell unit, where a spring and a dashpot are connected in series, placed in parallel with a single spring. Let εs , εd be the strain1 of the spring and that of the dashpot of the Maxwell unit. The common strain ε of the elements in parallel is the sum of εs and εd , ε = εs + ε d . Let σe be the stress on the isolated spring and σm the stress on the Maxwell unit. The total applied stress is the sum of them, σ = σe + σm . Let k, ke be the elastic modulus of the spring in the Maxwell unit and γ the viscosity of the dashpot so that σm = kεs = γ ε˙d . σe = ke ε, Hence we have ε = εs + εd =

1 ε˙d + εd , α

α=

k . γ

(8.4)

Letting k and ke be constants we obtain ε˙ = ε˙s + ε˙d =

1 1 1 1 σ˙ m + σm = (σ˙ − ke ε) ˙ + (σ − ke ε). k γ k γ

Let g0 = ke + k,

g∞ = ke .

We then find that the evolution of the standard viscoelastic solid is governed by

1

Denoted ε(s) and ε(d) in Chap. 6.

502

8 Aging and Higher-Order Grade Materials

σ˙ = g0 ε˙ − α(σ − g∞ ε).

(8.5)

A rate-type model of materials is considered by viewing the stress as the sum of an elastic part and a viscoelastic part; for formal simplicity we let the material be a non-conductor. In the referential scheme we let T R R = Tˆ R R (θ, E) + T R R . To account for aging we let the time t be one of the independent variables. Hence we let T R R = Tˆ R R (θ, E, t) + T R R and let θ, E, T R R , t be the set of independent variables. The entropy inequality reads ˆ R R + T R R − ρ R ∂E ψ) · E˙ − ρ R ∂t ψ ≥ 0. −ρ R (∂θ ψ + η)θ˙ − ρ R ∂T R R ψ · T˙ R R + (T To determine consequences of this inequality we need to know possible relations ˙ among T˙ R R and E. We now let the stress T R R satisfy the analogue of the differential equation (8.5). Both a rate-type behaviour and possible anisotropies are modelled by assuming the relation τ T˙ R R −  0 E˙ + T R R − G∞ E = 0, where τ > 0 may be viewed as a relaxation time. For formal convenience we let G0 =  0 /τ so that we can write ˙ + T R R − G∞ E = 0 τ (T˙ R R − G0 E)

(8.6)

and assume G0 , G∞ are symmetric and positive-semidefinite fourth-order tensors on Sym. Both τ and G0 , G∞ are allowed to depend on the temperature θ. As τ → 0 Eq. (8.6) simplifies to T R R = G∞ E, whereby the material behaves elastically with G∞ as elastic relaxation. If instead τ → ∞ then 1 T R R − G∞ E) = 0 T˙ R R − G0 E˙ + (T τ shows an instantaneous hypoelastic behaviour, ˙ T˙ R R = G0 E, G0 being the elastic modulus. Borrowing from the viscoelastic model we let

8.3 Aging of Rate-Type Materials

503

G0 > G∞ ; further G∞ > 0 for solids and G∞ = 0 for fluids. Upon computation of ψ˙ and substitution of T˙ R R from (8.6) we obtain ρ ˆ R R + T R R − ρ R ∂E ψ − ρ R G0 ∂T −ρ R (∂θ ψ + η)θ˙ + R ∂T ψ · (T R R − G∞ E) + (T ψ) · E˙ − ρ R ∂t ψ ≥ 0. RR RR τ

The arbitrariness and the linearity of θ˙ and E˙ imply the classical relation η = −∂θ ψ and (8.7) ρ R ∂E ψ = Tˆ R R + T R R − ρ R G0 ∂T R R ψ, ρR T R R − G∞ E) − ρ R ∂t ψ ≥ 0. ∂T R R ψ · (T τ

(8.8)

To determine the free-energy function we consider the inequality T R R − G∞ E) ≥ 0, ρ R ∂T R R ψ · (T

(8.9)

which follows from (8.8) for non-aging materials or by letting the aging effect ∂t ψ as small as we please. Inequality (8.9) holds if T R R − G∞ E), ρ R ∂T R R ψ = A(T

(8.10)

A being a non-negative, symmetric fourth-order tensor, possibly dependent on θ, E, T R R and t. For definiteness, assume A is positive definite and depends only on θ and t. Moreover let ψ be a C 2 function. Now2 ρ R ∂E ∂T R R ψ = −AG∞ . Substitution of ρ R ∂T R R ψ results in T R R − G∞ E). ρ R ∂E ψ = Tˆ R R + T R R − G0 A(T Hence we have ρ R ∂T R R ∂E ψ = I − G0 A. The requirement ∂T R R ∂E ψ = (∂E ∂T R R ψ)T then implies I − G0 A = −G∞ A whence

2

A = (G0 − G∞ )−1 .

For any pair of tensors E, T we let (∂E ∂T ψ)hki j = ∂ Ei j ∂Thk ψ.

504

8 Aging and Higher-Order Grade Materials

Thus A is positive definite if G0 > G∞ . The function ψ is subject to (8.10) and, since G0 = G∞ + A−1 , ρ R ∂E ψ = Tˆ R R + T

RR

T − G0 A(T

RR

Let ψ0 (θ, E) be subject to

T R R − G∞ E) − G∞ E) = Tˆ R R + T R R − (G∞ + A−1 )A(T ˆ T R R − G∞ E) + G∞ E. = T R R − G∞ A(T

ˆ RR. ρ R ∂E ψ 0 = T

Therefore upon integration of ρ R ∂E ψ and ρ R ∂T R R ψ we find ρ R ψ in the form T R R − G∞ E) · A(T T R R − G∞ E). ρ R ψ = ρ R ψ0 (θ, E) + 21 [E · G∞ E + (T

(8.11)

Hence, to within the function ψ0 , the free energy ρ R ψ is a quadratic form and is positive definite provided G∞ and A (and hence G0 ) are positive definite.

8.3.1 Aging Properties We now go back to the dissipation inequality (8.8). The aging properties are modelled by letting G∞ and A depend on time. To save writing let Y = T R R − G∞ E. Hence we have ˙ ∞ E + 1 Y · AY ˙ − (G ˙ ∞ AY) · E. ρ R ∂t ψ = 21 E · G 2 Inequality (8.8) then takes the form ˙ ∞ E − 1 Y · AY ˙ + (G ˙ ∞ AY) · E ≥ 0. τ −1 Y · AY − 21 E · G 2

(8.12)

To characterize the properties of G∞ and A we establish the following result. Theorem 8.1 Let A > 0, G∞ ≥ 0. If ˙ ∞ < 0, G

˙ 0 ≥ −2τ −1 (G0 − G∞ ) G

(8.13)

then the free energy (8.11) satisfies (8.8). ˙ ∞ A > 0. Let K be a fourth-order positive˙ ∞ < 0 then −G Proof Since A > 0 and G definite tensor. Hence we can write

8.3 Aging of Rate-Type Materials

505

˙ ∞ AY] · E = [K−1/2 G ˙ ∞ AY] · K1/2 E. [G ˙ ∞ . It follows Let K = −G ˙ ∞ AY] · E = [K1/2 AY] · K1/2 E]. [G We now apply Cauchy’s inequality |ME1 · NE2 | ≤ 21 ME1 · ME1 + 21 NE2 · NE2 , E1 , E2 ∈ Sym, M, N ∈ Lin(Sym) to obtain [K1/2 AY] · [K1/2 E] ≤ 21 K1/2 AY · K1/2 AY + 21 KE · KE ≤ 21 [AKAY] · Y + 21 [KE] · E. Consequently

and

˙ − AG ˙ ∞ A)Y ρ R ∂t ψ ≤ 21 Y · (A

˙ − AG ˙ ∞ A)Y. τ −1 Y · AY − ρ R ∂t ψ ≥ τ −1 Y · AY − 21 Y · (A

Now, since A−1 A = I then

˙ ˙ −1 A−1 = −A−1 AA

and hence ˙ − AG ˙ ∞ A = A[A−1 AA ˙ ∞ ]A = −A[A˙−1 + G ˙ ∞ ]A ˙ −1 − G A Since A−1 = G0 − G∞ then

whence

˙ ˙∞ =G ˙ 0, A−1 + G

˙ 0 A]Y. τ −1 Y · AY − ρ R ∂t ψ ≥ Y · [τ −1 A + 21 AG

In view of (8.13) we find

τ −1 Y · AY − ρ R ∂t ψ ≥ 0

and hence inequality (8.8) holds.



In essence, this theorem tells us that G0 > G∞ ≥ 0 allows for a positive-definite tensor A satisfying the dissipation inequality for non-aging materials. Furthermore it follows that the entropy inequality is guaranteed by non-increasing tensor func˙ 0 is bounded by (8.13). Indeed, integration of the second tions G0 (t), G∞ (t) while G inequality in (8.13) on [0, t] results in

506

8 Aging and Higher-Order Grade Materials

G0 (t) ≥ G0 (0) exp(−2t/τ ) +

2 t ∫ G∞ (s) exp(−2(t − s)/τ )ds; τ 0

˙ 0 < 0) bounded by the right-hand side. If G∞ = 0 then G0 is allowed to decrease (G G0 (t) ≥ G0 (0) exp(−2t/τ ).

(8.14)

This result for G0 may be contrasted with the models (8.1) and (8.2). By (8.1) it follows b exp(−bt) ≥ −ba exp(−bt), g(t) ˙ =− [1 + a exp(−bt)]2 consistent with (8.14). By (8.2) we find g(t) ˙ =−

β β−1 t g(0) exp[−(t/τ )β ]. τβ

Consistency with (8.14) requires that β ≥ 1. The entropy inequality does not seem to place significant restrictions on the possible dependence of τ on time. Since G0 =  0 /τ > 0 if, e.g.,  0 is constant then (8.13), i.e. 1 2 − 2  0 τ˙ ≥ − (G0 − G∞ ), τ τ implies G0 τ˙ ≤ 2(G0 − G∞ ). Any negative value of τ˙ would be admissible but also τ˙ I ≤ G−1 0 (G0 − G∞ ) would be allowed.

8.4 Aging of Thermo-Viscoelastic Materials A model of thermo-viscoelastic material (with memory) is established by looking at the rheological Eq. (8.4), (8.15) ε˙d + α εd = α ε. Viewing (8.15) as a differential equation for εd we can integrate on [t0 , t] to obtain t

εd (t) = εd (t0 ) exp[−α(t − t0 )] + ∫ α exp[−α(t − s)]ε(s)ds. t0

Let α > 0. Hence the limit as t0 → −∞ results in t

εd (t) = ∫ α exp[−α(t − s)]ε(s)ds. −∞

Since α = k/γ and

8.4 Aging of Thermo-Viscoelastic Materials

507

σ = ke ε + kεs = (ke + k)ε − kεd we find the stress-strain relation t

σ(t) = (ke + k)ε(t) − ∫ (k 2 /γ) exp[−α(t − s)]ε(s)ds −∞

which involves both the present value ε(t) and the past history ε(s), s ∈ (−∞, t). In terms of g0 = ke + k, g∞ = ke we define g(ξ) = g∞ + (g0 − g∞ ) exp(−αξ). Hence the stress σ can be written in the form t

σ(t) = g0 ε(t) − ∫ ∂s g(t − s)ε(s)ds. −∞

This is a further argument supporting the model of viscoelastic solids via a Boltzmannlike representation of the stress-strain relation. A tensorial representation of the stress-strain relation is defined as follows. Let D = {(t, s) : t ∈ R, s ∈ (−∞, t]} or D = {(t, s) : t ∈ R, s ≤ t}. Let G : D → Lin(Sym) be the relaxation function such that the stress T is given by t

T(x, t) = G0 (t)E(x, t) − ∫ ∂s G(t, s)E(x, s)ds, −∞

G0 (t) := G(t, t).

(8.16)

Upon a change of variable, s → u = t − s, the t-dependent relaxation function ˜ : R × R+ → Lin(Sym), G

˜ u) = G(t, t − u) G(t,

allows the stress-strain relation to be written in the form ∞

˜ u)E(x, t − u)du, T(x, t) = G0 (t)E(x, t) + ∫ ∂u G(t, 0

where ˜ 0) = G(t, t), G0 (t) = G(t, Alternatively, letting

˜ u) = ∂u G(t, t − u) = −∂s G(t, s). ∂u G(t,

˜ u) G∞ (t) = lim G(t, u→∞

508

8 Aging and Higher-Order Grade Materials

we can write ∞

˜ u)[E(x, t) − E(x, t − u)]du. T(x, t) = G∞ (t)E(x, t) − ∫ ∂u G(t, 0

For ease in writing let ˜ u), G (t, u) = −∂u G(t,

(t, u) ∈ R × R+ ;

the function G will be referred to as t-dependent relaxation kernel. Hence we have ∞

T(x, t) = G∞ (t)E(x, t) + ∫ G (t, u)[E(x, t) − E(x, t − u)]du.

(8.17)

0

We now restrict attention to the stress-strain relation (8.17). Consistent with the scalar case, in order to model viscoelastic solids the relaxation modulus G∞ (t) is assumed to be positive definite for every t ∈ R, namely E · G∞ (t)E > 0

∀E ∈ Sym.

˜ u) reduce to G(t − We remark that, for non-aging materials, G(t, s) and G(t, s) and G(u), respectively. In addition, ∂s G(t, s) reduces to ∂s G(t − s) = −G (t − ˜ u) reduces to −G (u). Hence, as expected, G0 (t) and s) while G (t, u) = −∂u G(t, G∞ (t) reduce to the constant tensors G0 and G∞ . The relaxation kernel is assumed to satisfy the following properties. (M1) (t, u) → G (t, u) ∈ L ∞ (C ) for every compact set C ⊂ R × R+ (M2) For every fixed t ∈ R the map u → G (t, u) is positive semi-definite, absolutely continuous and summable on R+ . Then, for every t ∈ R, ∞

∫ G (t, u)du = G0 (t) − G∞ (t) ≥ 0. 0

Moreover, the map is differentiable for all t ∈ R+ and (t, u) → ∂u G (t, u) ∈ L ∞ (C ) for every compact set C ⊂ R × R+ . (M3) For every fixed u > 0 the map t → G (t, u) is differentiable for all t ∈ R. Moreover, (t, u) → ∂t G (t, u) ∈ L ∞ (C ) for every compact set C ⊂ R × R+ . ˜ may be represented as In view of (M2) the t-dependent relaxation function G u

˜ u) = G0 (t) − ∫ G (t, ξ)dξ. G(t, 0

8.4 Aging of Thermo-Viscoelastic Materials

509

If E(x, ·) is a fading strain history, namely s

E(x, s) = ∫ ∂ξ E(x, ξ)dξ,

lim E(x, s) = 0,

s→−∞

−∞

an integration by parts of (8.16) results in t

T(x, t) = ∫ G(t, s)∂s E(x, s)ds. −∞

This representation allows G(t, s) to be unbounded at s = t. Consequently, t

ˇ s)∂s E(x, s)ds, T(x, t) = G∞ (t)E(x, t) + ∫ G(t, −∞

where

ˇ s) = G(t, s) − G∞ (t). G(t,

Let Edt (s) = E(t) − E(t − s)

s ∈ [0, ∞)

be the relative strain history. Hence we can write T in the form ∞



0

0

T(x, t) = G∞ (t)E(x, t) + ∫ G (t, s)Edt (s)ds = G0 (t)E(x, t) − ∫ G (t, s)E(t − s)ds.

Any admissible free energy ψ is required to satisfy the entropy inequality −ρψ˙ + T · E˙ ≥ 0 whence

(−ρ∂E ψ + T) · E˙ − ρdψ(E(t), Et |E˙ t , t) − ρ∂t ψ ≥ 0.

The arbitrariness of E˙ implies T = ρ∂E ψ. Hence we are left with the dissipation inequality dψ(E(t), Et |E˙ t , t) + ∂t ψ ≤ 0.

(8.18)

The relation T = ρ∂E ψ holds with the free energy in the Graffi-Volterra singleintegral form, ψG (E(t), Edt , t) = 21 E(t) · G∞ (t)E(t) +

1 2



∫ Edt (s) · G (t, s)Edt (s)ds. 0

510

8 Aging and Higher-Order Grade Materials

Aging effects via the functions G∞ (t) and G (t, s) are established by means of the following Theorem 8.2 The stress-strain relation (8.17) is consistent with the dissipation inequality (8.18) if ˙ ∞ ≤ 0 ∀t ∈ R, G

∂t G (t, s) + ∂s G (t, s) ≤ 0 ∀(t, s) ∈ R × R+ .

Proof Let ψ = ψG and observe that ∞

dψG (E(t), Et |E˙ t , t) = ∫ ∂t Et (s) · G (t, s)Edt (s)ds, 0

˙ ∞ (t)E(t) + ∂t ψG = 21 E(t) · G

1 2



∫ Edt (s) · ∂t G (t, s)Edt (s)ds. 0

Upon an integration by parts, since G (t, s) → 0 as s → ∞ and Ert (0) = 0 then it follows ∞



∫ ∂t Et (s) · G (t, s)Edt (s)ds = − ∫ ∂s Edt (s) · G (t, s)Edt (s)ds 0

=

1 2

0 ∞

∫ Edt (s) · ∂s G (t, s)Edt (s)ds. 0

Consequently ∞ ˙ ∞ (t)E(t) + 1 ∫ Et (s) · [∂t G (t, s) + ∂s G (t, s)]Et (s)ds dψ(E(t), Et |E˙ t , t) + ∂t ψ = 21 E(t) · G d d 2 0

and inequality (8.18) follows.



It is worth remarking that for non-aging materials G is independent of t in that G (t, s) = G (s) whence ˜ ∂t G (t, s) + ∂s G (t, s) = ∂s G (s) = −∂s2 G(s). ˜ Hence the assumption ∂t G (t, s) + ∂s G (t, s) ≤ 0 reduces to the convexity of G, ˜ ∂s2 G(s) ≥ 0. A more restrictive assumption implies the validity of the dissipation inequality [105] as is shown in the following Theorem 8.3 The stress-strain relation (8.17) is consistent with the dissipation inequality (8.18) if ˙ ∞ ≤ 0 ∀t ∈ R, G where M(t) ≥ 0.

G (t, s) ∀(t, s) ∈ R × R+ , ∂t G (t, s) + ∂s G (t, s) ≤ −M(t)G

8.5 An Aging Model of Viscous Fluid with Memory

511

Proof Let ψ = ψG and evaluate dψG , ∂t ψG as in the previous theorem. Hence ∞ ˙ ∞ (t)E(t) + 1 ∫ Et (s) · [∂t G (t, s) + ∂s G (t, s)]Et (s)ds dψ(E(t), Et , t|E˙ t ) + ∂t ψ = 21 E(t) · G d d 2

0 ∞ 1 ˙ ≤ 2 E(t) · G∞ (t)E(t) − 21 M(t) ∫ Edt (s) · G (t, s)Edt (s)ds. 0

The two assumptions imply the dissipation inequality (8.18).



8.5 An Aging Model of Viscous Fluid with Memory An aging model of viscous fluid with memory might be expressed by letting the memory function depend on time. Formally we might replace m(s) of (7.76) with α2 (t)m(s), which means that the viscous property changes in time according to the function α2 (t). Hence let ∞

T = − p(ρ, θ)1 + α2 (t) ∫ m(s)D(t − s)ds. 0

For formal convenience let α > 0. Owing to the occurrence of α(t) we can view ρ, θ, Dt , t as the set of independent variables, the partial dependence on t being induced by α(t). Upon evaluation of ψ˙ and substitution in the entropy inequality we find ˙ t ) − ρ∂t ψ −ρ(∂θ ψ + η)θ˙ + (ρ2 ∂ρ ψ − p)∇ · v − ρ dψ(ρ, θ, Dt |D ∞

+D · α2 (t) ∫ m(s) D(t − s)ds ≥ 0. 0

In addition to the classical relations η = −∂θ ψ, p = ρ2 ∂ρ ψ we obtain the dissipation inequality ∞

˙ t ) − ρ∂t ψ + D · α2 (t) ∫ m(s) D(t − s)ds ≥ 0. −ρ dψ(ρ, θ, Dt |D 0

By taking the free energy function in the form ∞  2 ψ = (ρ, θ) + β α(t) ∫ m(s) D(t − s)ds  0

512

8 Aging and Higher-Order Grade Materials

we have ∞ 2 ∞  α(1 − 2ρβm(0))D · ∫ m(s) D(t − s)ds − 2βρα˙  ∫ m(s) D(t − s)ds  0

0

∞  ∞  −2ρβα ∫ m(s) D(t − s)ds · ∫ m (s) D(t − s)ds ≥ 0. 0

0

The inequality holds if and only if β = 1/2ρm(0), m(s) = m(0) exp(−γs), and, furthermore α˙ ≤ 0. This condition means that, by the aging effect the viscosity property described by the memory function α(t)m(s) decreases in time.

8.6 Damage Continuum mechanics describes the damage as a degradation of the material. The degradation may result from different mechanisms typically related to the microstructure of the material and defects such as microcracks, growth of pores, rupture of fibres embedded in a matrix. Detailed descriptions may model the evolution of defects; here we look at macroscopic models of damage in terms of macroscopic variables. The damage variable D is most often a scalar and D ∈ [0, 1]; D = 1 represents the undamaged state and D = 0 corresponds to the fully degraded state. If D is expressed by an internal variable, say d, examples of dependences are D = 1 − d, d ∈ [0, 1],

D = exp(−ηd d), d ∈ [0, ∞),

the internal variable d being positive and typically restricted by d˙ ≥ 0. Sometimes D is determined by a number of processes undergone by the material. For instance, if Ni (Si ), i = 1, 2, ..., N D , is the number of cyclic processes with amplitude Si and i Ni (Si ) leads to failure then a sequence n i (Si ) is said to produce a damage ND  n i (Si ) . D= Ni (Si ) i=1 In a damaged linearly elastic body the effective Lamé constants are regarded to be 1 − D times the values of the constants for an undamaged body, T = 2μ(1 − D)ε + λ(1 − D)(tr ε)1. In turn, the effective Young’s modulus E˜ is the fraction (1 − D) of the nominal value,

8.6 Damage

513

μ[2μ + 3λ] μ(1 − D)[2μ(1 − D) + 3λ(1 − D) = (1 − D) = (1 − D)E. E˜ = μ(1 − D)[μ(1 − D) + λ(1 − D) μ+λ Consistently, Poisson’s modulus ν = λ/2(μ + λ) is left unchanged. In the case of anisotropic damage, due to a distribution of cracks, the damage is modelled as a second-order tensor D in the form [246] D=



ωi ni ⊗ ni ,

i

where ωi is the cracks density in the direction ni . If θ, ε, D are the independent variables then the Clausius-Duhem inequality3 −ρ(∂θ ψ + η)θ˙ + (T − ρ∂ε ψ) · ε˙ − ρ∂D ψ D˙ ≥ 0 reduces to

∂D ψ D˙ ≤ 0.

This suggested that −ρ∂D ψ D˙ may be viewed as the dissipation of the material due to damage and hence that F = −ρ∂D ψ be the force due to the damage process; we let F ≥ 0. Then it is introduced a damage yield function f in the form [296] f (F, D) = F − κ(D), where κ is a positive function representing the resistance of the material to the damage propagation. For the damage evolution it is assumed that  = 0 if f ≤ 0, D˙ > 0 if f = 0,

f˙ < 0, f˙ = 0;

the damage D is constant if the material is below the yield limit and increases if the material is in the yield limit. If ρψ = ρψ0 (θ) + 21 ε · C(D)ε then F = −ε · ∂D Cε; the assumption F ≥ 0 means that ∂D C ≤ 0, which is consistent with the decreasing property of the Lamé moduli.

3

Here we take T · ε˙ as the linear approximation of the power in elasticity.

514

8 Aging and Higher-Order Grade Materials

8.7 Fluids of Higher-Order Grade The inadequacy of the classical Navier-Stokes theory to describe the dynamics of rheologically complex fluids has led to the development of several theories of nonNewtonian fluids. Among the many models which have been used to model the nonNewtonian behaviour, the fluids of differential type have been developed at various levels of mathematical complexity. In general, fluid of differential type means that the constitutive equations involve appropriate derivatives of the stretching D, in addition to D itself. A fluid model of grade n denotes a model where time derivatives of the stretching D occurs up to the (n − 1)-order. In this sense, a Newtonian fluid is a first-order fluid. Among fluids of higher-order grade, the incompressible fluid of grade 2 has received an especially large amount of attention. Further, the fluid has been modelled via the Rivlin-Ericksen tensors. Other descriptions are equally admissible in connection with the objectivity principle. As with the Rivlin-Ericksen tensors, we can define a sequence of kinematic tensors by letting 

A n+1 = A n ,

A 1 = D, 

denoting a generic objective time derivative. For definiteness we let A of any tensor A be defined by 



˙ − WA A − A WT + ν1 D A + ν2 A D + ξ tr D A , A =A ν1 , ν2 , ξ being scalar quantities; they are allowed to be given by functions of ρ and θ. For technical convenience we require that symmetry is preserved by the derivative in that A ∈ Sym



=⇒ A ∈ Sym.

˙ ∈ Sym and Since A ∈ Sym implies A A + (WA A)T ∈ Sym, A + A WT = WA WA (ν1 D A + ν2 A D)T = ν1 A D + ν2 D A it follows that symmetry is preserved for any A and D if and only if ν1 = ν2 . Hence we define  ˙ − WA A − A WT + ν(DA A + A D) + ξ tr D A . A := A

8.7 Fluids of Higher-Order Grade

515

8.7.1 Second Grade Fluid For an easy comparison with the standard fluid of grade 2 in the literature we generalize the definition of fluid by letting the Cauchy stress T be given by 

T = − p1 + 2μ D + λtr D 1 + β1 D +β2 D D.

(8.19)

The essential difference with the standard model of fluid of grade 2 is that the generic 

derivative  is used for D while the Rivlin-Ericksen tensors involve the Cotter-Rivlin derivative. Moreover the fluid is allowed to be compressible and the compressibility effects occur explicitly in λtr D and ξtr D besides the implicit occurrence in D. Since Eq. (8.19) is an equation of differential type, and not of rate type, an obvious consistency suggests that we describe heat conduction via a Fourier-type law rather than a rate-type law. To investigate the thermodynamic consistency of the stress function (8.19) we let 

ρ, θ, D, D, ∇θ be the independent variables. The constitutive functions ψ, η, q, and T are required to satisfy the second-law inequality 1 −ρψ˙ − ρη θ˙ + T · D − q · ∇θ ≥ 0. θ Substitution of ψ˙ and (8.19) allow the inequality to be given the form 

˙ − ρ∂ ψ · (D)˙− ρ∂∇θ ψ · (∇θ)˙ −ρ(∂θ + η)θ˙ + (ρ2 ∂ρ ψ − p)tr D − ρ∂D ψ · D D



+2μD · D + λ(tr D)2 + β1 D ·D + β2 (DD) · D ≥ 0. Observe



¨ + [−WD − DWT + 2νDD + ξ(tr D)D]˙, (D)˙ = D 

and (∇θ)˙ = ∇ θ˙ − LT ∇θ. Hence any value of (D)˙ ∈ Sym is obtained by an appro¨ which, thanks to b, can be taken to assume arbitrary values. priate value of D ˙ Of course Likewise any value of (∇θ)˙ is obtained by an appropriate value of ∇ θ.   ˙ ∂ ψ · (D)˙, and ∂∇θ ψ · (∇θ)˙are linear terms in θ, ˙ (D)˙and (∇θ)˙. The arbi(∂θ + η)θ, D



˙ (D)˙and (∇θ)˙implies that trariness of θ, η = −∂θ ψ,

∂ ψ = 0, D

∂∇θ ψ = 0.

516

8 Aging and Higher-Order Grade Materials

Moreover, since only q can depend on ∇θ it follows that q · ∇θ ≤ 0 and 

˙ + 2μD · D + λ(tr D)2 + β1 D ·D + β2 (DD) · D ≥ 0. (ρ2 ∂ρ ψ − p)tr D − ρ∂D ψ · D Observe (WD) · D = (DD) · W = 0, and hence

(DWT ) · D = (DD) · W = 0



˙ · D + 2ν(DD) · D + ξ(tr D)D · D. D ·D = D The reduced inequality takes the form ˙ + 2μD · D + λ(tr D)2 + (2νβ1 + β2 )(DD) · D ≥ 0. (ρ2 ∂ρ ψ − p + ξβ1 D · D)tr D + (β1 D − ρ∂D ψ) · D

˙ implies The arbitrariness of D β1 D − ρ∂D ψ = 0 and hence, upon integration, ψ = (ρ, θ) +

β1 D · D. 2ρ

(8.20)

To complete the set of thermodynamic restrictions we now compute the separate contributions of tr D and the deviator D0 = D − 13 tr D 1. Observe DD = D0 D0 + 23 tr D D0 + 19 (tr D)2 1, (DD) · D = (D0 D0 ) · D0 + tr D D0 · D0 + 19 (tr D)3 Hence the reduced inequality can be written [ρ2 ∂ρ ψ − p + ξβ1 (D0 · D0 + 13 (tr D)2 )]tr D + 2μD0 · D0 + ( 23 μ + λ)(tr D)2 +(2νβ1 + β2 )[(D0 D0 ) · D0 + tr DD0 · D0 + 19 (tr D)3 ] ≥ 0. The occurrence of (D0 D0 ) · D0 implies the restriction 2νβ1 + β2 = 0.

8.7 Fluids of Higher-Order Grade

517

The remaining inequality holds if and only if μ ≥ 0,

2 μ 3

+ λ ≥ 0,

ρ2 ∂ρ ψ − p + ξβ1 (D0 · D0 + 13 (tr D)2 ) = 0. If p is assumed to be dependent only on ρ and θ then we set ξ = 0 and find the standard relation for fluids p = ρ2 ∂ρ ψ. Otherwise p = ρ2 ∂ρ ψ + ξβ1 (D0 · D0 + 13 (tr D)2 ) means that the dynamic pressure p depends also on the stretching D via the additive term νβ1 D · D. The restrictions on μ and λ are the classical ones of Newtonian fluids. Strictly speaking, thermodynamics does not place any restrictions on β1 , though the result (8.20) suggests that β1 > 0 so that the free energy has a minimum at equilibrium (when D = 0). Much work has been developed in the literature on the incompressible fluid of grade 2 modelled by4 T = − p1 + μA1 + α1 A2 + α2 A1 A1 ,

(8.21)

A1 , A2 being the first two Rivlin-Ericksen tensors. The thermodynamic restrictions are μ ≥ 0, α1 + α2 = 0. The rest state proved to be unstable if α1 < 0 and meanwhile experiments show that α1 ≥ 0, α1 + α2 = 0 are not satisfied. Attempts to overcome this difficulty have been developed by looking at higherorder grade fluids [223, 297]. Here we show that, though within a model of grade 2, the parameters ν, ξ, occurring in the chosen derivative, allow us to avoid the joint restriction α1 ≥ 0, α1 + α2 = 0. First observe that since A1 = 2D then an immediate comparison between (8.19) and (8.21) indicates the identifications β1 = 2α1 ,

β2 = 4α2 .

Consequently (8.20) can be written ψ = (ρ, θ) +

α1 D·D ρ

while 0 = 2νβ1 + β2 = 4(να1 + α2 ). 4

See [141] and references therein.

518

8 Aging and Higher-Order Grade Materials

Again we find it natural to set α1 ≥ 0 so that ψ has a minimum at D = 0. However, να1 + α2 = 0 implies that α2 = −να1 . In the Rivlin-Ericksen tensors ν = 1. Here instead ν is allowed to take any value, positive or negative. Hence it follows that both α1 and α2 can be positive, not necessarily with α2 = α1 .

8.7.2 Third Grade Fluid In addition to fluids of second grade, third grade fluids have been investigated to account more properly for the effective shear viscosity (see [223] and refs therein). Here we consider the third grade fluid with two improvements, the fluid is compressible and the derivative need not be the Cotter-Rivlin one. 

We consider the objective derivative A , of a tensor A , defined by5 

˙ − WA A − A WT + ν(DA A + A D). A=A The third grade fluid involves the stretching D and the derivatives 

˙ − WD − DWT + 2νDD, D= D 











D = (D)˙− W D − D WT + ν(D D + D D).

We then let T be given by the constitutive relation 







T = − p 1 + 2μD + λtr D 1 + α1 D +α2 D2 + β1 D +β2 (D D + D D) + β3 (tr D2 )D

which generalizes that in [171]. The viscosities μ, λ and the parameters α1 , α2 , β1 , β2 , β3 are allowed to depend on the density ρ and the temperature θ. By analogy with a misprint in [171], formula (1.2c) versus (4.1), we might have 

considered the last term as β3 (tr D2 ) D. In such a case we would have found that, as a thermodynamic restriction, β3 = 0. The thermodynamic consistency is investigated by means of the entropy inequality and the possible restrictions are derived, by letting ψ, η, and q be functions of 



ρ, θ, D, D, D , ∇θ. 5

The Cotter-Rivlin derivative, which generates the Rivlin-Ericksen tensors, is the particular case ν = 1.

8.7 Fluids of Higher-Order Grade

519

Computation of ψ˙ and substitution in the entropy inequality result in 



˙ − ρ∂ ψ · (D)˙− ρ∂ ψ · ( D )˙− ρ∂∇θ ψ · (∇θ)˙ −ρ(∂θ ψ + η)θ˙ + ρ2 ∂ρ ψtr D − ρ∂D ψ · D D

D

+T · D −

Observe that

1 q · ∇θ ≥ 0. θ

    ¨ + ... D = D = (D)˙+ ... = D



˙ D. ˙ Hence it follows the dots denoting terms in W, D, W,    ... D ˙ = D + ... ... ˙ D, ˙ W, ¨ D. ¨ The arbitrariness of D, (∇θ)˙, θ˙ implies the dots denoting terms in W, D, W, that ∂∇θ ψ = 0, η = −∂θ ψ. ∂ ψ = 0, D

Moreover, since only q can depend on ∇θ then the function q is required to satisfy q · ∇θ ≤ 0. We now exploit the reduced inequality 

˙ − ρ∂ ψ · (D)˙+ T · D ≥ 0. ρ2 ∂ρ ψ tr D − ρ∂D ψ · D D

Notice that    ¨ + ... T · D − ρ∂ ψ · D ˙ = β1 D ·D − ρ∂ ψ · D ˙+ ... = (β1 D − ρ∂ ψ) · D

D

D

D

¨ The arbitrariness of D ¨ implies that the dots containing terms independent of D. β1 D − ρ∂ ψ = 0. D

A direct integration yields  ˆ ρ, D) + 1 β1 D ·D. ψ = ψ(θ, ρ

The entropy inequality then reduces to 





˙ + β1 D · [ D −(D)˙] + T ˇ · D ≥ 0, ˙ − β1 D ·D (ρ2 ∂ρ ψ − p)tr D − ρ∂D ψˆ · D

520

8 Aging and Higher-Order Grade Materials 

where Tˇ = T + p1 − β1 D . We compute 











D = (D)˙− W D − D WT + ν(D D + D D)



˙ − WD − DWT + 2νDD) − (D ˙ − WD − DWT + 2νDD)WT = (D)˙− W(D ˙ − WD − DWT + 2νDD) + ν(D ˙ − WD − DWT + 2νDD)D. +νD(D Hence 



˙ − WD − DWT + 2νDD)] D · [ D −(D)˙] = −D · [W(D ˙ − WD − DWT + 2νDD)WT ] −D · [(D T ˙ − WD − DW + 2νDD) + (D ˙ − WD − DWT + 2νDD)D]. +νD · [D(D Some terms (D · WDD, D · DDWT , D · DWD, D · DDWT , D · WDD, D · DDWT ) are zero. It follows that 



˙ · W + 2(DD) · (WW) + 2(DW) · (WD) + 2ν(DD) · D ˙ + 4ν 2 D · (DDD). D · [ D −(D)˙] = −2(DD)

Likewise we find



˙ + 2ν(DD) · D, D ·D = D · D 

˙ =D ˙ ·D ˙ + 2(DD) ˙ · W + 2ν(DD) · D, ˙ D ·D

˙ + (2α1 ν + α2 )D · (DD) Tˇ · D = 2μD · D + λ(tr D)2 + α1 D · D ˙ + DD) ˙ +β2 D · (DD + β3 (D · D)2 . Upon substitution in the entropy inequality we find ˙ + 2μD · D + λ(tr D)2 + β3 (D · D)2 ˙ − 4β1 W · (DD) (ρ2 ∂ρ ψ − p)tr D − ρ∂D ψˆ · D ˙ · D] ˙ +β1 [2D · (WWD) + 2D · (WDWT ) + 4ν 2 D · (DDD) − D 2 ˙ · (DD) + 4νβ2 D · D2 ≥ 0 ˙ + (2α1 ν + α2 )D · (DD) + 2β2 D +α1 D · D ˙ and quadratic, in β1 [2D · (WWD) + The dependence on W is linear, in 4β1 W · (DD), 2D · (WDWT )]. The arbitrariness of W and the linear term imply that β1 = 0. The inequality reduces to

8.8 Interaction Effects via Materials of Higher Order

521

˙ + (2α1 ν + α2 )D · (DD) (ρ2 ∂ρ ψ − p)tr D + (−ρ∂D ψˆ + α1 D + 2β2 DD) · D +2μD · D + λ(tr D)2 + β3 (D · D)2 + 4νβ2 D2 · D2 ≥ 0. ˙ implies that The occurrence of D −ρ∂D ψˆ + α1 D + 2β2 DD = 0, whence, by integration, ˆ θ, D) = (ρ, θ) + ψ = ψ(ρ,

2β2 α1 D·D+ D · (DD). 2ρ 3ρ

(8.22)

The reduced inequality holds if and only if p = ρ2 ∂ρ ψ, μ ≥ 0,

2α1 ν + α2 = 0,

2μ + 3λ ≥ 0,

β3 ≥ 0.

The result (8.22) for the free energy may suggest further restrictions if we require that ψ has a minimum at D = 0. Since D · (DD) is not definite then this requirement implies that we let β2 = 0 while α1 ≥ 0. Again, the arbitrary value of ν allows α2 to be positive or negative. Hence the present model of compressible, third-grade fluid amounts to letting 

T = − p1 + 2μD + λ(tr D)1 + α1 D +α2 D2 + β3 (tr D2 )D subject to 2α1 ν + α2 = 0,

α1 ≥ 0,

μ ≥ 0,

2μ + 3λ ≥ 0,

β3 ≥ 0.

8.8 Interaction Effects via Materials of Higher Order Motivated by the purpose of describing spatial interaction effects of longer range, some constitutive models have been set up by means of various inequivalent approaches. A well-known model, due to Korteweg [258], describes a compressible fluid by letting the Cauchy stress T be given by T = (− p + αρ + β|∇ρ|2 )1 + δ∇ρ ⊗ ∇ρ + γ∇ 2 ρ, where ∇ 2 = ∇∇. The purpose of Korteweg’s model was to describe capillarity effects.

522

8 Aging and Higher-Order Grade Materials

In elastic materials a similar idea is applied by letting the stress depend not only on the strain (or the deformation gradient) but also on higher-order gradients of deformation. We then use the terminology of (elastic) material of grade n if the gradients of deformation are involved up to order n. It is a drawback of higher-grade models that they are in general incompatible with the standard theory of thermodynamics. To overcome the incompatibility various approaches have been developed. In [142] a rate of supply of mechanical energy u, per unit area, is assumed to take place. The supply u is named interstitial working . By an analogue of Cauchy’s theorem it follows that u depends linearly on the unit normal n via a vector field u, u = u · n, u being denoted as interstitial work flux, though conceptually we need to know that the heat supply is represented by the heat flux vector. Next, the balance of energy and the second law inequality are assumed in the form ρε˙ = T · D + ρr − ∇ · (q − u), ρη˙ ≥ −∇ ·

q ρr + . θ θ

Physically q is the heat flux while u is a mechanical work flux. Formally, though, we might view h = q − u as the surface energy flux. If q/θ is the entropy flux then q h u = + θ θ θ might suggest that we interpret u/θ as an extra-entropy flux, originated by longer range spatial interactions. Borrowing from works of Toupin [421, 422] on elastic materials, involving a virtual-work principle, an approach has been developed where the stress power is not merely T · D. Instead, the power comprises higher-order gradients of the velocity. For second-gradient materials the expression of the stress power is taken in the form T · L + T · ∇ 2 v, where T and T (which is often named hyperstress) are basically seen as elements of the dual space of the first and second velocity gradient. Quite naturally it is usually stated that T is symmetric with respect to the second and third index, the motivation being related to the occurrence of ∇ 2 . A skew-symmetric part would do no work for any motion of the body and then would appear to be useless. Instead, allowing for non-symmetric hyperstresses proves crucial to obtain thermodynamically consistent schemes [332]. As is shown in the next section a connection can be established between the modelling of longer range interactions via the interstitial working, as well as via the extra-entropy flux, and the expression of power via the hyperstress.

8.9 Modelling via the Interstitial Working

523

8.9 Modelling via the Interstitial Working In addition to the standard equation of balance of mass and the equation of motion, ρ˙ + ρ∇ · v = 0,

ρ˙v = ∇ · T + ρb,

the balance of energy is taken in the form ρε˙ = T · L − ∇ · q + ∇ · u + ρr where u is the assumed interstitial work flux and T is allowed to be non-symmetric. Upon the standard assumption about the entropy inequality, ρη˙ ≥ −∇ · it follows that

q ρr + , θ θ

˙ + T · L + ∇ · u − 1 q · ∇θ ≥ 0. − ρ(ψ˙ + η θ) θ

(8.23)

We let ψ, η, T, q, u be continuous functions of ˙  = (F, θ, ∇R F, ∇R2 F, ∇θ, F). Moreover we let ψ be differentiable. To derive the restrictions placed by (8.23) it seems convenient to use the component form. To save writing we let Fi K H = ∂ X K ∂ X H χi ,

Fi K H R = ∂ X K ∂ X H ∂ X R χi ,

Fi K H RS = ∂ X K ∂ X H ∂ X R ∂ X S χi

and the analogue for F˙i K H , F˙i K H R . Further let g = ∇θ, G = ∇R θ. Since ∇ = F−T ∇R we observe gi = FK−1 i GK,

∇ · u = FK−1 i ∂X K ui ,

q · ∇θ = FK−1 i qi G K

˙ we obtain and hence, from u = u(F, θ, ∇R F, ∇R2 F, ∇θ, F), ∂ X K u i = ∂ Fp Q u i F p Q K + ∂θ u i G K + ∂ Fp Q R u i F p Q R K + ∂ Fp Q R S u i F p Q RS K ˙ +∂g p u i ∂ X K (G H FH−1 p ) + ∂ F˙ u i F p Q K . pQ

Moreover we have −1 −1 −1 ∂ X K (G H FH−1 p ) = (∂ X K ∂ X H θ)FH p − G H FH k Fk J K FJ p .

Upon computation of ψ˙ and ∇ · u we can write inequality (8.23) in the form

524

8 Aging and Higher-Order Grade Materials

˙ ˙ (Ti j − ρ∂ Fi K ψ FKTj )L i j + (FK−1 i ∂ F˙ j H u i − ρ∂ F j H K ψ) F j H K − ρ∂ Fi J H K ψ Fi J H K 1 −1 −1 −1 −ρ∂gi ψ g˙ i − ρ∂ F˙i K ψ F¨i K + [FH−1 i (∂θ u i − qi ) − FK i FH k FJ p ∂g p u i Fk J K ]G H θ −1 −1 +FK−1 i ∂ F p Q u i F p Q K + FK i ∂ F p Q R u i F p Q R K + FK i ∂ F p Q R S u i F p Q RS K −ρ(∂θ ψ + η)θ˙ + F −1 F −1 ∂g p u i ∂ X K ∂ X H θ ≥ 0. Ki

Hp

˙ F, ˙ ∇ 2 F, ˙ ∇θ, ¨ ∇R3 F implies The arbitrariness of θ, R η = −∂θ ψ, ∂∇R2 F ψ = 0, ∂∇θ ψ = 0, ∂F˙ ψ = 0, ∂∇R2 F u = 0. Moreover, the occurrence of ∇R F˙ implies F−1 ∂F˙ u = ρ∂∇R F ψ.

(8.24)

Consequently, the free energy is required to be independent of ∇R2 F, ∇ θ, F˙ so that ψ = ψ(F, θ, ∇R F). Relative to thermoelasticity, spatial interaction is allowed via the (admissible) dependence on ∇R F. Meanwhile the entropy η is related to the free energy ψ as it happens in thermoelasticity. The arbitrariness of ∇R2 θ ∈ Sym implies that −1 FK−1 i FH p ∂g p u i ∈ Skw.

The skew symmetry relative to K , H −1 −1 −1 FK−1 i FH p ∂g p u i = −FH i FK p ∂g p u i

in turn implies that ∂g p u i = −∂gi u p , Thus, since

then

∂∇θ u ∈ Skw.

−1 −1 −1 FK−1 i FJ p Fk J K = FJ i FK p Fk J K

−1 FK−1 i FJ p ∂g p u i Fk J K = 0, ∀k = 1, 2, 3.

In view of (8.24), the vector function u satisfies ∂F˙ u = ρF∂∇R F ψ =: V

8.9 Modelling via the Interstitial Working

525

where V = V(F, θ, ∇R F) is a third-order tensor. The obvious integration leads to ˆ u = VF˙ + u(F, θ, ∇R F, ∇θ), ˙ i = Vi j H F˙ j H . The reduced inequality reads where (VF) (Ti j − ρ∂ Fi K ψ FKT j )L i j + FH−1 i (∂θ u i −

1 −1 qi )G H + FK−1 i ∂ F p Q u i F p Q K + FK i ∂ F p Q R u i F p Q R K ≥ 0. θ

Simple non-trivial models arise by considering the following sufficient conditions for the validity of the second law. We let uˆ = 0,

V = V(F, θ).

Hence the inequality simplifies to 1 ˙ (Ti j − ρ∂ Fi K ψ FKTj ) F˙i H FH−1j + (∂θ Vi j K F˙ j K − qi )gi + FK−1 i (∂ F p Q Vi j H )F p Q K F j H ≥ 0. θ The occurrence of F˙ j K gi allows for cross-coupling terms. Let Ai H := (Ti j − ρ∂ Fi K ψ FKTj )FH−1j + FK−1j (∂ Fp Q V ji H )F p Q K . The reduced inequality can be written in the form 1 Ai H F˙i H + ∂θ Vi j K F˙ j K gi − qi gi ≥ 0. θ

(8.25)

Inequality (8.25) is the thermodynamic restriction for the functions T, V, q. The stress T and the heat flux q are, so far, functions of the whole set of variables . For definiteness, we now establish two solutions of inequality (8.25). First let T be independent of g. Hence we may set g = 0 and then the left hand side of the inequality reduces to Ai H F˙i H ≥ 0. Since T, and hence A, may depend on F˙ then the inequality holds if Ai H = Bi H j K F˙ j K where B is a fourth-order positive semi-definite tensor. Thus we have ˙ Ti j = ρ∂ Fi K ψ FKTj − FK−1 h (∂ F p Q Vhi H )F p Q K F j H + F j H Bi H h K Fh K . The reduced inequality then involves the heat flux q,

(8.26)

526

8 Aging and Higher-Order Grade Materials

1 (∂θ Vi j K F˙ j K − qi )gi ≥ 0 θ whence

qi = θ∂θ Vi j K F˙ j K − κgi ,

κ being a non-negative function of . Otherwise, if T is allowed to depend (linearly) on g = ∇θ then a possible restriction is given by the pair of conditions (Ai H + ∂θ V ji H g j ) F˙i H ≥ 0,

qi gi ≤ 0.

The first condition results in ˙ Ti j = ρ∂ Fi K ψ FKTj − FK−1 i (∂ F p Q V ji H )F p Q K F j H − F j H ∂θ Vki H gk + F j H Bi H h K Fh K . (8.27) while the second one is the classical heat conduction inequality. It is of interest that the occurrence of a non-zero tensor V is necessary to have the stress dependent on the second deformation gradient ∇R F. If V = 0 then, by (8.24), ∂∇R F ψ = 0 and hence ψ depends only on F and θ. Accordingly, if V = 0 then both (8.26) and (8.27) reduce to the stress of thermoelasticity. The dependence of T and u on F˙ has to comply with objectivity. Consider u i = Vi j H F˙ j H and, more conveniently the referential analogue u R = J F−1 u,

(u R ) K = VK j H F˙ j H ,

VK j H = J FK−1 i VK j H .

Under a change of frame we have ˙ ph VK p H Fh H . (VK j H F˙ j H )∗ = Q j p VK p H Q j h˙Fh H = Q j p VK p H ( Q˙ j h Fh H + Q j h F˙h H ) = VK h H F˙h H + (Q T Q)

The skew property of Q T Q˙ implies that u R is invariant if and only if V satisfies VK p H Fh H = VK h H F p H .

8.9.1 Modelling via the Extra-Entropy Flux It might seem more natural to keep the balance of energy unchanged without any supply of energy via the interstitial working and to use the statement of the second law by letting a possible extra-entropy flux occur. An objection against the use of the interstitial working is that this term occurs in the balance of energy but is not considered in connection with the entropy flux. We now review the modelling of second gradient materials by merely allowing for the occurrence of an extra-entropy flux in the statement of the second law.

8.9 Modelling via the Interstitial Working

527

The second law inequality is considered in the general form ρη˙ + ∇ · (

ρr q + k) − ≥0 θ θ

while the balance of energy is given by ρε˙ = T · L − ∇ · q + ρr. Hence we obtain ˙ + T · L + θ∇ · k − 1 q · ∇θ ≥ 0. −ρ(ψ˙ + η θ) θ Relative to (8.23) the formal difference is the occurrence of θ∇ · k instead of ∇ · u in the Clausius-Duhem inequality. Hence the two schemes are not strictly equivalent unless θ is constant. ˙ be the set of independent variables. As before we let  = (F, θ, ∇R F, ∇R2 F, ∇θ, F) ˙ Upon evaluation of ψ and ∇ · k we can write the inequality in the form ˙ −ρ(∂θ ψ + η)θ˙ + (Ti j − ρ∂ Fi K ψ FKTj )L i j + (θFK−1 i ∂ F˙ j H ki − ρ∂ F j H K ψ) F j H K −ρ∂ Fi J H K ψ F˙i J H K − ρ∂gi ψ g˙ i − ρ∂ F˙i K ψ F¨i K + θFK−1 i ∂ F p Q ki F p Q K −1 −1 −1 +θFK−1 i ∂ F p Q R ki F p Q R K + θFK i ∂ F p Q R S ki F p Q RS K + θFK i FH p ∂g p ki ∂ X K ∂ X H θ 1 −1 −1 −1 +[FH−1 i (θ∂θ ki − qi ) − θFK i FH h FJ p ∂g p ki Fh J K ]∂ X H θ ≥ 0 θ

By paralleling the previous procedure we obtain the following analogous results. By ˙ F, ˙ ∇ 2 F, ˙ ∇θ, ¨ ∇R3 F it follows the linearity of θ, R η = −∂θ ψ, ∂∇R2 F ψ = 0, ∂∇ θ ψ = 0, ∂F˙ ψ = 0, ∂∇R2 F k = 0. Now, the linearity of ∇R F˙ implies ∂ F˙ j H ki =

ρ ∂ F ψ Fi K . θ jHK

(8.28)

Integration of (8.28) yields ˆ k = K(F, θ, ∇R F)F˙ + k(F, θ, ∇R F, ∇θ), By the linearity of ∂ X K ∂ X H θ ∈ Sym we have

Ki j H =

ρ Fi K ∂ F j H K ψ. θ

528

8 Aging and Higher-Order Grade Materials −1 FK−1 i FH p ∂g p ki ∈ Skw,

whence

−1 FK−1 i FH p ∂g p ki Fh H K = 0.

∂∇θ k ∈ Skw, The reduced inequality takes the form (Ti j − ρ∂ Fi K ψ FKT j )L i j + FH−1 i (θ∂θ ki −

1 −1 qi )G H + θFK−1 i ∂ F p Q ki F p Q K + θFK i ∂ F p Q R ki F p Q R K ≥ 0. θ

We now examine the consequences of the assumption kˆ = 0,

K = K(F, θ).

The inequality takes the form 1 Ai H F˙i H + θ ∂θ K ji H F˙i H g j − qi gi ≥ 0, θ where Ai H = (Ti j − ρ∂ Fi K ψ FKTj )FH−1j + θFK−1 h ∂ F p Q K hi H F p Q K . The functions obtained within the scheme of the interstitial working occur here too with the formal replacement of ∂θ V with θ∂θ K. The inequality holds if Ai H = Bi H j Q F˙ j Q ,

θ2 ∂θ K ji H F˙i H − q j = κg j

where B is a fourth-order non-negative tensor and κ is a non-negative scalar (or second-order tensor). Hence we have ˙ Ti j = ρ∂ Fi Q ψ FQTj − θ FR−1 h (∂ F p Q K hi H )F p Q R F j H + F j H Bi H p Q F p Q and

qi = θ2 ∂θ K i j R F˙ j R − κgi .

Otherwise we can obtain ˙ Ti j = ρ∂ Fi Q ψ FQTj − θ FR−1 h (∂ F p Q K hi H )F p Q R F j H − θF j H ∂θ K ki H gk + F j H Bi H p Q F p Q and Fourier’s law for q.

8.10 Materials of Korteweg Type Paralleling the analysis of the previous section, we now consider (fluid) materials of Korteweg type. Hence our purpose is to see whether and how a stress function of the form

8.10 Materials of Korteweg Type

529

T = T(ρ, θ, ∇ρ, ∇ 2 ρ, ∇θ, D) is consistent with the principles of continuum thermodynamics; ∇ 2 stands for ∇∇. The dependence on ∇θ and D means that heat conduction and viscosity are considered. In view of the continuity equation ρ˙ + ρ∇ · v = 0, the scalars ρ, ˙ ρ, ∇ · v are not linearly independent. For technical convenience we choose ρ and ρ, ˙ but not ∇ · v, as independent quantities. Hence since D = D0 + 13 (∇ · v)1 a possible dependence on D is restricted to D0 . We then let ˙ D0 )  = (ρ, θ, ∇ρ, ∇ 2 ρ, ∇θ, ρ, be the set of independent variables and assume ψ, η, T, q, k are functions of  while ψ and k are differentiable. We let T be in the form T = − p(ρ, θ)1 + T . Upon computation of ψ˙ and ∇ · k and substitution in the Clausius-Duhem inequality ˙ + T · D + θ∇ · k − 1 q · ∇θ ≥ 0 −ρ(ψ˙ + η θ) θ we obtain ˙ − ρ∂ 2 ψ · ∇˙2 ρ −ρ(∂θ ψ + η)θ˙ + (−ρ∂ρ ψ + p/ρ − tr T /3ρ)˙ρ + T · L0 − ρ∂∇ρ ψ · ∇ρ ∇ ρ ˙ − ρ∂ ψ ρ¨ − ρ∂ ψ · D ˙ 0 + θ∂ρ k · ∇ρ + θ∂∇ρ k · ∇ 2 ρ −ρ∂∇θ ψ · ∇θ ρ˙ D0 1 +θ∂∇ 2 ρ k · ∇ 3 ρ + θ∂∇θ k · ∇ 2 θ + θ∂ρ˙ k · ∇ ρ˙ + θ∂D0 k · ∇D0 + (θ∂θ k − q) · ∇θ ≥ 0. θ

We first observe that ρ, ¨ ∇ 2 θ, ∇ 3 ρ occur linearly. Their arbitrariness then implies that ∂ρ˙ ψ = 0, ∂∇θ k ∈ Skw, ∂∇ 2 ρ k ∈ Skw; For simplicity we assume ∂∇θ k = 0, ∂∇ 2 ρ k = 0. The inequality then reduces to ˙ −ρ(∂θ ψ + η)θ˙ + (−ρ∂ρ ψ + p/ρ − tr T /3ρ)ρ˙ + T · L0 − ρ∂∇ρ ψ · ∇ρ ˙ ˙ − ρ∂ ψ · D ˙ 0 + θ∂ρ k · ∇ρ −ρ∂∇ 2 ρ ψ · ∇ 2 ρ − ρ∂∇θ ψ · ∇θ D0 1 +θ∂∇ρ k · ∇ 2 ρ + θ∂ρ˙ k · ∇ ρ˙ + θ∂D0 k · ∇D0 + (θ∂θ k − q) · ∇θ ≥ 0. θ To determine the extent of arbitrary quantities in the reduced inequality we need to exploit some identities. Since ˙ = ∇ θ˙ − LT ∇θ ∇θ

530

8 Aging and Higher-Order Grade Materials

then the arbitrariness of ∇ θ˙ implies ∂∇θ ψ = 0. Moreover the arbitrariness of θ˙ implies the standard relation η = −∂θ ψ. ˙ = ∇ ρ˙ − LT ∇ρ allows us to write The identity ∇ρ ˙ + θ∂ k · ∇ ρ˙ = (θ∂ k − ρ∂ ψ) · ∇ ρ˙ + ρ∇ρ ⊗ ∂ ψ · L. −ρ∂∇ρ ψ · ∇ρ ρ˙ ρ˙ ∇ρ ∇ρ The arbitrariness of ∇ ρ˙ implies θ∂ρ˙ k − ρ∂∇ρ ψ = 0

(8.29)

Observe that ρ∇ρ ⊗ ∂∇ρ ψ · L = ρ∇ρ ⊗ ∂∇ρ ψ · L0 + 13 ρ∇ρ · ∂∇ρ ψ∇ · v ˙ = ρ∇ρ ⊗ ∂∇ρ ψ · L0 − 13 ∇ρ · ∂∇ρ ψ ρ. Further we apply the identity6 ˙ ∇ 2 ρ = ∇ 2 ρ˙ − ∇ρ∇ 2 v − 2 sym ∇ 2 ρ L. ˙ 0 implies The arbitrariness of ∇ 2 ρ˙ and D ∂∇ 2 ρ ψ = 0,

∂∇ 2 v ψ = 0,

∂L0 ψ = 0.

Consequently we obtain the reduced inequality (−ρ∂ρ ψ + p/ρ − tr T /3ρ + 13 ∇ρ · ∂∇ρ ψ])ρ˙ + T · L0 − ρ∇ρ ⊗ ∂∇ρ ψ · L0 1 +θ∂ρ k · ∇ρ + θ∂∇ρ k · ∇ 2 ρ + θ∂L0 k · ∇L0 + (θ∂θ k − q) · ∇θ ≥ 0. θ (8.30) As for ρ∇ρ ⊗ ∂∇ρ ψ · L0 in (8.30) observe that ρ∇ρ ⊗ ∂∇ρ ψ · L0 = ρ∇ρ ⊗ ∂∇ρ ψ · D0 + ρ∇ρ ⊗ ∂∇ρ ψ · W. The arbitrariness of W implies 6

In suffix notation ∂xi ∂˙ x j ρ = ∂xi ∂x j ρ˙ − (∂xk ρ)∂xi ∂x j vk − (∂xi ∂xk ρ)∂x j vk − (∂x j ∂xk ρ)∂xi vk .

8.10 Materials of Korteweg Type

531

∇ρ ⊗ ∂∇ρ ψ ∈ Sym. Hence it follows ∂∇ρ ψ = α∇ρ. Since ψ is a function of ρ, θ, ∇ρ it follows that the dependence of ψ and α on ∇ρ is through (∇ρ)2 . As a further consequence, integration of (8.29) yields k=

ρ ˜ α(ρ, θ, (∇ρ)2 )∇ρ ρ˙ + k(ρ, θ, D0 ). θ

(8.31)

Incidentally, the absence of W in the entropy inequality is consistent with objectivity. In summary, compatibility with thermodynamics requires that ψ = ψ(ρ, θ, (∇ρ)2 ), k is given by (8.31) and T, q, k, ψ are subject to (−ρ∂ρ ψ + p/ρ − tr T /3ρ + 13 ∇ρ · ∂∇ρ ψ)ρ˙ + T − ρ∇ρ ⊗ ∂∇ρ ψ) · D0 1 +θ∂ρ k · ∇ρ + θ∂∇ρ k · ∇ 2 ρ + θ∂D0 k · ∇D0 + (θ∂θ k − q) · ∇θ ≥ 0.(8.32) θ For definiteness, we now establish a simple model based on the assumptions k˜ = 0,

∂∇θT = 0,

∂D0 tr T = 0,

and tr T → 0 as ∇ · v → 0. Since T and k (and ψ) are independent of ∇θ, then without any restriction on T and k we can examine (8.32) as ∇θ = 0 so that (−ρ∂ρ ψ + p/ρ − tr T /3ρ + 13 ∇ρ · ∂∇ρ ψ)˙ρ + (T − ρ∇ρ ⊗ ∂∇ρ ψ) · D0 + θ∂ρ k · ∇ρ + θ∂∇ρ k · ∇ 2 ρ ≥ 0.

Now observe ∂ρ k · ∇ρ = ∂∇ρ k · ∇ 2 ρ =

1 β(∇ρ)2 ρ, ˙ θ

β := α + ρ∂ρ α,

ρρ˙ [αρ + γ∇ρ ⊗ ∇ρ · ∇ 2 ρ], θ

γ := ∂(∇ρ)2 α.

The inequality then reads [−ρ∂ρ ψ +

1 p − tr Tˇ + (β + 13 α)(∇ρ)2 + ραρ + ργ∇ρ ⊗ ∇ρ · ∇ 2 ρ]ρ˙ ρ 3ρ +(Tˇ − αρ∇ρ ⊗ ∇ρ) · D0 ≥ 0

The mutual independence of ρ˙ and D0 implies that the two lines of the inequality are non-negative. Now, the assumption that tr T → 0 as ∇ · v → 0 (or ρ˙ → 0) implies

532

8 Aging and Higher-Order Grade Materials

that the factor of ρ, ˙ except tr T , has to vanish. Hence we obtain the pressure p in the form p = p¯ − 13 α ρ(∇ρ)2 ,

p¯ := ρ2 ∂ρ ψ − β ρ(∇ρ)2 − ρ2 αρ − ρ2 γ(∇ρ ⊗ ∇ρ) · ∇ 2 ρ.

(8.33)

The inequalities tr T ρ˙ ≤ 0,

T − αρ∇ρ ⊗ ∇ρ) · D0 ≥ 0 (T

imply 1 T 3

= ξρρ˙ = ξ∇ · v,

ξ ≥ 0,

T − αρ∇ρ ⊗ ∇ρ)0 = 2μD0 , (T

μ ≥ 0,

where ( )0 denotes the deviator. Hence we have T = − p1 + αρ∇ρ ⊗ ∇ρ + ξ tr D 1 + 2μ D0 .

(8.34)

The inequality involving the heat flux q, 1 (θ∂θ k − q) · ∇θ ≥ 0 θ implies q = θ2 ∂θ k − κ∇θ,

κ ≥ 0,

κ being a function of . Since ∂θ k =

ρ2 ρ (α − θ∂θ α)∇ · v ∇ρ = − 2 (α − θ∂θ α)ρ˙ ∇ρ θ2 θ

it follows q = (α − θ∂θ α)ρ2 ∇ · v ∇ρ − κ∇θ;

(8.35)

the scalar κ might be replaced with a non-negative second-order tensor. These results deserve some comments. As for the pressure p, given by (8.33), the term ρ2 ∂ρ ψ is the classical (equilibrium) pressure of simple fluids while the other terms show the effects of the non-simple model described by gradients of the mass density. In addition, αρ∇ρ ⊗ ∇ρ shows the anisotropic stress due to ∇ρ. Of course ξ tr D 1 + 2μ D0 is the standard viscous Cauchy stress. Likewise, the heat flux is the sum of the classical Fourier term −κ∇θ and the term (α − θ∂θ α)ρ2 ∇ · v ∇ρ again induced by the gradient of the mass density. It is of interest that a dual model, consistent with (8.32), holds with a contribution to T induced by ∇θ while q is merely parallel to −∇θ. To find the detailed expressions assume

8.10 Materials of Korteweg Type

k˜ = 0,

533

∂ρ˙ q = 0,

∂D0 q = 0,

∂D0 tr T = 0,

and, again, let tr T → 0 as ∇ · v = 0. Hence inequality (8.32) splits into [−ρ∂ρ ψ +

1 p − tr T + (β + 13 α)(∇ρ)2 + ραρ + ργ∇ρ ⊗ ∇ρ · ∇ 2 ρ ρ 3ρ ρ T − αρ∇ρ ⊗ ∇ρ) · D0 ≥ 0 − (α − θ∂θ α)∇θ · ∇ρ]ρ˙ + (T θ q · ∇θ ≤ 0.

Since ρ˙ → 0 implies tr T → 0 it follows that p = p˜ − 13 α ρ(∇ρ)2 ,

p˜ := ρ2 ∂ρ ψ − β ρ(∇ρ)2 − ρ2 αρ − ρ2 γ(∇ρ ⊗ ∇ρ) · ∇ 2 ρ +

ρ2 (α − θ∂θ α)∇θ · ∇ρ. θ

Now since tr T is independent of D0 we let D0 = 0 and the inequality implies 1 tr T 3

= ξ∇ · v,

ξ ≥ 0,

while T − αρ∇ρ ⊗ ∇ρ)0 · D0 T − αρ∇ρ ⊗ ∇ρ) · D0 = (T 0 ≤ (T results in T − αρ∇ρ ⊗ ∇ρ)0 = 2μD0 , (T

μ ≥ 0.

Hence T = − p˜ 1 + αρ∇ρ ⊗ ∇ρ + ξ∇ · v 1 + 2μD0 . Meanwhile q merely satisfies q = −κ∇θ,

κ = κ(ρ, θ, ∇ρ, ∇ 2 ρ, ∇θ) ≥ 0.

Hence, by means of the same term θ∂θ k · ∇θ, two inequivalent models are obtained. In the first case it results in an energy flux (α − θ∂θ α)ρ2 ∇ · v ∇ρ, in the second case it gives rise to an additional pressure (ρ2 /θ)(α − θ∂θ α)∇θ · ∇ρ while q = −κ∇θ.

534

8 Aging and Higher-Order Grade Materials

8.11 Hyperstress and Materials of Higher-Order Grade Equations of equilibrium and boundary conditions for materials of grade 2 are given by Toupin [421, 422] through a principle of virtual work. Borrowing from the work of E. and F. Cosserat [107] on material with couple-stresses, Toupin allows for hypertractions on the boundary of the pertinent region. The work, per unit area, of the hypertraction H is taken to be Hi δ Fi K N K , N being the unit normal. Hence it follows that the stress tensor can depend on ∇R F. Hence the character of material with a higher-order grade is associated with the existence of an hypertraction. Since higher-order grade materials are incompatible with the standard theory of thermodynamics, much research in the literature has been developed by modifying the expression of the power expended on the pertinent region Pt . The mechanical power, per unit volume, is T · L. This is so because the power of the traction, per unit area, is v · Tn; the divergence theorem implies ∫ v · Tnda = ∫ [v · (∇ · T) + T · L]dv. Pt

∂Pt

The power v · (∇ · T) cancels by means of the equation of motion while T · L is the power affecting the evolution of the internal energy. Also by analogy with [422], the mechanical power is generalized [180] by including the quantity G · ∇ 2 v so that the power is7 T · L + G · ∇ 2 v. The third-order tensor G is named hyperstress. Obvious consistency requirements indicate that the introduction of the additive power G · ∇ 2 v is allowed only if the other balance equations are revised properly. While T · L, or T · D, is well recognized to be the stress power, the theory of thermodynamics is modified (or generalized) by letting the power include an analogous term G · ∇ 2 v, linear in the second gradient ∇ 2 v, again with G a third-order hyperstress. Further generalizations occur with higher-order hyperstresses or with the temperature so that a power a θ˙ may occur.8 The power T · L + G · ∇ 2 v might be framed within a scheme of balance equations as follows [180]. For any part Pt we can apply the divergence theorem to obtain ∫ T · Ldv = − ∫ (∇ · T) · v dv + ∫ v · Tn dv.

Pt

Pt

∂Pt

Since G i jk ∂x j ∂xk vi = ∂xk (G i jk ∂x j vi ) − ∂x j (∂xk G i jk vi ) + (∂x j ∂xk G i jk )vi by applying the divergence theorem it follows 7 8

In components, G · ∇ 2 v = G i jk vi, jk . See, e.g., [83].

8.11 Hyperstress and Materials of Higher-Order Grade

535

∫ T · L + G · ∇ 2 v dv = ∫ [∇ · (∇G − T)] · v dv + ∫ Gn · L + [Tn − (∇ · G)n] · v da.

Pt

Pt

∂ Pt

Denote by P =1−n⊗n the projection onto the plane with normal n and ∂n = n · ∇,

∇ S = P∇ = ∇ − n(n · ∇)

the normal derivative and the surface gradient (relative to the surface S). Hence ∇v = ∇ S v + ∂n v ⊗ n,

L i j = ∂x j vi = n j ∂n vi + (∇ S v)i j .

Consequently Gn · L = Gn · ∇ S v + ∂n v · [(Gn)n] and hence ∫ T · L + G · ∇ 2 v dv = ∫ [∇ · (∇G − T)] · v dv + ∫ {Gn · ∇ S v + ∂n v · [(Gn)n] + [Tn − (∇ · G)n] · v}da.

Pt

Pt

∂Pt

This splitting of the power justifies the balance of linear momentum, ρ˙v = ∇(T − ∇ · G) + ρb, and the boundary conditions, t S = Tn + Gn · ∇ S v − (∇ · G)n,

m S = (Gn)n,

t S and m S being the traction and the hypertraction. However the thermodynamic restrictions are derived from the free-energy imbalance ρψ˙ − T · D − G · ∇ 2 v ≤ 0. An example of constitutive equations is given for incompressible fluids in the form Ti j = 2μDi j − pδi j ,

G i jk = η1 ∂x j ∂xk vi + η2 (∂xi ∂x j vk + ∂xi ∂xk v j − ∂xr ∂xr vi δ jk ) − πk δi j ,

where p is a pressure while π is a hyperpressure vector.

8.11.1 Solids of Higher-Order Grade Here we show that non-local models, in space and time, are possible by merely allowing for a non-zero extra-entropy production. Hence models of higher-order

536

8 Aging and Higher-Order Grade Materials

grade are allowed without the introduction of ad hoc terms like hyperstress or extraenergy supplies. For notational simplicity the scheme is restricted to first grade but models of higher-order grades can be established by the same approach. We generalize the model of Sect. 8.9.1 by letting ˙ ˙ θ) ˙ ∇R F,  = (F, θ, ∇R F, ∇R2 F, ∇R θ, F, be the set of independent variables. Having chosen the material gradients, ∇ R , it is convenient to follow systematically the referential description. The entropy inequality takes the form ˙ + T R · F˙ + θ∇R · k R − 1 q R · ∇R θ ≥ 0, −ρ R (ψ˙ + η θ) θ where T R is the first Piola stress and k R = J kF−T , q R = J qF−T . Upon computation of ψ˙ and ∇R · k R we can write the inequality in the form −ρ R (∂θ ψ + η)θ˙ + (Ti RK − ρ R ∂ Fi K ψ) F˙i K − ρ R ∂ Fi H K ψ F˙i H K − ρ R ∂ Fi H K J ψ F˙i H K J −ρ R ∂∂ X K ψ∂ X K θ˙ − ρ R ∂ F˙i K ψ F¨i K − ρ R ∂ F˙i H K ψ F¨i H K − ρ R ∂θ˙ ψ θ¨ +θ∂ Fi K k HR Fi K H + θ∂θ k HR ∂ X K θ + θ∂ Fi K J k HR Fi K J H + θ∂ Fi K J I k HR Fi K J I H 1 R ˙ Fi K H + θ∂ F˙i K J k HR F˙i K J H + θ∂θ˙ k HR ∂ X H θ˙ − q KR ∂ X K θ ≥ 0 +θ∂∂ X K θ k HR ∂ X H ∂ X K θ + θ∂ F˙i K k H θ

¨ F, ¨ ∇R3 F it follows that ¨ ∇R F, By the linearity (and arbitrariness) of θ, ∂θ˙ ψ = 0,

∂F˙ ψ = 0,

∂∇R F˙ ψ = 0,

∂∇R2 F k R = 0.

Hence ψ = ψ(F, θ, ∇R F, ∇R2 F, ∇R θ),

˙ ˙ θ) ˙ ∇R F, k R = k R (F, θ, ∇R F, ∇R θ, F,

and −ρ R (∂θ ψ + η)θ˙ + (Ti RK − ρ R ∂ Fi K ψ) F˙i K − ρ R ∂ Fi H K ψ F˙i H K − ρ R ∂ Fi H K J ψ F˙i H K J −ρ R ∂∂ X K θ ψ∂ X K θ + θ∂ Fi K k HR Fi K H + θ∂θ k HR ∂ X H θ + θ∂ Fi K J k HR Fi K J H 1 +θ∂∂ X K θ k HR ∂ X H ∂ X K θ + θ∂ F˙i K k HR F˙i K H + θ∂ F˙i K J k HR F˙i K J H + θ∂θ˙ k HR ∂ X H θ˙ − q KR ∂ K θ ≥ 0 θ The linearity of ∇R θ˙ and ∇ R2 F˙ implies − ρ R ∂ Fi H K J ψ + θ∂ F˙i K J k HR = 0, A direct integration of (8.36) gives

−ρ R ∂∂ X H θ ψ + θ∂θ˙ k HR = 0.

(8.36)

8.11 Hyperstress and Materials of Higher-Order Grade

k HR = Ai H K J F˙i K J + c H θ˙ + kˆ H ,

Ai H K J :=

537

ρR ρR ∂ F ψ, c H := ∂∂ X H θ ψ, θ iHK J θ

where A and c depend on (F, θ, ∇R F, ∇R2 F, ∇R θ) while kˆ is a function of (F, θ, ∇R F, ˙ Moreover the linearity, (arbitrariness) and symmetry of ∂ X H ∂ X K θ imply that ∇R θ, F). ∂∇R θ k R ∈ Skw. It then follows the reduced inequality −ρ R (∂θ ψ + η)θ˙ + (Ti RK − ρ R ∂ Fi K ψ) F˙i K − ρ R ∂ Fi H K ψ F˙i H K + θ∂ Fi K k HR Fi K H 1 +θ∂θ k HR ∂ X H θ + θ∂ Fi K J k HR Fi K J H + θ∂ F˙i K k HR F˙i K H − q KR ∂ X K θ ≥ 0 θ Physically admissible properties are allowed provided this set of restrictions hold. Yet this scheme seems to be quite involved and we then look for particular constitutive functions allowing for the non-simple character provided by ∇ R F, ∇ R2 F, ∇ R θ. The first requirement in (8.36) implies that ψ depends at most linearly on ∇ 2 F. Consistent with the skew-symmetry of ∂∇R θ k R we let Skw   = ∂∇R θ k R . Hence we simplify the constitutive properties by letting A, c, and  be functions of ˙ by letting ˙ θ, F, θ, by assuming that T R and q R be independent of ∇R F, ˙ T R = T R (F, θ, ∇R F, ∇R θ, F),

˙ q R = q R (F, θ, ∇R F, ∇R θ, F),

and by taking the extra-entropy vector k R in the special form ˙ X K θ. k HR = A p H K J (F, θ) F˙ p K J + c H (F, θ)θ˙ +  H K (F, θ, F)∂ The reduced inequality then reads [−ρ R (∂θ ψ + η) + θFi M H ∂ Fi M c H + θ∂θ c H ∂ X H θ]θ˙ + (TiRK − ρ R ∂ Fi K ψ) F˙i K 1 +(−ρ R ∂ F p H K ψ + θFi M H ∂ Fi M A p H K J + θ∂θ A p H K J ) F˙ p H K + (θFi M H ∂ Fi M  H K − q KR )∂ X K θ ≥ 0. θ

The linearity of θ˙ and ∇R F˙ implies9 η = −∂θ ψ +

1 (θFi M H ∂ Fi M c H + θ∂θ c H ∂ X H θ), ρR

− ρ R ∂ Fp H K ψ + θFi M H ∂ Fi M A p H K J + θ∂θ A p H K J = 0, 9

(8.37) (8.38)

Equation (8.37) is an uncommon case where entropy is not merely a derivative of a potential function.

538

8 Aging and Higher-Order Grade Materials

and hence 1 (Ti RK − ρ R ∂ Fi K ψ) F˙i K + (θFi M H ∂ Fi M  H K − q KR )∂ X K θ ≥ 0. θ ˙ Hence the reduced inequality allows us to We neglect cross-coupled terms ∇R θ ⊗ F. represent T R and q R in the form Ti RK = ρ R ∂ Fi K ψ + T˜i RK ,

q PR = θ Fi M H ∂ Fi M  H P − K P Q ∂ X Q θ,

(8.39)

where K is a non-negative tensor while T˜ R is an additive Piola stress accounting for dissipation (viscosity). Indeed, T˜ R · F˙ ≥ 0. For definiteness we let

Hence

T˜ R = [αD + β(tr D)1]F−T .

T˜ R · F˙ = [αD + β(tr D)1]F−T F˙ = α|D|2 + β(tr D)2 .

The inequality T˜ R · F˙ ≥ 0 holds provided only that α ≥ 0, 2α + 3β ≥ 0. As to the effects of the deformation gradient, since ψ is a function of F, ∇R F, ∇R2 F then by (8.39) it follows that T R is a function of F, ∇R F, ∇R2 F and is allowed to depend linearly on F˙ via a non-negative tensor M. Meanwhile Eq. (8.38) may be viewed as a constraint on the function ψ. Equation (8.37) implies that the entropy is allowed to depend on F, θ, ∇R F, ∇R2 F, ∇R θ. If, instead, the occurrence of an hyperstress G is allowed so that the mechanical power is T R · F˙ + G · ∇R F˙ then the previous procedure can be repeated step by step to arrive at the reduced inequality −ρ R (∂θ ψ + η)θ˙ + (Ti RK − ρ R ∂ Fi K ψ) F˙i K + G i H K F˙i H K − ρ R ∂ Fi H K ψ F˙i H K + θ∂ Fi K k HR Fi K H 1 +θ∂θ k HR ∂ X H θ + θ∂ Fi K J k HR Fi K J H + θ∂ F˙i K k HR F˙i K H − q KR ∂ X K θ ≥ 0. θ

Then we find that (8.38) is replaced by G p H K = ρ R ∂ Fp H K ψ − θFi M H ∂ Fi M A p H K J − θ∂θ A p H K J .

8.12 A New Scheme Associated with the Hyperstress We start with the power density w := T R · F˙ + G · ∇ R F˙

(8.40)

8.12 A New Scheme Associated with the Hyperstress

539

and assume w is objective. First we show that w can be represented by appropriate surface and volume terms. Since ThRK F˙h K + G h RS F˙h RS = ∂ X K (ThRK vh ) − ∂ X K (ThRK )vh + ∂ X S (G h RS F˙h R ) − (∂ X S G h RS ) F˙h R and

˙ ∂ X K (ThRK vh ) + ∂ X S (G h RS F˙h R ) = ∇R · (vT R + FG)

then, for any region P ⊂ R, the power W = ∫P w dv R can be written W = ∫ [ThRK m K vh ) + G h K Q m Q F˙h K ]da R − ∫[(∂ X K ThRK )vh + (∂ X Q G h K Q ) F˙h K ]dv R , ∂P

P

where m = n R is the unit outward normal in ∂P, P ⊂ R. Now, (∂ X Q G h K Q ) F˙h K = ∂ X K ((∂ X Q G h K Q )vh ) − (∂ X K ∂ X Q G h K Q )vh . Hence W = ∫ [(ThRK − ∂ X Q G h K Q )m K vh ) + G h K Q m Q F˙h K ]da R − ∫[∂ X K (ThRK − ∂ X Q G h K Q )vh ]dv R . ∂P

P

Let

f Q = G h K Q F˙h K .

Sh K = ThRK − ∂ X Q G h K Q , The power W then can be represented by

W = ∫ [Sh K m K vh + f Q m Q ]da R − ∫ vh ∂ X K Sh K dv R , ∂P

P

or, in absolute notation, W = ∫ [v · Sm + f · m]da R − ∫ v · (∇ R · S)dv R . ∂P

The identity

P

∇ R · (vS) = S · F˙ + v · (∇ R · S)

and use of the divergence theorem allow W to be written W = ∫ f · m da R + ∫ S · F˙ dv R . ∂P

P

According to this representation of W, it is natural to identify S with the effective Piola stress and to think of f as an energy flux vector which reminds the interstitial energy flux [142]. Associated with the effective Piola stress S is the Cauchy stress

540

8 Aging and Higher-Order Grade Materials

1 1 1 Tˆ = SFT = (T R − ∇ R · G)FT = T − (∇ R · G)FT . J J J ˆ are the effective stress tensors, we now review the Based on the view that S and T balance equations. First we assume the balance of linear momentum in the form ρ R v˙ = ∇ R · S + ρ R b. The balance of angular momentum is satisfied if Tˆ = Tˆ T ,

SFT = FST .

(8.41)

To see whether the condition (8.41) is plausible, we start from the required objectivity of the power w. Under the Euclidean transformation x∗ = c + Qx, the transforms F∗ and ∇R F∗ take the forms F∗ = QF, and hence

˙ + QF, ˙ F˙∗ = QF

∇R F∗ = Q∇R F ˙ ˙ R F + Q∇R F. ∇R ˙F∗ = Q∇

Consequently the invariance w∗ = w results in ˙ ˙ + G∗ · Q∇ ˙ R F + T∗R · QF˙ + G∗ · Q∇ R F˙ = T R · F˙ + G · ∇ R F. T∗R · QF The arbitrariness of F˙ and ∇ R F˙ implies T R = QT T∗R ,

G = QT G∗ .

Substitution of T∗ = QT R and G∗ = QG in the reduced equation ˙ + G∗ · Q∇ ˙ RF = 0 T∗R · QF leads to l j (Tl RK F j K + G l K P F j K P ) = 0 ˙ ∈ Skw. Hence it follows the symmetry condition where  = QT Q Tl RK F j K + G l K P F j K P = T jRK Fl K + G j K P Fl K P .

(8.42)

We now go back to (8.41) or, in suffix notation, ThRK F j K − ∂ X P (G h K P )F j K = T jRK Fh K − ∂ X P (G j K P )Fh K .

(8.43)

8.12 A New Scheme Associated with the Hyperstress

541

Proposition 8.1 The objectivity of the stress power implies the symmetry of the ˆ formally, the symmetry condition (8.42) implies (8.41). effective Cauchy stress T; We follow the proof given in [332]. First we write the assumption (8.43) in the form ThRK F j K − ∂ X P (G h K P F j K ) + G h K P F j K P = T jRK Fh K − ∂ X P (G j K P Fh K ) + G j K P Fh K P . In view of (8.42), the symmetry condition (8.43) holds if and only if G h K P F j K = G j K P Fh K . Denote by M and N the symmetric and skew-symmetric parts of G relative to the capital indices, G h K P = Mh K P + N h K P ,

Mh K P = 21 (G h K P + G h P K ),

Nh K P = 21 (G h K P − G h P K ).

Hence we have Mh K P F j K + Nh K P F j K = M j K P Fh K + N j K P Fh K whence Mh K P F j K Fi P + Nh K P F j K Fi P = M j K P Fh K Fi P + N j K P Fh K Fi P .

(8.44)

Let Dh ji := Mh K P F j K Fi P ,

Hh ji := Nh K P F j K Fi P .

It follows that Dh ji = Dhi j ,

Hh ji = −Hhi j

while, by (8.44), Dh ji + Hh ji = D j hi + H j hi or, equivalently, Dh ji − D j hi = H j hi − Hh ji . By interchanging j and i, and next h and j, we have Dhi j − Di h j = Hi h j − Hhi j , D ji h − Di j h = Hi j h − H ji h . By summing these three relations, in view of the symmetry of D and the skewsymmetry of H it follows

542

8 Aging and Higher-Order Grade Materials

2Dhi j − 2Di h j = 2H j hi whence Mh K P Fi K F j P − Mi K P Fh K F j P = N j K P Fh K Fi P . −1 Upon multiplication by FQ−1 h FL i and summation of h and i yields −1 N j Q L = Mh L P F j P FQ−1 h − Mi Q P F j P FL i .

(8.45)

Hence, provided the skew part N of G is related to the symmetric part M by (8.45), ˆ = SFT /J is symmetric.  the Cauchy stress T To sum up, the balance equations can be written in a consistent way in terms of the effective Piola stress S in the form ρ R v˙ = ∇ R · S + ρ R b,

SFT = FST ,

ρ R ε˙ = S · F˙ + ∇ R · f − ∇R · q R + ρ R r, where f K = G h J K F˙h J . Conceptually it emerges that the occurrence of a hyperstress ˙ can be embodied in an effective Piola stress S provided G, with stress power G · ∇ R F, an appropriate energy flux f is allowed to occur. We now investigate whether and how this scheme is compatible with thermodynamics.

Thermodynamic Restrictions We follow the standard assumption on the second law of thermodynamics, expressed by the inequality qR ρR r − ∇R · kR + . ρ R η˙ ≥ −∇R · θ θ Substitution of ρ R r − ∇R · q R from the balance of energy results in the entropy inequality ˙ + S · F˙ + ∇ R · w + θ∇ R · k R − 1 q R · ∇ R θ ≥ 0. −ρ R (ψ˙ + η θ) θ While the extra-entropy flux k R is an unknown constitutive function, the expression of the energy flux f is known. Indeed, since N · ∇ R F˙ = 0 then ˙ ∇R · f = ∂ X K (G h J K F˙h J ) = ∂ X K G h J K F˙h J + Mh J K F˙h J K = (∇R G) · F˙ + M · ∇R F. The entropy inequality can then be written ˙ + (S + ∇R · G) · F˙ + M · ∇R F˙ + θ∇R · k R − 1 q R · ∇R θ ≥ 0. −ρ R (ψ˙ + η θ) θ

8.12 A New Scheme Associated with the Hyperstress

543

Seemingly the skew part N of G does not enter the entropy inequality and this would ˙ However ∇R · G comprises be consistent with the vanishing of the power N · ∇R F. ˙ ∇R · N and this is consistent with the nonzero value of the power (∇R · N) · F. We let ψ, η, S, G, q R and k R be given by functions of the set ˙ (F, θ, ∇R F, ∇R θ, F) of independent variables. Now, to save writing, we observe that the role of ∇ R · k R here proves inessential. We can prove that, as a consequence of the entropy inequality, ∂∇R F k R = 0, ∂∇ R θ k R = 0, and ∂F k R = 0. Moreover, θ2 ∂θ k R provides a physically inessential contribution to q R as well as θ∂F˙ k R is a possible minor correction to M. Then no significant generality is lost by letting k R = 0. Upon computation of ψ˙ the entropy inequality takes the form −ρ R (∂θ + η)θ˙ + (−ρ R ∂F ψ + S + ∇R · G) · F˙ + (−ρ R ∂∇R F ψ + M) · ∇R F˙ 1 −ρ R ∂∇R θ ψ · ∇R θ˙ − ρ R ∂F˙ ψ · F¨ − q R · ∇ R θ ≥ 0. θ The standard procedure shows that the entropy inequality holds if and only if ∂∇R θ ψ = 0, and

∂F˙ ψ = 0,

η = −∂θ ψ,

M = ρ R ∂∇R F ψ

(8.46)

1 (−ρ R ∂F ψ + S + ∇R · G) · F˙ − q R · ∇R θ ≥ 0. θ

Since N is subject to (8.45) then also N and G are functions of F, θ, ∇R F via ∂∇R F ψ. ˙ If cross-coupling Consequently only S and q R are allowed to depend on ∇R θ and F. terms are ignored then the reduced inequality implies that ˜ S = −∇R · G + ρ R ∂F ψ + S,

q R · ∇ R θ ≤ 0,

˙ satisfies ˜ where S(F, θ, ∇R F, ∇R θ, F) S˜ · F˙ ≥ 0 and hence models viscous effects. For definiteness we observe that ˜ T ) · D. ˜ T ) · L = (SF S˜ · F˙ = (SF Thus we can take the Newtonian model ˜ T = 2μD + λ(tr D)1, SF whence

μ ≥ 0, 2μ + 3λ ≥ 0,

(8.47)

544

8 Aging and Higher-Order Grade Materials

S˜ · F˙ = 2μ|D|2 + λ(tr D)2 ≥ 0. Quite similar results occur in the two procedures. Look at (8.46) and (8.47). Since S + ∇R · G = T R then it follows that ˜ T R = ρ R ∂F ψ + S. Moreover G = ρ R ∂∇R F ψ + N, N being determined by M via (8.45). In the previous subsection, T R is given by (8.39) which coincides with (8.47). As for G, the same term ρ R ∂∇R F ψ arises in (8.40). The difference is given by the additional skew term N (8.45) which serves to get consistency of the whole scheme. Instead, in (8.40) the additional terms are related to the extra-entropy flux k R and can be made to vanish if a simpler scheme is in order.

Chapter 9

Mixtures

Mixtures consist of different substances which keep their own properties. The individual molecules enjoy being near to each other so that, in the mathematical model, each point in the mixture may be occupied by all of the substances simultaneously. If the chemical structure of a substance changes when it enters the mixture then the mixture is said to be chemically reacting. The easiest mixtures to deal with are gaseous mixtures in which gases readily mix. Fluid mixtures may be formed from fluids or from a fluid and gases that dissolve in the fluid or from a fluid and solids that dissolve in the fluid. In most cases, a fluid is preponderant and hence is called the solvent while the remaining constituents are called solutes. Both fluid mixtures and solid mixtures are examined. The mixture is regarded as ideal in that the volumetric and energetic properties of the mixture are taken to be the linear combination of those of the constituents or species. The chapter provides a derivation of the balance equations by starting with global balances and allowing for interaction terms (growths and supplies). The second law is stated in terms of the constituents and is expressed by the non-negative value of the sum of the entropy productions. Details are given for the balance equations of the whole mixture as a single body; an analysis is given of the various forms of the second-law inequality (Gibbs equation) occurring in the literature. Diffusion is considered through various known models (Fick, Maxwell–Stefan, Cahn–Hilliard, Allen–Cahn). Moreover immiscible mixtures are investigated.

9.1 Kinematics We consider n (fluid or solid) constituents occupying a time-dependent region  ⊂ E . The functions describing the evolution of the mixture have  × R as their common space-time domain. The subscripts α, β = 1, 2, ..., n label the quantities pertaining to © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_9

545

546

9 Mixtures

the constituents. Hence ρα is the mass density, vα the velocity, εα the internal energy density, on  × R, of the αth constituent. Fields pertaining to single constituents are named partial (or peculiar). Hence vα is a partial velocity. The constituents are regarded as continuous bodies and are supposed to occupy the regions R1 , ..., Rn , in the three-dimensional space E , in their reference configurations. The motion of the αth constituent is given by a function x = χα (Xα , t), where x ∈  is the position vector at time t of the point at Xα ∈ Rα . For any region Pα ⊆ Rα we denote by P tα the region of space occupied by the αth constituent at time t, Pαt = χα (Pα , t) := {x ∈ E : x = χα (Xα , t), Xα ∈ Pα }. The whole region  occupied by the mixture is the union of χα (Rα , t),  = ∪ χα (Rα , t). α

Here and in sums it is tacitly understood that the label of the constituent, say α or β, runs over 1, 2, ..., n. t The functions {χα } are invertible, at each time t, so that Xα = χ−1 α (x, t), x ∈ Pα , and is twice continuously differentiable with respect to Xα and t. Hence vα = ∂t χα (Xα , t),

aα = ∂t2 χα (Xα , t)

are the velocity and acceleration of Xα at time t. The symbol ∇Xα , or ∂Xα , denotes the gradient with respect to Xα . The deformation gradient Fα is then defined by1 Fα = ∇Xα χα ,

α F pK = ∂ X αK χαp .

The velocity vα may be expressed as vˆ α (Xα , t) = vˆ α (χ−1 α (x, t), t) =: vα (x, t). The velocity gradient Lα is defined by Lα = ∇vα (x, t),

L αpq = ∂xq v αp (x, t).

A backward prime affixed to a symbol with a subscript α denotes the αth peculiar time derivative, namely, the material derivative following the motion of the αth constituent. For instance, aα = v` α . Some properties are now established about peculiar time derivatives. To avoid too many subscripts, the subscript α labelling the constituent sometimes is denoted as a superscript.

1

9.1 Kinematics

547

Starting with the deformation gradient Fα , we have F` α = ∂t ∇Xα χα (Xα , t) = ∇Xα ∂t χα (Xα , t) = ∇vα (x, t)Fα whence

F` α = Lα Fα .

(9.1)

By the invertibility of χα , Fα is non-singular. Having in mind continuous motions we let Jα := det Fα > 0. The computation of the derivative yields J`α = (∂Fα Jα ) · F` α = Jα (Fα−1 )T · F` α . Upon substitution of F` α from (9.1), since (Fα−1 )T · (Lα Fα ) = tr Lα = ∇ · vα , we find

J`α = Jα ∇ · vα .

(9.2)

Let wα and gα be vector- and scalar-valued functions on Rα × R. Let ˜ α (x, t), g˘ α (Xα , t) = g˘ α (χ−1 α (x, t), t) =: g

(9.3)

and the like for wα . Equation (9.3) establishes the connection between the Lagrangian description g˘ α (Xα , t) and the Eulerian description g˜ α (x, t) of a quantity gα . Hence we have ∂ X αK g˘ α (Xα , t) = ∂x p g˜ α (x, t)F pαK and ∂t g˘ α (Xα , t) = ∂t g˜ α (x, t) + vα · ∇ g˜ α (x, t). To save writing, we usually omit the symbols˘and˜over the function on Rα × R and  × R. Hence, in direct notation, ∇Xα gα = ∇gα Fα ,

(9.4)

g` α = ∂t gα + (vα · ∇)gα ,

(9.5)

Equation (9.5) provides the way of computing the peculiar derivative within the Eulerian description. Upon substitution of (9.4) we obtain

548

9 Mixtures

wα · ∇Xα gα = w αK ∂x p gα F pαK = ∂x p (F pαK w αK gα ) − gα ∂x p (F pαK w αK ), whence wα · ∇Xα gα = ∇ · (Fα wα gα ) − gα ∇ · (Fα wα ).

(9.6)

For convenience in calculations we observe that ∇X`α gα = ∂t ∇Xα g˘ α = ∇Xα ∂t g˘ α = ∇Xα g` α and hence we get the commutation property ∇X`α gα = ∇Xα g` α .

(9.7)

Let ρα be the mass density of the αth constituent, namely, the mass of constituent α per unit volume of the mixture. By the ideal character of the mixture we define the mass density ρ of the mixture as ρ=



α ρα ,

and the mass fraction ωα as2 ωα =

ρα . ρ

Moreover we define the velocity of the mixture, or barycentric velocity,3 v as v=



α ωα vα .

Let Mα be the molecular weight (or mass) of the αth constituent. Hence Nα = ρα /Mα is the αth number density of moles. The relation between the number densities {Nα } and the mass fractions {ωα } is Nα Mα ωα =  . β Nβ Mβ Define

Nα xα =  β Nβ

the mole fraction. In general ωα = xα and ωα ≈ xα inasmuch as the molecular weights are nearly equal. Now define 2

In continuum mechanics the mass fraction is often termed concentration and denoted by cα (see, e.g. [64, 336, 427]). In chemistry cα denotes the molar concentration, that is, the number of moles per unit volume. To avoid any misunderstandings, we avoid the use of cα . 3 Also named mean velocity.

9.1 Kinematics

549

 M=

β Nβ Mβ  β Nβ

as the average molar weight of the mixture. Hence Mα Nα Mα ωα =  = xα . N M M β β While Mα is a constant, the average M is related to the composition of the mixture. The relation ωα = xα Mα /M is then operative when the composition of the mixture is approximately constant (in time and space). We denote by ϕα ∈ [0, 1] the volume fraction occupied by the constituent α. Unless there are voids we have  α ϕα = 1. Sometimes the voids are considered as a constituent, other times they are not and then the sum over the constituents gives 

α ϕα

< 1.

Denote by uα the diffusion velocity, uα = vα − v. Letting hα = ρα uα we find that



α hα

=



α ρα (vα

− v) = 0.

(9.8)

The true mass density4 ρ0α of a constituent α is the mass of constituent α per unit volume of constituent α. Hence we have ρα = ρ0α ϕα . By definition,

ρα ρ0 ϕα = α 0 . ωα =  β ρβ β ρβ ϕβ

Only if ρ0α is the same for all of the constituents, i.e. ρ0α = ρ0 , we can say that ω α = ϕα . 4

Also denoted by ραR in the literature.

550

9 Mixtures

If the constituents are incompressible then any true density ρ0α is a constant. In this case ρ`α = ρ0α ϕ` α .

9.2 Balance Equations For any region Pα ⊆ Rα we express balance laws, for the part of constituent α within Pα , in the form 

 ∫ gα dv ` = ∫ Kα n da + ∫ sα dv + ∫ γα dv.

Pαt

Pαt

∂Pαt

Pαt

(9.9)

This means that the content of gα in Pαt changes in time because of boundary and volume terms, Kα and sα , as with single bodies, and also because of the interaction via γα of the αth constituent with the other ones. The fields {γα } are termed growths and are subject to  α γα = 0. To establish the local counterpart of the global balance equation (9.9) we observe ∫ g˜ α (x, t) dv = ∫ g˘ α (Xα , t) J˘α (Xα , t)dv R Pα

Pαt

and hence, by (9.2), 

 ∫ gα dv ` = ∫ (g` α + gα ∇ · vα )(x, t)dv.

Pαt

Pαt

(9.10)

Moreover, by the divergence theorem, ∫ Kα n da = ∫ ∇ · Kα dv.

∂Pαt

Pαt

Consequently, by (9.9) the arbitrariness of Pαt and the continuity of the integrand in , for any t ∈ R, imply that g` α + gα ∇ · vα − ∇ · Kα − sα = γα .

(9.11)

We can now state the balance equations. We postulate the balance equations for any constituent as those for a single body but allow for the pertinent interaction (relative to, e.g. mass, momentum, energy) with the other constituents of the mixture. Continuity and differentiability of the functions are assumed as needed. Balance of mass.

9.2 Balance Equations

551

Let ρα be the mass density so that M(Pαt ) = ∫ ρα dv Pαt

is the mass of constituent α in Pαt . We denote by τα the mass growth (mass produced per unit volume and unit time) due to the pertinent chemical reactions and assume the balance of mass in the form 

 ∫ ρα dv ` = ∫ τα dv,

Pαt

Pαt

for any region Pαt ⊆ , subject to 

α τα

= 0.

(9.12)

By applying (9.11) it follows that ρ`α + ρα ∇ · vα = τα .

(9.13)

For later convenience, we now consider functions of the form gα = ρα h α . In view of (9.10) and (9.13) we find that 

 ∫ ρα h α dv ` = ∫ (ρα h` α + τα h α )dv.

Pαt

Pαt

(9.14)

The result (9.14) is the content of the transport theorem for reacting constituents. Balance of linear momentum. The linear momentum P of the region Pαt is P(Pαt ) = ∫ ρα vα dv. Pαt

Letting Tα be the partial, or peculiar, Cauchy stress tensor and bα the body force per unit mass we assume 

 ∫ ρα vα dv ` − ∫ Tα n da − ∫ ρα bα dv = ∫ mα dv,

Pαt

Pαt

∂Pαt

Pαt

(9.15)

where mα is the growth of linear momentum. The growths {mα } are subject to 

α mα

= 0.

As with the general form (9.9), we find the local version of (9.15) as ρα v` α + ρ`α vα + ρα vα ∇ · vα − ∇ · Tα − ρα bα = mα .

(9.16)

552

9 Mixtures

In view of (9.13) we obtain ρα v` α − ∇ · Tα − ρα bα = pˆ α ,

(9.17)

pˆ α = mα − τα vα .

(9.18)

where

Balance of angular momentum. Let r be the position vector relative to a fixed (base) point. We define the angular momentum L(Pαt ) as L(Pαt ) = ∫ r × ρα vα dv. Pαt

For technical convenience, we represent the angular momentum growth as r × mα + γ α . We then assume that the balance of angular momentum is expressed by 

 ∫ ρα r × vα dv ` − ∫ r × Tα n da − ∫ ρα r × bα dv = ∫ (r × mα + γ α )dv.

Pαt

Pαt

∂Pαt

Pαt

(9.19)

The growths {γ α } are subject to and hence, consistently,





αγα

α (r × mα

=0

+ γ α ) = 0.

Look at r × Tα n and observe α n h ei . r × Tα n = εi jk r j Tkh

Since

α ) = Tkαj + r j (∇ · Tα )k , ∂xh (r j Tkh

by means of the divergence theorem we find that ∫ (r × Tα n)da = ∫ [(wα + r × (∇ · Tα )]dv,

∂Pαt

where

Pαt

wα = εi jk Tkαj ei .

Moreover, since r` is the derivative of r at Xα fixed, r` = vα , then 

 ∫ ρα r × vα dv ` = ∫ r × (ρα v` α + τα vα )dv.

Pαt

Pαt

9.2 Balance Equations

553

Hence we find that ∫ [r × (ρα v` α − ∇ · Tα − ρα bα − mα ) − wα − γ α ]dv = 0.

Pαt

By (9.17), the arbitrariness of Pαt and the continuity of the integrand imply that wα = −γ α . Each stress tensor Tα need not be symmetric. By the definition of wα we find that α − Tqαp = ε pqi γiα . T pq

Let TI =



(9.20)

α Tα .

Hence summation over α of (9.20) results in T I − TTI = 0.

(9.21)

Balance of energy. Denote by εα the internal energy, per unit mass. Hence we define the (internal + kinetic) energy E(Pαt ) as E(Pαt ) = ∫ ρα (εα + 21 vα2 )dv. Pαt

Moreover let qα be the heat flux vector and rα the energy supply, per unit mass, in the αth constituent. We then assume that the balance of energy is expressed by 

 ∫ ρα (εα + 21 vα2 )dv ` − ∫ (vα · Tα − qα ) · n da − ∫ ρα (bα · vα + rα )dv = ∫ eα dv,

Pαt

∂ Pαt

Pαt

Pαt

(9.22)

where the growths {eα } are subject to 

α eα

= 0.

In local form we obtain ρα ε` α + τα (εα + 21 vα2 ) + vα · (ρα v` α − ∇ · Tα − ρα bα ) + ∇ · qα − Tα · Lα + ρα rα = eα .

Upon substitution from the equation of motion (9.17) we obtain the balance of internal energy in the form ρα ε`α + τα (εα − 21 vα2 ) − Tα · Lα + ∇ · qα − ρα rα + mα · vα = eα .

(9.23)

554

9 Mixtures

Equations (9.13), (9.17), and (9.23) are the (local) continuity equation, the equation of motion, and the energy equation with the peculiar derivative, namely, the time derivative at fixed Xα . To sum up, we can state them in the form ρ`α + ρα ∇ · vα = τα ρα v` α − ∇ · Tα − ρα bα = pˆ α , ρα ε`α − Tα · Lα + ∇ · qα − ρα rα = εˆα ,

(9.24)

εˆα = eα − mα · vα − τα (εα − 21 vα2 ).

(9.25)

where Hence, τα , pˆ α , εˆα represent the interaction effects on mass, linear momentum, and energy, relative to the observer following the motion of the αth constituent. Consequently, τα , pˆ α , εˆα are objective quantities.  Moreover, in light of the balance of angular momentum, it follows that T I = α Tα is symmetric. Since Eq. (9.24) involves the peculiar derivative, we can view them as the (local) balance equations in the Lagrangian description. For completeness and later convenience, we now establish the corresponding equations in the Eulerian description.

9.2.1 Balance Equations in the Eulerian Description The balance equations (9.13), (9.17), and (9.23) are now reviewed by applying Eq. (9.5) for the peculiar time derivative. Upon substitution of ρ`α from (9.5) we have ∂t ρα + vα · ∇ρα + ρα ∇ · vα = τα . Hence we can write ∂t ρα + ∇ · (ρα vα ) = τα .

(9.26)

By (9.26), substitution of ρα = ρωα ,

vα = v + uα

results in ρ(∂t ωα + v · ∇ωα ) + ωα (∂t ρ + ∇ · (ρv)) + ∇ · (ρα uα ) = τα whence ρω˙ α = −∇ · hα + τα .

(9.27)

For any (scalar, vector, tensor) function h α observe ρα h` α = ρα ∂t h α + ρα vα · ∇h α = ∂t (ρα h α ) + ∇ · (ρα h α vα ) − h α [∂t ρα + ∇ · (ρα vα )].

9.2 Balance Equations

555

In view of (9.26), we obtain ρα h` α = ∂t (ρα h α ) + ∇ · (ρα h α vα ) − τα h α ,

(9.28)

where h α vα has to be meant as a dyadic product h α ⊗ vα if h α is a vector or a tensor and ∇ · (u ⊗ v) = ∂xi (u j vi )e j . Equation (9.28) is an identity that yields a direct connection between Lagrangian and Eulerian descriptions. By identifying h α of (9.28) with vα and εα we obtain the balance equations (9.24) in the form ∂t ρα + ∇ · (ρα vα ) = τα ∂t (ρα vα ) + ∇ · (ρα vα ⊗ vα ) − ∇ · Tα − ρα bα = mα , ∂t (ρα εα ) + ∇ · (ρα εα vα ) − Tα · Lα + ∇ · qα − ρα rα = λα ,

(9.29)

where mα = pˆ α + τα vα , as in (9.18), and λα = eα − mα · vα + 21 τα vα2 . Alternatively, we might replace h` α from (9.28) in (9.14) to obtain ρα h` α + τα h α = ∂t (ρα h α ) + ∇ · (ρα h α vα ) and hence obtain the result of the transport theorem in the form 

 ∫ ρα h α dv ` = ∫ [∂t (ρα h α ) + ∇ · (ρα h α vα )]dv.

Pαt

Pαt

(9.30)

By applying (9.30) to (9.15) and (9.22) we obtain at once the second equation of (9.29) and ∂t [ρα (εα + 21 vα2 )] + ∇ · [ρα (εα + 21 vα2 )vα − vα Tα + qα ] − ρα bα · vα − ρα rα = eα . (9.31) ` for any (scalar- or tensor-valued) function It is also of interest to compute ∇g α gα on  × R. Since ` α = (∂t + vα · ∇)wα w for any function wα on  × R then, for any C 2 function gα we have ` = (∂ + v · ∇)∇g = ∇∂ g + (v · ∇)∇g = ∇∂ g + ∇(v · ∇g ) − LT ∇g ∇g α t α α t α α α t α α α α α whence

` = ∇ g` − LT ∇g . ∇g α α α α

(9.32)

556

9 Mixtures

In connection with vectors we may need further identities. Look at a component, say wiα of a vector wα on  × R. Then ∂xk`wiα = (∂t + vhα ∂xh )∂xk wiα = ∂xk ∂t wiα + ∂xk (vhα ∂xh wiα ) − (∂xk vhα )∂xh wiα whence

∂xk`wiα = ∂xk w`iα − L αhk ∂xh wiα .

In compact form

` α − LαT ∇ ⊗ wα . ∇ ⊗` wα = ∇ ⊗ w

Two particular significant cases arise. If we take the trace we get the identity for the divergence, ` − (LT ∇) · w . ∇ ·`w = ∇ · w (9.33) α

α

α

α

Instead the skew part yields ` α − (LαT ∇) × wα . ∇ ×` wα = ∇ × w

(9.34)

9.2.2 Mass Density in the Reference Configuration and Incompressibility Denote by α the mass density in the reference configuration Rα ; it is also denoted as ρα R in the literature. Unlike what happens in a single body, in reacting mixtures the reference mass density α depends on time. To clarify this dependence we start with the invariance of mass from Pα ⊆ Rα to the image χα (Pα , t), ∫ α (Xα , t)dv R =





χα (Pα ,t)

ρα (x, t)dv.

Under the change of (vector) variable x ∈ χα (Pα , t) → Xα ∈ Pα the Jacobian is Jα and we have ∫ ρα (x, t)dv = ∫ (ρα Jα )(Xα , t)dv R . χα (Pα ,t)



Hence it follows ∫ ( α − ρα Jα )(Xα , t)dv R = 0.



The arbitrariness of Pα and the supposed continuity of the integrand implies that

α = ρα Jα

(9.35)

9.2 Balance Equations

557

at any point Xα ∈ Pα and time t. Upon time differentiation we obtain

`α = τα Jα .

(9.36)

Equation (9.36) shows that, in non-reacting mixtures, where τα = 0, the reference density α is constant. For reacting mixtures, instead, the reference density α depends on time and (9.36) is the value of the time derivative. For single bodies, incompressibility means that the mass density is constant at any point of the body, ρ˙ = 0, and hence incompressibility may be characterized by the constraint ∇ · v = 0. For a mixture, we may consider constituents that are regarded as intrinsically incompressible, namely, incompressible as single bodies. To make this operative, let ρ0α be the (constant) true density of constituent α, possibly dependent on Xα . Then letting ϕα be the volume fraction of constituent α we can write ρα = ρ0α ϕα and, consequently, ρ0α ϕα =

α . Jα

Hence, it follows that ϕα Jα is a function of the referential position Xα . For formal simplicity we may let α = ρ0α . Hence, for incompressible constituents, ϕα Jα = 1. The continuity equation (9.13) implies ϕ` α + ϕα ∇ · vα =

τα , ρ0α

− J`α + Jα ∇ · vα =

τα Jα2 . ρ0α

These equations are characteristic of incompressible constituents. The constraint ∇ · vα = 0 need not hold for incompressible constituents. The assumption of saturation (no voids), 

α ϕα

= 1,

implies that 0=



α ∇ϕα ,

0=



α ∂t

ϕα =



α [−∇

· (ϕα vα ) +

τα ]. ρ0α

Upon substitution of vα = v + uα it follows that  τα (∇ϕα · uα + ϕα ∇ · vα − 0 ) = 0. ρα α

(9.37)

558

9 Mixtures

9.3 Second Law of Thermodynamics Let ηα be the entropy density (per unit mass) and θα the absolute temperature. We assume the balance of entropy in the form 

 ∫ ρα ηα dv ` + ∫ jα · n da − ∫ ρα sα dv = ∫ σα dv,

Pαt

∂Pαt

Pαt

Pαt

(9.38)

where jα is the entropy flux and sα the entropy supply. The entropy growth σα may account for two aspects, namely, dissipative effects within the αth constituent and interaction with the other constituents of the mixture. The local counterpart of (9.38) follows in the Lagrangian and Eulerian descriptions by using (9.14), ρα η`α + τα ηα + ∇ · jα − ρα sα = σα ,

(9.39)

∂t (ρα ηα ) + ∇ · (jα + ρα ηα vα ) − ρα sα = σα .

(9.40)

and (9.30), As with single bodies, the entropy supply sα is identified with rα /θα . The entropy growth σα can also be viewed as the entropy production, within the αth constituent. As an assumption, we now state the second law of thermodynamics.5 The inequality  (9.41) α σα ≥ 0 must hold, at any point x ∈  and for any time t ∈ R, for any set of functions on  × R compatible with the balance equations. The entropy fluxes {jα } are unknown and have to be determined, along with the other constitutive functions, so that the second law is satisfied. For formal convenience, we let qα jα = + kα , θα the (unknown) vector kα representing the extra-entropy flux. As with the second law for single bodies, we may take the entropy productions {σα } as given by constitutive functions. The analysis of compatibility of constitutive assumptions with the entropy inequality (9.41) is more direct if the peculiar derivative is involved. Hence, we look at (9.39) and write the entropy inequality (9.41) in the form 

5

α [ρα η`α

+ τα ηα + ∇ ·

Or axiom of dissipation, [427].

rα qα + ∇ · kα − ρα ] ≥ 0. θα θα

9.3 Second Law of Thermodynamics

559

Substitution of ∇ · qα − ρα rα from (9.23) yields 

1 1 1 2 α { θ [ρα θα η` α + τα θα ηα + eα − mα · vα + Tα · Lα − ρα ε` α − τα (εα − 2 vα ) − θ qα · ∇θα + θα ∇ · kα ]} ≥ 0. α α

Using the Helmholtz free energy ψα = εα − θα ηα we can write the inequality in the form 

α{

1 1 [−ρα (ψ` α + ηα θ` α ) − τα (ψα − 21 vα2 ) + eα − mα · vα + Tα · Lα − qα · ∇θα + θα ∇ · kα ]} ≥ 0. θα θα

(9.42) By (9.25), inequality (9.42) can also be written in terms of the supplies εˆα as follows: 

1

1

` ` α { θ [−ρα (ψα + ηα θα ) + τα θα ηα + εˆ α + Tα · Lα − θ qα · ∇θα + θα ∇ · kα ]} ≥ 0. α α

(9.43)

Next we show how the balance equations of the constituents allow the derivation of the balance equations of the mixture as a single body. Here we look at the possibility of deriving constitutive properties of the mixture as a whole along with properties of the constituents. In this sense, it is of interest to look at (9.42), or (9.43), under the assumption that the mixture is subject to a single (common) temperature, θα = θ, α = 1, 2, ..., n. If this is viewed as a constraint we have θ`α = ∂t θ + vα · ∇θ = θ˙ + (vα − v) · ∇θ. Consequently   1 1 1 − α [ρα ηα θ`α + qα · ∇θα ] → − {ρη θ˙ + α [qα + ρα ηα (vα − v)] · ∇θ}. θα θα θ Likewise we use the continuity equation (9.27) and find that [61] 

  ˙ ` = ˙ ˙ α + hα · ∇ψα ] α ρα ψα + ρα uα · ψα ] = α [ρ ωα ψα − ρ ψα ω    ˙ ˙ = ρψ I − α [ψα (τα − ∇ · hα ) + hα · ∇ψα = ρψ I + ∇ · α ψα hα − α τα ψα . α ρα ψα

Hence inequality (9.42) can be written in the form  2 1 ˙ − 1 q˜ · ∇θ + ˆ α · uα + θ∇ · kα − ∇ · (ψα hα )] ≥ 0, − ρ(ψ˙ I + η θ) α [Tα · Lα − 2 τα uα − p θ

(9.44)   where ψ I = α ωα ψα and q˜ = α (qα + ηα hα ). Yet, this inequality requires that we look for ψ I , η, q˜ as properties of the whole mixture while Tα , pˆ α , kα are related to the constituents. Moreover, unless there is

560

9 Mixtures

a mechanism equalling the temperatures, from a physical viewpoint the assumption that the constituents are subject to a common temperature merely means that the temperatures θα are almost equal thanks to the exchange of energy among the constituents. Hence, we find it reasonable to determine peculiar properties of the constituents and next to examine the remaining dissipation inequality within the = θ. assumption (or approximation) θα  If θα = θ then we observe that α eα = 0 and, in light of (9.42), we consider  :=



α [mα

· vα − 21 τα vα2 ].

By (9.18) we have mα · vα − 21 τα vα2 = pˆ α · vα + 21 τα vα2 . For any velocity V let vˇ α = vα − V. Hence, by (9.12) we have



α τα vα

=



ˇ α. α τα v

Consequently, upon replacing vα with V + vˇ α the constraint (9.16) results in 0=



α mα

=

  (pˆ α + τα vα ) = α (pˆ α + τα vˇ α ).

(9.45)

α

By computing and using (9.12) and (9.45) we obtain =



ˆα α (p

· vα + 21 τα vα2 ) =



ˆα α (p

· vˇ α + 21 τα vˇ α2 ).

Thus the scalar  is objective, namely, is invariant under a change of frame, vα → vˇ α = vα − V. In particular, we can take V as the barycentric velocity v, and hence vˇ α = uα .

9.3.1 Principle of Phase Separation A classical rule for the formulation of constitutive equations is the principle of equipresence6 which states that a variable present as an independent variable in one constitutive equation should be so present in all, unless the symmetry of the material, the principle of material objectivity, and the second law of thermodynamics require otherwise. 6

See [429], pp. 703–704, and [95].

9.4 Balance Equations for the Whole Mixture

561

According to Passman and Nunziato [356], in mixtures (where) the individual constituents are clearly separated physically, it is plausible to think the material properties of the constituents as being separated from one another.7 The principle of equipresence is then replaced by the principle of phase separation. By this principle, the material-specific-dependent variables of a given constituent (such as the stress Tα and the internal energy εα ) depend only on the independent variables of that constituent. The interaction terms (such as the interaction force mα ) depend on all the independent variables.

9.3.2 Remarks on the Thermodynamic Restrictions Assume the constitutive functions have the form ψα = ψα (θα , ∇θα , ϒα ), ` . Hence inequality where ϒα is a set of variables independent of θα , ∇θα , θ`α , ∇θ α (9.42) takes the form 

α{

1 ` + ...]} ≥ 0, [−ρα (∂θα ψα + ηα )θ`α − ρα ∂∇θα ψα ∇θ α θα

(9.46)

` . Since ε depends on θ and the dots indicating terms independent of θ`α and ∇θ α α α ∇θα then, by (9.23), we have ` + ... = ρ r . ρα ∂θα εα θ`α + ρα ∂∇θα εα ∇θ α α α At any point x and time t, θ`α can take any scalar value and meanwhile (9.23) holds by letting rα take the appropriate value. Similarly, we can take the gradient of (9.23) ` is allowed by letting ∇r be appropriate. and observe that the arbitrary value of ∇θ α

α

` . The Consequently, inequality (9.46) has to hold for arbitrary values of θ`α and ∇θ α linearity then implies that ηα = −∂θα ψα ,

∂∇θα ψα = 0.

9.4 Balance Equations for the Whole Mixture It is natural to ask for the balance equations of a mixture as a single body and to look for the connection with the balance equations for the constituents. It is a 7

See, also, [329].

562

9 Mixtures

guiding principle of the theory that the motion of the mixture is governed by the same equations as is a single body [427]. This principle in turn should indicate the appropriate definitions of the quantities pertaining to the whole mixture8 in terms of the quantities related to the constituents. The starting point is the set of balance equations in Eulerian form (9.29) which enjoy the conceptual advantage of being referred to a single observer. Summation of (9.29)1 over α, the definitions ρ=



α ρα ,

ρv =



α ρα vα ,

and the constraint (9.12) yields ∂t ρ + ∇ · (ρv) = 0,

(9.47)

which is the continuity equation for single bodies with mass density ρ and velocity v. The barycentric velocity v plays the role of the velocity of the mixture viewed as a single body. Denote by a superposed dot the derivative following the motion defined by the velocity v. For any (scalar or vector) function w(x, t), we have ˙ = ∂t w + v · ∇w. w Since ∇ · (ρv) = ρ∇ · v + (v · ∇)ρ then (9.47) can be written in the form ρ˙ + ρ∇ · v = 0.

(9.48)

Following are some mathematical consequences of (9.47). First, in light of (9.13) we have ρω ˙ α + ρω˙ α = ρω˙ α = −ρα ∇ · (v + uα ) + τα − (uα · ∇)ρα . Consequently, in view of (9.48) it follows ρω˙ α = −∇ · (ρα uα ) + τα .

(9.49)

Since ∂t (ρw) + ∇ · (ρw ⊗ v) = w[∂t ρ + ∇ · (ρv)] + ρ(∂t w + v · ∇w) in view of (9.47) we find the identity ˙ ∂t (ρw) + ∇ · (ρw ⊗ v) = ρw.

8

For example, internal energy, stress tensor, and heat flux.

(9.50)

9.4 Balance Equations for the Whole Mixture

563

We now examine Eq. (9.29)2 . Observe that replacing vα with v + uα and accounting for the property (9.8) we have 

α ρα vα

⊗ vα = ρv ⊗ v +



α ρα uα

⊗ uα .

Hence summation of (9.29)2 over α and using (9.16) we find ∂t (ρv) + ∇ · (ρv ⊗ v) + ∇ · where TI =



Letting T=



α ρα uα

α Tα ,



⊗ uα − ∇ · T I − ρb = 0,

ρb =

α (Tα



(9.51)

α ρα bα .

− ρα uα ⊗ uα ).

(9.52)

and using (9.50) we can write Eq. (9.51) as ρ˙v = ∇ · T + ρb.

(9.53)

Equation (9.53) has the standard form of the equation of motion for a single body. The stress tensor T is given by (9.52) and is the result of the partial stresses {Tα } and the additive contribution − α ρα uα ⊗ uα originated by the diffusion velocities {uα }. As a consequence of the balance of angular momentum Eq. (9.21) holds and states that T I is symmetric. Since T = TI −



α ρα uα

⊗ uα

then the symmetry of T I implies that of T. To obtain the balance of energy we first consider (9.31) which involves the total energy density εα + 21 vα2 . Replace vα by v + uα and then evaluate the sum of (9.31). In view of the constraint (9.8) and the definitions of T and b we can write the resulting equation as ∂t ( 21 ρv2 ) + ∇ · ( 21 ρv2 v) + ∂t (ρε) + ∇ · (ρεv) − ∇ · (vT) − ρb · v   +∇ · α [qα − uα Tα + ρα (εα + 21 uα2 )uα ] − α (ρα rα + ρα bα · uα ) = 0, where ρε =



α ρα (εα

+ 21 uα2 ).

(9.54)

Now, by using the identity (9.50) and the balance equation (9.53) we find that ∂t ( 21 ρv2 ) + ∇ · ( 21 ρv2 v) − ∇ · (vT) − ρb · v = v(ρ˙v − ∇ · T − ρb) − T · D = −T · D,

564

9 Mixtures

where the symmetry of T is used. Since ˙ ∂t (ρε) + ∇ · (ρεv) = ρε, the remaining terms provide the standard balance of internal energy, ρε˙ = T · D − ∇ · q + ρr,

(9.55)

provided only that we define q and r by the equations q=



α [qα

ρr =

− uα Tα + ρα (εα + 21 uα2 )uα ], 

α (ρα r α

+ ρα bα · uα ).

(9.56) (9.57)

It is worth remarking that the quantities associated with the mixture are not merely the sum, or the average, of the corresponding quantities associated with the constituents. While ρ=



α ρα ,

v=



α ωα vα ,

b=



α ωα bα ,

analogous relations are not true for T, ε, q, and r as it follows from (9.52), (9.54), (9.56), and (9.57). Only if the diffusion velocities {uα } vanish, which is a trivial kinematic case, T and q are the sum while ε and r are the average. If h α and kα are partial fields, the fields h=



αhα,

k=



α ωα k α

are named inner part of h and k. Only for the mass density and the velocity, the field associated with the whole mixture equals the inner part. Moreover, the inner part of ρ is merely the sum whereas that of v is the average. It is worth deriving an approximated balance of energy occurring in the literature. Let the constituents be incompressible inviscid fluids so that Tα = − pα 1. The power T · D becomes    T · D = − α ( pα 1 + ρα uα ⊗ uα ) · D = −( α pα )∇ · v − ( α ρα uα ⊗ uα ) · D. If the last term is ignored and it is assumed that ∇ · v = 0 then T · D = 0. In addition, if third-order terms 21 ρα uα2 uα in the expression of q are ignored then q = qI +



α ( pα uα

+ ρα εα uα ).

In terms of the enthalpies9 χα = pα /ρα + εα we can write 9

Often in the literature the specific enthalpy is denoted by h; here we do not use h α to avoid ambiguities with the diffusion flux hα .

9.4 Balance Equations for the Whole Mixture

565

q = qI +



α χα hα .

Moreover neglect the energy supply ρr . Within these approximations we obtain the balance of energy in the form ρε˙ = −∇ · q I − ∇ ·



α χα hα .

(9.58)

The entropy principle for the whole mixture As a metaphysical principle, the balance equations of a mixture are the same as those for a single body [427]. The second law of thermodynamics (or entropy principle) can then be stated by assuming that there are an entropy density η (of the mixture), an entropy flux j, and an entropy supply s such that the inequality ρη˙ + ∇ · j − ρs ≥ 0

(9.59)

holds for the set of constitutive equations subject to the balance equations. To guarantee the consistency with (9.41), we consider (9.40) and sum over α to obtain ∂t



α ρα ηα

+∇ ·



α ρα ηα vα

+∇ ·



α jα





α ρα sα

≥ 0.

This suggests that we define η by ρη =



α ρα ηα .

Upon replacing vα by v + uα and using the identity (9.50) we find ρη˙ + ∇ ·



α (jα

+ ρα ηα uα ) −



α ρα sα

≥ 0.

(9.60)

This means that the second law in the form (9.41) is consistent with the entropy principle (9.59) on condition that we make the identifications j=



α (jα

+ ρα ηα uα ),

ρs =



α ρα sα ,

in addition to the definition of η as the average of the entropies {ηα }. The entropy flux j will be taken in the form j = q/θ + k with j a constitutive function to be characterized by the second-law inequality. Instead s has to be defined and two definitions seem to be consistent but inequivalent. In the 70s, the literature (e.g. [61, 206, 334]) favoured the definition s = sr := Alternatively we might assume

1 ωα r α . θ α

566

9 Mixtures

s = sr b :=

1 ωα (rα + uα · bα ). θ α

For instance, Müller [334] supports sr on the basis of the kinetic theory of gases. Instead sr b may be justified by both rα and uα · bα being external powers. We now show the consequences of the two definitions of s. Since we consider the whole mixture it is natural to assume that the constituents have a common temperature θ. Hence in both cases, upon multiplication of (9.60) by θ it follows ρθη˙ + θ∇ · j − θ



α ρα sα

≥ 0.

(9.61)

Let s = sr . Since the balance of internal energy (9.55) can be given the form ρε˙ = T · D − ∇ · q +



α hα

· bα +

 α

ρα rα ,

then the inequality can be written as −ρε˙ + ρθη˙ + θ∇ · j + T · D − ∇ · q +



α hα

· bα ≥ 0.

Letting, as is customary for single bodies, ε = ψ + θη and j = q/θ + k we find the second-law inequality in the form  1 − ρψ˙ − ρη θ˙ + T · D − q · ∇θ + θ∇ · k + α hα · bα ≥ 0. θ

(9.62)

Let s = sr b . The second-law inequality becomes 1 − ρψ˙ − ρη θ˙ + T · D − q · ∇θ + θ∇ · k ≥ 0. θ

(9.63)

Inequalities (9.62) and (9.63) are the Clausius–Duhem inequalities for the whole mixture in terms of the Helmholtz free energy ψ = ε − θη. The difference between  the two inequalities is that α hα · bα occurs if we let s = sr . This term vanishes, and the inequalities are equal, if bα is the same for all constituents, bα = b, in that 

α hα

· bα = (



α hα )

· b = 0.

This is so, e.g. if the body force is due to gravity, bα = g being the gravity acceleration. Of course, no ambiguity arises if the second law is considered in the detailed form (9.42). To conclude, the properties of the whole mixture, in terms of the partial quantities of the constituents, are given as follows: ρ=



α ρα ,

v=



α ωα vα ,

b=



α ωα bα ,

T=



α (Tα − ρα uα ⊗ uα ),

9.4 Balance Equations for the Whole Mixture ε=



η=

1 2 α ωα (εα + 2 uα ),



α ωα ηα ,

s=

Moreover it is

q= 



567



1 2 α [qα − uα Tα + ρα (εα + 2 uα )uα ],

α ωα sα ,

α ωα = 1,

j=





α( θ

α



α ωα uα = 0,

+ kα + ωα ηα uα ), 

α mα = 0,

r=

ψ=



α ωα (rα + bα · uα )



1 2 α ωα (ψα + 2 uα ).



α eα = 0,

and mα = pˆ α + τα vα ,

eα = εˆα + mα · vα + τα (εα − 21 vα2 ).

The quantities TI =



α Tα ,

εI =



α ω α εα ,

qI =



α qα ,

ψI =



α ωα ψα

are the inner parts of T, ε, q, ψ. The second law of thermodynamics for mixtures is given different forms depending on whether we give evidence to the single constituents or we look at the mixture as a whole. So far (9.41) and (9.59) should yield the same consequences on the constitutive models. In classical approaches though inequivalent constitutive models may arise merely because of the pertinent assumptions. Section 9.10 shows the derivation of some models with different thermodynamic assumptions.

9.4.1 Stoichiometry Requirements on the Mass Growths In a chemical reaction, also the atomic substances making up the constituents are indestructible. This results in additional requirements on the mass growths {τα }. As an example, in a mixture of carbon, C; molecular oxygen, O2 ; carbon dioxide, CO2 ; and carbon monoxide, CO, the atoms of C and O are conserved. Let Mα be the molecular weight of the αth constituent E α . Hence τα /Mα is the number of moles of E α produced or absorbed per unit time and unit volume. Let Tiα be the number of atoms Ai in the constituent E α . Relative to the example, A1 = C, A2 = O, and {T1α } = (1, 0, 1, 1),

{T2α } = (0, 2, 2, 1).

The net molar rate of production of the ith atomic element results in 

α α Ti

τα =0 Mα

(9.64)

568

9 Mixtures

for every ith atomic substance. Equation (9.64) for any i is a set of restrictions on the growths {τα }. Let Wi be the atomic weight of Ai . Hence Mα =



α i Ti Wi .

Multiplying Eq. (9.64) by Wi and sum over i, we have 0=

 i

α α Ti Wi

    τα τα τα = α ( i Tiα Wi ) = α Mα = α τα ; Mα Mα Mα

we then find again that the growths {τα } have a zero net value.

9.4.2 Balance Equation for the Diffusion Flux We now follow a general scheme of diffusion flux, consistent with the notion that diffusion is relative motion of constituents. First let v∗ be a reference velocity field. For generality, we let v∗ stand for the velocity of any constituent or that of the mixture (barycentric velocity). We then consider the diffusion flux hα∗ := ρα (vα − v∗ )

(9.65)

relative to v∗ . The interest in the flux hα∗ , relative to the velocity v∗ of a constituent, is associated with the motion of the solute relative to the solvent or the motion of a constituent relative to markers in a crystal lattice10 [390]. The evolution (or balance) equation of hα∗ is inherited from the continuity equations and the equations of motion, (9.24) and (9.53), as appropriate. Next developments require a careful distinction between peculiar derivatives. Hence for functions gα , g∗ on  × R we let g` α = (∂t + vα · ∇)gα ,

g` ∗ = (∂t + v∗ · ∇)g∗ .

Moreover, for any function on  × R we need the operator D ∗ := ∂t + v∗ · ∇, namely, the time derivative relative to the observer that moves with velocity v∗ . Let uα∗ := vα − v∗ . It follows that 10

In these cases, v∗ is the velocity of the solvent or that of the markers.

9.4 Balance Equations for the Whole Mixture

569

D ∗ gα = (∂t + v∗ · ∇)gα = (∂t + vα · ∇)gα − uα∗ · ∇gα . Hence, for any function gα on  × R we have the identity D ∗ gα = g` α − uα∗ · ∇gα .

(9.66)

To evaluate D ∗ hα∗ we first observe that, by means of (9.66) and (9.65), D ∗ hα∗ = (ρα vα )`− uα∗ · ∇(ρα vα ) − (ρ`α − uα∗ · ∇ρα )v∗ − ρα D ∗ v∗ . Now, by (9.24) we have (ρα vα )` = ∇ · Tα + ρα bα + pˆ α + vα (−ρα ∇ · vα + τα ).

(9.67)

If v∗ = vβ then, by (9.24)2 , ρα D ∗ v∗ = ρα v` β =

ρα (∇ · Tβ + ρβ bβ + pˆ β ). ρβ

If, instead, v∗ = v then by (9.53) we have ρα D ∗ v∗ = ρα v˙ =

ρα (∇ · T + ρb). ρ

Let ρ∗ = ρβ , ρ,

T∗ = Tβ , T,

b∗ = bβ , b,

pˆ ∗ = pˆ β , 0,

according as v∗ = vβ , v. Hence, in any case, we can write ρα D ∗ v∗ =

ρα (∇ · T∗ + ρ∗ b∗ + pˆ ∗ ). ρ∗

Upon the appropriate substitutions and some rearrangements, we obtain D ∗ hα∗ = ∇ · Tα −

ρα ρα ∇ · T∗ + ρα (bα − b∗ ) + pˆ α − pˆ ∗ + τα uα∗ ρ∗ ρ∗ ∗ ∗ −uα · ∇(ρα vα ) − ρα uα ∇ · vα + (uα∗ · ∇ρα )v∗ .

Now, replace vα with v∗ + uα∗ and observe −uα∗ · ∇(ρα vα ) − ρα uα∗ ∇ · vα = −∇ · (ρα uα∗ ⊗ uα∗ ) − (uα∗ · ∇ρα )v∗ − ρα uα∗ · ∇v∗ − ρα uα∗ ∇ · v∗ .

Denote by L∗ the gradient of the reference velocity field, L∗ = ∇v∗ .

570

9 Mixtures

Hence

−ρα uα∗ · ∇v∗ − ρα uα∗ ∇ · v∗ = −[L∗ + (∇ · v∗ )1]hα∗ .

Consequently, ∗ · ∇(ρ v ) − ρ u∗ ∇ · v + (u∗ · ∇ρ )v = −∇ · (ρ u∗ ⊗ u∗ ) − [L + (∇ · v )1]h∗ . −uα α α α α α α ∗ α α ∗ ∗ α α α

By (9.18) we have pˆ α −

ρα ρα ρα pˆ ∗ + τα uα∗ = mα − mβ − (τα − τβ )vβ , ρ∗ ρβ ρβ

and pˆ α −

ρα pˆ ∗ + τα uα∗ = mα − τα v, ρ∗

if v∗ = vβ

if v∗ = v.

Letting τ∗ = τβ , 0,

m∗ = mβ , 0,

according to v∗ = vβ , 0, we can write pˆ α −

ρα ρα ρα pˆ ∗ + τα uα∗ = mα − m∗ − (τα − τ∗ )v∗ . ρ∗ ρ∗ ρ∗

Hence we find that hα∗ satisfies the balance equation ρα ∇ · T∗ + ρα (bα − b∗ ) ρ∗ ρα ρα +mα − m∗ − (τα − τ∗ )v∗ . (9.68) ρ∗ ρ∗

D ∗ hα∗ + (L∗ + ∇ · v∗ 1)hα∗ = ∇ · (Tα − ρα uα∗ ⊗ uα∗ ) −

The evolution (or balance) equation (9.68) with any reference velocity v∗ was first given in [324]. The particular case of (9.68) with v∗ = v, ρα h˙ α + (L + ∇ · v1)hα = ∇ · (Tα − ρα uα ⊗ uα ) − ∇ · T + ρα (bα − b) + mα − τα v ρ

(9.69) is the balance equation of the diffusion flux hα relative to the barycentric reference. If we let hα = ρ pα and observe h˙ α = ρ p˙ α − ρ(∇ · v)pα then we find that (9.69) simplifies to the form ρ p˙ α + ρ L pα = ∇ · (Tα − ρα uα ⊗ uα ) −

ρα ∇ · T + ρα (bα − b) + mα − τα v ρ

9.4 Balance Equations for the Whole Mixture

571

that traces back to Müller [336]. Equation (9.68) for the diffusion flux hα∗ is a first-order evolution equation with an apparent nonlinearity due to ρα uα∗ ⊗ uα∗ =

1 ∗ h ⊗ hα∗ . ρα α

As α = 1, 2, ..., n, Eqs. (9.68) constitute a coupled system of differential equations, the coupling being given by the velocity gradient L∗ , the stress T∗ , the mass growth τ∗ , and the linear momentum growth m∗ . In addition, the mass densities {ρα } are unknowns too and the equations are operative once the constitutive equations for {Tα }, {mα } and {τα } are given.  It is worth remarking that only n − 1 equations of (9.69) are significant in that α hα = 0. This is consistent with the fact that summation of (9.69) over α produces a trivial identity. In practical applications, the use of (9.68) and (9.69) is quite difficult in view of the occurrence of L∗ (or L) and the fact that they are a coupled system of equations. It is then natural to look for reasonable approximations. In this sense, we examine whether an appropriate approximation leads to Fick’s law. For definiteness look at (9.69). First we assume that bα = b, as it happens for gravity. Moreover, we select the barycentric reference and hence take v = 0 and also L = 0. Further, we disregard the nonlinear term ρα uα ⊗ uα . The growth mα is assumed to be given by the velocity differences between constituents and proportional to the respective densities. We let mα = ξρα

 β

ρβ (vβ − vα ),

ξ being a function of the temperature θ. Since  α

mα = ξ



ρα ρβ (vβ − vα ) = 0,

α,β

(9.70)

the constraint (9.16) is identically satisfied. Moreover, we can write mα in the form mα = ξρα

 β

ρβ (uβ − uα ).

We now restrict attention to binary mixtures. Hence h1 + h2 = 0 and (9.70) implies m1 = ξρ1 ρ2 (u2 − u1 ) = ξ(ρ1 h2 − ρ2 h1 ) = −ξ ρ h1 and m2 = −ξ ρ h2 . Consequently, Eq. (9.69) simplifies to h˙ α + ξ ρ hα = ∇ · Tα − ωα ∇ · T.

(9.71)

572

9 Mixtures

Further approximations are considered by restricting attention to stationary conditions, h˙ α = 0, and regarding the constituents as inviscid fluids, so that Tα = − pα 1,

T  − p1,

Hence Eq. (9.71) reduces to hα = −

pα = ωα p.

p ∇ωα , ξρ

(9.72)

which is just Fick’s law for hα where κα = p/ξρ. Yet, in ideal gases, see Sect. 9.6, pα = xα p, with xα the αth mole fraction; it is xα ≈ ωα if the molecular weights are nearly equal. It is of interest to observe that in uniform conditions, namely, if L, ∇Tα , ∇ · T, and ∇ · (ρα uα ⊗ uα ) vanish, the balance equation (9.69) reduces to h˙ α = −ξ ρ hα . This implies that hα decays in time as hα (t) = hα (0) exp(−ξ ρ t). The interaction force mα results then in an exponential decay of the diffusion flux.

9.5 Constitutive Models for Fluid Mixtures A wide set of models for fluid mixtures is obtained by letting the constitutive functions ψα , ηα , Tα , qα depend (continuously) on the variables α = (θα , ρα , ∇θα , ∇ρα , Dα ). The interaction terms {τα }, {eα }, {mα }, {kα } are allowed to depend on the whole set of variables11 {θα }, {ρα }, {∇θα }, {∇ρα }, {Dα }, {vα − vβ }. For mathematical convenience, the functions ψα and kα are assumed to be continuously differentiable. We start from the second-law inequality (9.42), that is, 

α{

11

1 1 [−ρα (ψ` α + ηα θ` α ) − τα (ψα − 21 vα2 ) + eα − mα · vα + Tα · Lα − qα · ∇θα + θα ∇ · kα ]} ≥ 0. θα θα

In so doing we follow the principle of phase separation [356] rather than the principle of equipresence [427].

9.5 Constitutive Models for Fluid Mixtures

573

Upon evaluation of ψ` α and substitution in (9.42), we find  1 ` + ...] ≥ 0, ` α − ρα ∂∇θα ψα · ∇θ [−ρα (∂θα ψα + ηα )θ`α − ρα ∂Dα ψα · D α α θα ` α }, the dots representing the remaining terms, which are independent of {θ`α }, {D ` ` } ` ` and {∇θα }. The arbitrariness of the peculiar time derivatives {θα }, {Dα }, and {∇θ α requires that ηα = −∂θα ψα , ψα = ψα (θα , ρα , ∇ρα ), for any α = 1, 2, ..., n. Look now at the dependence on ρα and ∇ρα . Apply identity (9.32) to the function ρα and observe −

ρα ρα ` = − ρα ∂ ψ ρ` − ρα ∂ ψ · (∇ ρ` − LT ∇ρ ) ∂ρ ψα ρ`α − ∂∇ρα ψα · ∇ρ α ρ α α ∇ρα α α α α θα α θα θα α θα ρα ρα ρα = −∇ · ( ∂∇ρα ψα ρ`α ) − δρα ψα ρ`α + (∇ρα ⊗ ∂∇ρα ψα ) · Lα , θα θα θα

where δρα ψα := ∂ρα ψα −

θα ρα ∇ · ( ∂∇ρα ψα ) ρα θα

is the (generalized) variational derivative of ψα with respect to ρα . Since ρ`α = −ρα ∇ · vα + τα = −ρα 1 · Lα + τα , upon some rearrangements we find the remaining inequality in the form 

1 2 α { θ [(Tα + ρα δρα ψα 1 + ρα ∇ρα ⊗ ∂∇ρα ψα ) · (Dα + Wα ) − τα (ψα + ρα δρα ψα ) α

2 − 1 q · ∇θ ] + ∇ · [k − ρα ∂ +eα − mα · vα + 21 τα vα α α α ∇ρα ψα (−ρα ∇ · vα + τα )]} ≥ 0. θα θα

The arbitrariness of Wα implies that Tα + ρα ∇ρα ⊗ ∂∇ρα ψα ∈ Sym. Consequently, skwTα = −skwρα ∇ρα ⊗ ∂∇ρα ψα . If ψα depends on ∇ρα via |∇ρα | then ρα ∇ρα ⊗ ∂∇ρα ψα ∈ Sym and hence also Tα ∈ Sym.

574

9 Mixtures

The inequality so obtained indicates that, to within inessential divergence-free terms, we can take the extra-entropy flux kα in the form kα =

ρα ∂∇ρα ψα (−ρα ∇ · vα + τα ). θα

Let pα := ρ2α δρα ψα ,

μα := ψα +

pα . ρα

The function pα may be viewed as the (thermodynamic) αth partial pressure in the mixture; if ψα is independent of ∇ρα then pα = ρ2α ∂ρα ψα . Moreover, we regard μα as the αth chemical potential. The inequality then reduces to  1 1 {(Tα + pα 1 + ρα ∇ρα ⊗ ∂∇ρα ψα ) · Dα − μα τα + eα − mα · vα + 21 τα vα2 − qα · ∇θα } ≥ 0. α θα θα

A sufficient condition is suggested by the fact that ψα , Tα , and qα are functions of variables pertaining to the αth constituent whereas τα , eα , and mα depend on quantities pertaining to all of the constituents. Hence, we let (Tα + pα 1 + ρα ∇ρα ⊗ ∂∇ρα ψα ) · Dα −

1 qα · ∇θα ≥ 0, θα

 1 (−μα τα + eα − mα · vα + 21 τα vα2 ) ≥ 0. α θα

(9.73)

(9.74)

Inequality (9.74) is a restriction on the functions μα , eα , mα , and τα . In addition to (9.74), the functions τα , mα , eα are subject to the constraints 

α τα

= 0,



α mα

= 0,



α eα

= 0,

and the Galilean invariance of pˆ α = mα − τα vα ,

εˆα = eα − pˆ α · vα − τα (εα + 21 vα2 ).

Incidentally, Galilean invariant quantities means that, with respect to frames in relative motion with any velocity V, they appear to be the same. By a change of frame, the velocities of the constituents undergo the transformation vα → vα + V. Incidentally, in connection with (9.74) we remark that eα − mα · vα + 21 τα vα2 = εˆα + τα εα

9.5 Constitutive Models for Fluid Mixtures

575

and hence eα − mα · vα + 21 τα vα2 is a Galilean invariant. For definiteness, the analysis of restrictions is now developed in particular cases.

9.5.1 Extent of Reaction and Law of Mass Action Assume the reacting mixture is at rest (vα = 0) and exchanges of energy are negligible (eα = 0). Inequality (9.74) reduces to  μα τα ≤ 0. α θα

(9.75)

If the mixture consists of two constituents, 1 and 2, then τ2 = −τ1 and (9.75) reduces to  μ1 μ2  − τ1 ≤ 0. θ1 θ2 The inequality holds if τ1 = λ

 μ2 θ2



μ1  , θ1

where λ is a positive function of θ1 , θ2 , ρ1 , ρ2 . Hence, the mass of constituent 1 increases if μ2 /θ2 > μ1 /θ1 and vice versa. Equilibrium occurs if μ2 /θ2 = μ1 /θ1 . If the two phases are at the same temperature then equilibrium occurs when the two chemical potentials are equal, (9.76) μ1 = μ2 . We now derive some consequences of inequality (9.75) when several constituents occur. Look at the reaction r

α=1

να Bα 

r + p

α= r +1

να Bα .

To fix ideas we let the reaction proceed from left to right so that B1 , ..., Br are the reactants and Br +1 , ..., Br + p are the products. Hence define the stoichiometric coefficients {γα } as r + p α=1 γα Bα = 0. Of course we let r + p = n, the number of constituents. Let Mα be the molar mass and Nα the molar number density,12 or number of moles per unit volume. Hence we have Nα Mα = ρα . 12

The molar number density, or molar concentration, is usually denoted by cα in the chemical literature. We do not use the symbol cα to avoid any confusion with the mass fraction ρα /ρ, often denoted by cα within continuum mechanics.

576

9 Mixtures

Moreover, upon disregarding diffusion we can write the (approximate) relation ρ˙α = τα . Denote by  the extent of reaction or reaction coordinate. For definiteness we let  = 0 denote the setting with zero moles of products. Let Nˆ α denote the value of Nα as  = 0 and hence let Nˆ r +1 , ..., Nˆ r + p = 0. Since diffusion is ignored then the molar number densities change because of the chemical reaction only and hence Nα − Nˆ α = γα . Consequently we have

˙ τα = ρ˙α = γα Mα .

Substitution of τα in (9.75) results in 

α γα μα



˙ ≤ 0, 

where μα = Mα μα /θα is the chemical potential13 per mole. Hence it follows that ˙ = −χ 

 α

γα μα ,

(9.77)

 where χ is a non-negative valued function of {θα }, {ρα }. The vanishing of α γα μα determines the equilibrium between products and reactants and may be viewed as the law of mass action ([337], Chap. 6).  As a remark we observe that, by the conservation of mass, α τα = 0 and then we have  α γα Mα = 0.

9.5.2 Non-reacting Mixtures with Several Temperatures If τα = 0 then the growths mα = pˆ α have to be both Galilean invariant and with vanishing sum. The vector function mα =

 β

Mαβ (vβ − vα )

(9.78)

is apparently Galilean invariant. Moreover, provided Mαβ = Mβα , it follows that  α

13

mα =

Rescaled by the temperature θα .

 α,β

Mαβ (vβ − vα ) = 0

9.5 Constitutive Models for Fluid Mixtures

577

as a consequence of the symmetry of Mαβ and the skew symmetry of vβ − vα , with respect to the indices α, β. Different temperatures are likely to induce exchanges of energy. Hence we let eα =

 β

Nαβ (θβ − θα ) + wα ,

where Nαβ = Nβα and wα is a scalar function to be determined. Now, εˆα becomes εˆα =

 β

Nαβ (θβ − θα ) + wα −

 β

Mαβ (vβ − vα ) · vα .

Hence εˆα is invariant if so is wα −

 β

Mαβ (vβ − vα ) · vα .

Upon replacing vα by v + uα we conclude that wα may be chosen in the form wα =

 β

Mαβ (vβ − vα ) · v.

Hence we have 

eα =

Nαβ (θβ − θα ) +

β

 β

Mαβ (vβ − vα ) · v.

(9.79)

As expected, it follows that 

α eα

=

 α,β

Nαβ (θβ − θα ) +



α,β Mαβ (vβ

 − vα ) · v = 0.

Finally, we have to examine the thermodynamic requirement  α

(

1 1 eα − mα · vα ) ≥ 0. θα θα

Upon some rearrangements, we can write 

α

(

 (θβ − θα )2  uβ 1 1 uα eα − mα · vα ) = α 0. Consequently θβ > θα implies that thermal energy is transferred from the constituent at higher temperature (β) to the constituent at lower temperature (α).

9.5.3 Reacting Mixtures with a Single Temperature The reacting mixture is considered in the limit case when all of the constituents have the same temperature θ. In this case the analysis of thermodynamic consistency for the chemical  potentials {μα } and the growths {mα } is greatly simplified. Since α eα = 0, inequality (9.74) becomes 

α (−μα τα

− mα · uα + 21 τα vα2 ) ≥ 0.

Replacing mα with pˆ α + τα vα and vα with v + uα , we obtain 

α (−μα τα

− mα · uα + 21 τα vα2 ) =

The inequality



α [(μα



α [−(μα

+ 21 uα2 )τα − pˆ α · uα ].

+ 21 uα2 )τα + pˆ α · uα ] ≤ 0,

or similar ones [63], might suggest that thermodynamic models are considered with the separate inequalities 

α (μα

+ 21 uα2 )τα ≤ 0,

 α

pˆ α · uα ≤ 0.

Yet the literature shows that only relations around equilibrium are considered and hence at zero diffusion velocities uα . We rather think that physically sound models are those compatible with the separate conditions 

α μα τα

≤ 0,



ˆα α (p

· uα + 21 τα uα2 ) =



α (mα

· uα − 21 τα vα2 ) ≤ 0. (9.80)

To establish a thermodynamically consistent model, we take mα in the form mα =

 β

Mαβ (vβ − vα ) + wα ,

 α

wα = 0.

Substitution of mα results in   − α,β Mαβ (vβ − vα ) · vα − α (wα · uα − 21 τα vα2 ) ≥ 0.

9.5 Constitutive Models for Fluid Mixtures

579

As before, provided Mαβ = Mβα ≥ 0 we have   − α,β Mαβ (vβ − vα ) · vα = α 0, if effect of pressure on the extent of reaction. By (9.77), a reaction proceeds,   α

γα μα < 0.

According to (9.90) this is the case if  α

γα Mα μ0α (θ) + Rθ

 α

ln

pα < 0. p0

If we take the view, common in chemistry, that pα is the pressure p of the mixture times the mole fraction n α /n, nα pα = p , n then the inequality becomes (

 α

γα )Rθ ln

  p nα + Rθ( α γα ln ) < − α γα Mα μ0α (θ). p0 n

At a fixed set of mole numbers {n α } this inequality provides the values of p such that the reaction proceeds at a given temperature or the like for the values of θ at a  given pressure. If α γα > 0 then we have ln

 p 1 nα  [Rθ α γα ln + α Mα μ0α (θ)],  [Rθ α γα ln + α Mα μ0α (θ)], p0 | α γα | Rθ n

ln

which indicates that the reaction is favoured by high pressures. Temperature dependence of the chemical potential There are processes, like in phase transitions, where the pressure is kept constant. In these cases, it is of greater interest to evaluate the effect of temperature on the chemical potential. Let the pressure be constant, pα = pα0 . Then by (9.89) ηα takes the form R θ α θα 1  ln + ∫ ε (ζ)dζ − cα , ηα = M α θ 0 θ0 ζ α while (9.90) reduces to μα =

ξ 1 θ α  θα Rθα  1 − ln − ∫ dξ ∫ εα (ζ)dζ + cα (θα − θ0 ) + dα . Mα θ0 θ0 θ0 ζ

Hence, we have μα = −

θα 1 R ln θα θ0 − ∫ εα (ζ)dζ + cα = −ηα . Mα θ0 ζ

We assume ηα > 0 and then conclude that, at constant pressure, μα is a decreasing function of the temperature θ. A strictly analogous conclusion follows by looking at the mixture as a single body with T = − p1 and q = 0, k = 0. The entropy inequality is then taken in the form −ρψ˙ − ρη θ˙ − p∇ · v = 0. Since ∇ ·v =−

v˙ ρ˙ = ρ v

then the entropy relation can be written in the form −ψ˙ − η θ˙ − p v˙ = 0, whence

−φ˙ − η θ˙ + v p˙ = 0,

588

9 Mixtures

where φ = ψ + pv is the Gibbs free energy (per unit mass). At constant pressure ˙ φ˙ = −η θ,

φ = −η.

The positive value of η implies that φ is a decreasing function of θ.

9.6.2 Mixing Entropy and Mixing Free Energy Since ηα /ηα0 = ( pα / pα0 )(θα0 /θα ), by (9.89) we obtain the entropy ηα per unit mass in the form R θ α θα 1  R pα ln + ∫ εα (ζ)dζ − cα − ln 0 . ηα = M α θ 0 θ0 ζ Mα pα Assume the distribution is uniform, namely, the fields are independent of the position. Moreover, the mixture is subject to a common temperature, θ. The entropy per mole is then pα , Mα ηα = Rsα − R ln p0 where sα = ln

 θα M α  θα 1  ∫ ε (ζ)dζ − cα . + θ0 R θ0 ζ α

The entropy of the constituent α is n α Mα ηα . The total entropy, S, is the sum, S=R

 α

n α (sα − ln

pα ). p0

Since the mixture is subject to a single temperature θ and occupies the volume V then n α Rθ nα Rθ = n = xα p, pα = V n V   where n = α n α , xα = n α / β n β , and p is the (total) pressure. Consequently, the partial pressure pα , Rθ pα = x α p = , V /xα is the same as that obtained by confining the same gas, at temperature θ, in the volume V /xα . With this in mind we consider a conceptual experiment named after Gibbs. The volume V is partitioned in volumes {xα V }. The αth constituent, at the temperature θ, is placed within the volume xα V and hence is subject to the pressure

9.6 Mixtures of Ideal Gases

589

pα = n α

Rθ Rθ =n = p; xα V V

each gas is subject to the (same) pressure p, the total pressure of the previous derivation. Summing the corresponding entropies, we have Si = R



α n α (sα

− ln

p ). p0

Now let the gases diffuse uniformly in the volume V . At the end the system is the configuration of the previous case, with pα = xα p, and hence the entropy S f takes the form  xα p ). S f = R α n α (sα − ln p0 Consequently S f − Si = −R

 α

n α ln xα .

The difference S f − Si is the change of entropy due to diffusion and we call it mixing entropy. Since xα ≤ 1 then S f − Si is always positive. Remark 9.1 The mixing entropy depends only on the fractions {xα } and not on the properties of the gases. It might then be the case of the mixture of two portions of the same gas. A nonzero mixing entropy for the same gas is usually referred to as Gibbs’ paradox. Look now at the chemical potential (9.90). At a constant, common temperature θ we have R xα p μα = θ(1 + ln ) − γα , Mα p0 where

θ

ξ

θ0

θ0

γα = ∫ dξ ∫

1  ε (ζ)dζ − cα (θ − θ0 ) + dα . ζ α

For uniform mixtures, we compute the Gibbs free energy of the mixture as G i =  α n α Mα μα to obtain G i = Rθ



α n α (1

+ ln

xα p Mα γα ). − p0 Rθ

By arguing as with the entropy we find that the Gibbs free energy after mixing, G f differs from G i for the occurrence of p instead of xα p. Hence, it follows G f − G i = Rθ



αnα

ln xα .

Let H be the enthalpy, H = G + θS. It follows that

(9.91)

590

9 Mixtures

H f − Hi = G f − G i + θ(S f − Si ) = 0. For mixtures of ideal gases, the mixing enthalpy is zero. Intermolecular forces are mainly responsible of changes in the enthalpy of a mixture. Stronger attractive forces between the mixed molecules result in a lower enthalpy of the mixture.

9.7 Diffusion Diffusion means relative motion between constituents of a mixture. The motion of a mixture, and hence the relative motion between constituents, may be determined by solving Eqs. (9.24) or (9.29) along with appropriate boundary and initial conditions. Yet, there are more direct ways of modelling, and then determining diffusion phenomena.

9.7.1 Dynamic Diffusion Equation Models of diffusion are usually based on the balance equation (9.27) for the mass fraction ωα and a constitutive equation for the mass flux hα (see Sect. 9.4.2), though never via the right equation (9.69) for hα . More unusually, ωα is taken to satisfy an evolution equation [215]. In all cases, the resulting differential equation for ωα is strongly affected by constitutive assumptions. In this connection, we observe that the thermodynamic scheme for mixtures based on (9.42) does not involve the mass fluxes {hα }. Consistently here we find that, according to the balance equations, the evolution of the mass densities {ρα } is free from the mass fluxes {hα }. We start with the balance of mass and linear momentum, ∂t ρα = −∇ · (ρα vα ) + τα , ∂t (ρα vα ) + ∇ · (ρα vα ⊗ vα ) = ∇ · Tα + ρα bα + mα . Partial time differentiation of the first equation, divergence of the second one, and substitution of ∇ · ∂t (ρα vα ) yield ∂t2 ρα = ∇ · [∇ · (ρα vα ⊗ vα )] − ∇ · (∇ · Tα ) − ∇ · (ρα bα ) − ∇ · mα + ∂t τα . If the constituent is regarded as an inviscid fluid, Tα = − pα 1, it follows ∂t2 ρα = ∇ · [∇ · (ρα vα ⊗ vα )] + pα − ∇ · (ρα bα ) − ∇ · mα + ∂t τα . We let pα = pα (ρα , θα ) and hence

9.7 Diffusion

591

∇ pα = ∂ρα pα ∇ρα + ∂θα pα ∇θα . In the linear approximation, we neglect ∇ · [∇ · (ρα vα ⊗ vα )] and let pα = ∂ρα pα ρα + ∂θα pα θα to obtain ∂t2 ρα = ∂ρα pα ρα + ∂θα pα θα − ∇ · (ρα bα ) − ∇ · mα + ∂t τα . If, further, the temperature is assumed to be uniform, ∇θα = 0, then ∂t2 ρα = ∂ρα pα ρα − ∇ · (ρα bα ) − ∇ · mα + ∂t τα .

(9.92)

Equation (9.92) is the αth equation of a system where ∇ · mα and ∂t τα account for possible coupling terms. Since ρα = ρωα , if ρ is assumed to be constant then we have ∂t2 ωα = ∂ρα pα ωα − ∇ · (ωα bα ) + (1/ρ)[∂t τα − ∇ · mα ].

9.7.2 Fourier-Like and Rate-Type Diffusion Equations The simplest and best known model of diffusion traces back to Fick [170] and is based on a strict analogy with the Fourier model of heat conduction. It is referred to the pertinent concentration (say mass fraction) ωα and is expressed by the differential equation (9.93) ∂t ωα = ∇ · (Dα ∇ωα ) + ζα , where the parameter Dα is the diffusivity and ζα is related to the αth mass growth. Equation (9.93) and some generalizations are derived as follows. Consider Eq. (9.27). If we take Fick’s law (approximation) for hα in the form hα = −κα ∇ωα ,

(9.94)

κα being a constant, we obtain Eq. (9.93) with Dα = κα /ρ and ζα = τα /ρ. If κα , and hence Dα , is not constant then (9.93) has to be generalized as ω˙ α =

1 ∇ · (κα ∇ωα ) + ζα . ρ

Other well-known models stem from the balance equation (9.27) along with an appropriate assumption on the diffusion flux hα . Borrowing from the Ginzburg– Landau scheme [215] we let

592

9 Mixtures

ψα (ωα , ∇ωα ) = f α (ωα ) + 21 λα |∇ωα |2

(9.95)

be the (Helmholtz) free energy of the αth constituent. Hence we take hα = −κα ∇ μˆ α , where μˆ α is the variational derivative of ψα , μˆ α = δωα ψα := ∂ωα ψα − ∇ · ∂∇ωα ψα , then, if λα is constant,

μˆ α = f α − λα ωα .

Substitution of hα in (9.27) results in ω˙ α =

1 λα 2 ∇ · (κα f α ∇ωα ) −  ωα + ζα . ρ ρ

(9.96)

Equation (9.96) is usually referred to as the Cahn–Hilliard equation [75, 76]. It is a fourth-order partial differential equation. If the dependence of μˆ α on ∇ωα is ignored then the Cahn–Hilliard equation reduces formally to the classical secondorder equation (9.93). Instead of applying the balance equation (9.27) we might take the view that the evolution of ωα is in fact a relaxation toward equilibrium governed by a parameter βα > 0 through (9.97) βα ω˙ α = −δωα ψα . If we assume again ψα in the form (9.95) then (9.97) reads βα ω˙ α = −∂ωα ψα + ∇ · ∂∇ωα ψα .

(9.98)

Equation (9.98) is a second-order partial differential equation even though the dependence of ψα on ∇ωα is allowed; it is referred to as Ginzburg–Landau equation [87, 269] or also Allen–Cahn equation [6]. The lower order relative to the Cahn–Hilliard equation (9.96) is interpreted by saying that (9.98) describes the ordering of atoms within unit cells on a lattice whereas (9.96) describes the transport of atoms between unit cells. Equations (9.93), (9.96), and (9.98) are well-renowned models in the literature of diffusion. Their consistency with thermodynamics is investigated in Sect. 2.7. Both Fick’s law (9.94) and the evolution equation (9.97) are approximations to (9.69) and (9.27). In a genuine thermodynamic theory of mixtures, Sect. 9.5, there is no need of approximations to the evolution equation for ωα . This need occurs when we approximate the behaviour of a mixture with properties of the whole body, Sect. 9.10. Diffusion and Boltzmann–Matano method

9.7 Diffusion

593

The homogeneous one-dimensional version of the diffusion equation (9.93) is sometimes investigated through the Boltzmann–Matano method. The mass fraction14 ω is taken to satisfy the homogeneous equation ∂t ω = ∂x (D(ω)∂x ω). √ It is a basic assumption (of the method) that ω depends on x and t via ξ = x/2 t. Hence ξ 1 ∂ x ω = √ ∂ξ ω ∂t ω = − √ ∂ξ ω, 2 t 2 t and then

ξ 1 1 − √ ∂ξ ω = ∂x (D(ω) √ ∂ξ ω) = √ (D(ω)∂ξ ω). 2 t 2 t 2 t

Thus ω satisfies the ordinary differential equation −ξ∂ξ ω = ∂ξ (D(ω)∂ξ ω) for the function ω(ξ). A trivial solution is given by ∂ξ ω = 0, that is, when ω(ξ) is constant. √ This is interpreted by saying that a concentration front propagates with x ∝ t or t ∝ x 2 . √ √ Now, if ξ = x/2 t then 1/2 t = ξ/x and hence we cannot view t as independent of x any longer. Rather we have 1 1 1 ξ ∂x √ = − 2 = − √ x x2 t 2 t and the differential equation can be written as −ξ∂ξ ω = ∂ξ (D(ω)∂ξ ω) −

1 D(ω)∂ξ ω. x

The trivial solution with ∂ξ ω = 0 still holds.

9.7.3 Maxwell–Stefan Diffusion Model The literature shows various equations, under the name of Maxwell–Boltzmann model, which have been developed as approximations to the Boltzmann equation [307, 399]. For definiteness, we take the Maxwell–Stefan (MS) model as that arising from the assumption

14

In this context, the mass fraction is most often denoted by c.

594

9 Mixtures

  Dα Dβ mα = − β Mαβ ( − )∇ ln θ + β Mαβ (vβ − vα ), ρα ρβ where Mαβ =

(9.99)

xα xβ . Dαβ

The quantities Dαβ are usually referred to as MS diffusion coefficients while xα is the αth mole fraction. It is required that Dαβ = Dβα so that Mαβ = Mβα . By the symmetry of Mαβ and the skew symmetry of Dα /ρα − Dβ /ρβ and vβ − vα , relative to α and β, the model satisfies the required constraint on the growths {mα }, 

α mα

  Dα Dβ = − αβ Mαβ ( − )∇ ln θ + αβ Mαβ (vβ − vα ) = 0. ρα ρβ

The mass growths are zero and the constituents are subject to the same temperature. Consistency with thermodynamics is then guaranteed if the separate inequalities 

α (Tα

+ pα 1 + ρα ∇ρα ⊗ ∂∇ρα ψα ) · Dα ≥ 0, 

α (mα

1 · vα + qα · ∇θ) ≤ 0 θ

(9.100)

hold. Only (9.100) is of interest for the MS model. Observe  α,β

Mαβ (vβ − vα ) · uα =

 α,β

Mαβ (uβ − uα ) · uα = − 21



Mαβ (uβ − uα )2 ,

α,β

which is non-positive provided only that Mαβ ≥ 0. Upon substitution of mα we then require that  Dα Dβ 1 − α,β Mαβ ( − )∇ ln θ · uα + qα · ∇θ ≤ 0. ρα ρβ θ α This inequality holds if qα = −κα ∇θ + [



β Mαβ (

Dα Dβ − )]uα . ρα ρβ

The first term is the classical Fourier-type constitutive function, subject to κα ≥ 0. The second one is uncommon in the literature and shows that the (partial) heat conduction qα is induced by the diffusion velocity uα . As a comment we observe that ωα ωβ (vβ − vα ) = ωα ωβ (uβ − uα ) =

ωα ωβ hβ hα 1 ( − ) = (ωα hβ − ωβ hα ). ρ ωβ ωα ρ

9.7 Diffusion

595

That is why the term Mαβ (vβ − vα ) is often written in the form ωα hβ − ωβ hα .

9.7.4 Coupled Effects Among Diffusion and Heat Conduction Fluid mixtures may show coupled effects such as the motion originated by a temperature gradient (Soret effect) or heat conduction affected by a gradient of mass density (Dufour effect). Here we investigate possible constitutive models allowing for coupled effects in mixtures and then generalizing the scheme of Sect. 9.5.2. We consider a non-reacting mixture (τα = 0) with several temperatures and let ρα , θα , Dα , ∇ρα , ∇θα be the set of variables of the functions ψα , ηα , Tα , qα . The interaction terms {τα }, {mα }, {eα } are allowed to depend on all of the variables {ρβ }, {θβ }, {Dβ }, {∇ρβ }, {∇θβ }. The stress Tα is assumed to have the classical form Tα = − pα (ρα , θα )1 + T α where T α → 0 as Dα → 0. Moreover εˆα is independent of {Dβ }. To save writing we let ψα depend only on ρα and θα . Moreover, we will see that the extra-entropy fluxes are zero. We let ψα be differentiable. Substituting ψ` α into the entropy inequality (9.43) we find 

α{

1 1 [−ρα (∂θα ψα + ηα )θ` α + (ρ2α ∂ρα ψα − pα )∇ · vα + τα θα ηα + εˆ α + T α · Lα − qα · ∇θα ]} ≥ 0. θα θα

The arbitrariness of θ`α , ∇ · vα , Wα = skwLα implies that ηα = −∂θα ψα ,

pα = ρ2α ∂ρα ψα ,

T α ∈ Sym.

The reduced inequality 

α{

1 1 [εˆα + T · Dα − qα · ∇θα ]} ≥ 0 θα θα

is investigated in connection with particular constitutive functions. Let T α = 2μα Dα + λα tr Dα 1, mα = pˆ α = εˆα =





β Nαβ (θβ

β [ξαβ (vβ

− θα ) −

qα = −κα ∇θα ,

− vα ) + γαβ ∇θβ ],



β Mαβ (vβ

− vα ) · uα .

A dependence of T α on Wα is not considered because of objectivity. We observe that the constraint

596

9 Mixtures

0=



α mα



=

holds for ξαβ = ξβα and

α,β [ξαβ (vβ



α γαβ

− vα ) + γαβ ∇θβ ]

= 0.

This condition seems unrealistic in that different constituents would have opposite behaviours15 (e.g. γ1β > 0, γ2β < 0). Replacing ∇θβ with ∇θβ − ∇θα , and hence γαβ = γβα , would be of no interest in that of no effect in mixtures with a single temperature. Hence we let γαβ = 0. The constraint 0=



α eα =



α εˆ α + mα · vα =



α,β Nαβ (θβ − θα ) +



α,β (vβ − vα ) · (ξαβ vα − Mαβ uα )

implies that Nαβ = Nβα ,

ξαβ = Mαβ .

The reduced inequality then becomes 

1 

2 2 β Nαβ (θβ − θα ) − Mαβ (uβ − uα ) · uα + 2μα Dα · Dα + λ(tr Dα ) + κα |∇θα | ]} ≥ 0

α{ θ [ α

whence the classical inequalities μα ≥ 0, 2μα + 3λα ≥ 0, κα ≥ 0 follow. Moreover we observe 

1 

α{ θ [ α

β Nαβ (θβ − θα ) − Mαβ (uβ − uα ) · uα ]} =



α 0. For binary mixtures, mα = ξρα ρβ (uβ − uα ) = ξρα hβ − ξρβ hα = −ξρhα . In stationary conditions, we neglect ρα v` α and hence Eq. (9.101) can be written as hα = −

∂θ α p α ∂ρ p α 1 ∇θα − α ∇ρα + fα ξρ ξρ ξρ

(9.102)

thus ascribing to ∂θα pα /ξρ the meaning of thermodiffusion coefficient. In general, the nonzero diffusion flux hα comprises the contribution −

∂θ α p α ∇θα ξρ

which accounts for the Soret effect. By (9.56) the heat flux vector q can be given the form q=



α [qα

− uαT α + (εα + 21 uα2 + ρα ∂ρα ψα )hα ].

Let χ˜ α = εα + 21 uα2 + ρα ∂ρα ψα ; hence χ˜ α is the standard enthalpy density plus the 21 uα2 term. Now, qα = −κα ∇θα , κα > 0 implies qα · ∇θα ≤ 0 and then is consistent with the entropy inequality. Upon substitution of hα we then find the model equation q=



α [−(κα

+

χ˜ α χ˜ α χ˜ α ∂θ pα )∇θα − ∂ρ pα ∇ρα + fα − uαT α ]. ξρ α ξρ α ξρ

The Dufour effect, that is, the effect of concentration (or mass density) gradients on the heat flux, is then represented by  χ˜ α − α ∂ρα pα ∇ρα . ξρ

9.7.5 Evolution Problems A dynamic problem of a mixture is governed by the system (9.24) of partial differential equations in the unknowns ρ1 , ..., ρn , v1 , ..., vn , θ1 , ..., θn .

9.7 Diffusion

599

The stresses {Tα }, the heat fluxes {qα }, the internal energies {εα }, and the supplies {τα }, {pˆ α }, {εˆα } are supposed to be given by the thermodynamically consistent constitutive functions. In particular, if εα = εα (θα , ρα ) then the balance equations ρα ε`α = Tα · Lα − ∇ · qα + ρα rα − mα · vα − τα (εα − 21 vα2 ) + eα ,

(9.103)

where α = 1, ..., n, take the form 2) + e . ρα ∂θα εα θ` α + ρα ∂ρα εα ρ` α = Tα · Lα − ∇ · qα + ρα rα − mα · vα − τα (εα − 21 vα α

(9.104) This makes apparent the structure of (9.24) as a system for {ρα }, {vα }, and {θα }. Moreover, once the fields {ρα }, {vα }, {θα } are found, the diffusion fluxes {hα } are determined by solving (9.69). Very often, however, it is physically sound, or at least convenient, to let the mixture be subject to a single temperature θ. Then the balance equations might seem redundant in view of the n equations of energy involving the single temperature. Now the singletemperature assumption is in fact a constraint, on the a priori distinct temperatures θ1 , ..., θn , which is realized by appropriate exchanges of energy {eα } among the constituents. Hence, we can regard the balance equations (9.103) as a system of n equations in the n unknowns θ, e1 , ..., en−1 ,  the growth en being given by the a priori constraint α eα = 0. This view overcomes naturally the seeming closure problem connected to n balance equations in the single unknown θ. Indeed, if εα = εα (θ, ρα ) then (9.103) reads 2)+e . ρα ∂θ εα (∂t θ + vα · ∇θ) + ρα ∂θ εα ρ` α = Tα · Lα − ∇ · qα + ρα rα − mα · vα − τα (εα − 21 vα α

Summation over α and account of the constraint 

α [ρα ∂θ εα (∂t θ + vα · ∇θ) + ρα ∂θ εα ρ` α ] =





α eα

= 0 imply

1 2 α [Tα · Lα − ∇ · qα + ρα rα − mα · vα − τα (εα − 2 vα )],

which can be viewed as the equation associated with the unknown temperature θ. Once θ is determined, equations 2) eα = ρα ∂θ εα (∂t θ + vα · ∇θ) + ρα ∂θ εα ρ` α − Tα · Lα + ∇ · qα − ρα rα + mα · vα + τα (εα − 21 vα

determine the required energy growths that guarantee the common value of the temperature fields for the n constituents.

600

9 Mixtures

If the diffusion velocities are disregarded, in that ˙ ∂t θ + vα · ∇θ  ∂t θ + v · ∇θ = θ, then we have 

˙ α ρα ∂θ εα θ =



1 2 α [−ρα ∂θ εα ρ` α + Tα · Lα − ∇ · qα + ρα rα − mα · vα − τα (εα − 2 vα )],

thus showing that



α ρα ∂θ εα

= ∂θ



α ρα εα

is the equivalent (approximate) heat capacity of the mixture. Sometimes a mean temperature is considered for mixtures with multiple temperatures. Going back to (9.104) and following the approximation θ`α  θ˙α , we can write 

˙ α ρα ∂θ εα θα =



1 2 α [−ρα ∂θ εα ρ` α + Tα · Lα − ∇ · qα + ρα rα − mα · vα − τα (εα − 2 vα )].

(9.105)

Letting θ be defined by  1 ˙ θ˙ =  α ρα ∂θ εα θα β ρβ ∂θ εβ we can write (9.105) as 

α ρα ∂θ εα

θ˙ =



`α α [−ρα ∂θ εα ρ

+ Tα · Lα − ∇ · qα + ρα rα − mα · vα − τα (εα − 21 vα2 )]

and regard θ so defined as a convenient definition of mean temperature.

9.8 Solid Mixtures The balance equations derived in Sect. 9.2 hold for any mixture irrespective of the nature of the constituents except for those with internal structure (e.g. micropolar continua). Yet a remark is in order about the mass density α in the reference configuration Rα . By (9.35) we conclude that in non-reacting mixtures ρα =

α ; Jα

9.8 Solid Mixtures

601

therefore, ρα is determined by Jα , and hence by Fα , because α is a constant (at any Xα ). Instead, in reacting mixtures α is time dependent and hence ρα is dependent Jα . Equations (9.35) and (9.36) motivate the insertion of the reference mass density

α (or of the mass density ρα ) among the set of independent variables for the αth constituent. In single bodies, we have

= ρJ,

˙ = 0,

so that is a constant (in time) possibly dependent on the point X in the reference configuration. Hence, at any point X, the knowledge of J = det F produces ρ to within the constant . We then regard ρ as determined by F. For a constituent, instead, ρα is not determined by Jα in that, by (9.35), α depends on t because of the chemical reaction. Consequently, Jα and α , or Jα and ρα are independent of each other. For definiteness, we select α and Fα as independent variables.18 To our mind this choice makes it more direct the passage from reacting to non-reacting mixtures.

9.8.1 Constitutive Assumptions and Thermodynamic Restrictions Again we follow the principle of phase separation. For any constituent α, we consider as independent variables the temperature θα , the reference mass density α , and the deformation gradient Fα , along with their gradients in the reference configuration. Hence we let the free energy ψα , the entropy ηα , the stress Tα , the heat flux qα , and the extra-entropy flux kα be functions of the set of variables α := (θα , α , Fα , ∇Xα θα , ∇Xα α , ∇Xα Fα ). The interaction terms τα , pˆ α , εˆα are functions of 1 , 2 , ..., n , u1 , u2 , ..., un , vα − v1 , vα − v2 , ..., vα − vn . Moreover, the functions {ψα } are differentiable while the remaining functions are continuous. By Sect. 9.3.2, the second law requires that ψα be independent of ∇Xα θα and moreover ηα = −∂θα ψα . Hence inequality (9.43) becomes  1 [−ρα ∂ α ψα ` α − ρα ∂Fα ψα · F` α − ρα ∂∇Xα α ψα · ∇Xα ` α − ρα ∂∇Xα Fα ψα · ∇Xα F` α α θα 1 +τα θα ηα + Tα · Lα − qα · ∇θα + εˆ α + θα ∇ · kα ] ≥ 0, θα 18

This approach is developed in [327].

602

9 Mixtures

where use is made of the commutation property (9.7). In view of (9.1) and (9.36), we can write  1 [−ρα ∂ α ψα Jα τα − ρα ∂Fα ψα · (Lα Fα ) − ρα ∂∇Xα α ψα · ∇Xα (Jα τα ) + τα θα ηα α θα 1 −ρα ∂∇Xα Fα ψα · (∇Xα (Lα Fα )) + Tα · Lα − qα · ∇θα + εˆ α + θα ∇ · kα ] ≥ 0. θα

(9.106) The field ∇Xα (Lα Fα ), at any point x and time t, can take arbitrary third-order tensor values and meanwhile the balance equations hold. This is so because εα = ψα − θα ∂θα ψα = εα (θα , α , ∇Xα α , ∇Xα Fα ) and ∇Xα (Lα Fα ) enters the balance of energy, ρα ∂∇Xα Fα εα · ∇Xα (Lα Fα ) + ... = Tα · Lα − ∇qα + ρα rα + εˆα , the dots indicating terms independent of ∇Xα (Lα Fα ). The conceptual arbitrariness of rα allows ∇Xα (Lα Fα ) to be chosen arbitrarily while the balance of energy holds. Hence inequality (9.106) holds only if ∂∇Xα Fα ψα = 0 and hence ψα = ψα (θα , α , Fα , ∇Xα α ). The linear dependence of (9.106) on Lα allows us to conclude that Tα is given by Tα = ρα ∂Fα ψα FαT .

(9.107)

Inequality (9.106) then simplifies to  1 [−ρα ∂ α ψα Jα τα − ρα ∂∇Xα α ψα · ∇Xα (Jα τα ) α θα 1 +τα θα ηα − qα · ∇θα + εˆα + θα ∇ · kα ] ≥ 0. θα

(9.108)

We now establish sufficient conditions for (9.108) to hold. By (9.4), upon some rearrangements we have 1 1 ρα ∂∇Xα α ψα · ∇Xα (Jα τα ) = ρα ∂∇Xα α ψα FαT ∇(Jα τα ) θα θα

α τα ρα =∇ ·( Fα ∂∇Xα α ψα ) − Jα τα ∇ · ( Fα ∂∇Xα α ψα ). θα θα

9.8 Solid Mixtures

603

Consequently we take kα =

α τα Fα ∂∇Xα α ψα . θα

Let μα = ψα + α ∂ α ψα − θα Jα ∇ · (

ρα Fα ∂∇Xα α ψα ) θα

(9.109)

and recall that εˆα = eα − mα · vα + 21 τα vα2 − τα εα . We can then write inequality (9.108) in the form  1 1 [−μα τα + eα − mα · vα + 21 τα vα2 − qα · ∇θα ] ≥ 0. α θα θα

(9.110)

Though cross-coupling effects may occur, it is likely that no significant case is lost by letting separate inequalities hold in the form  μα τα ≤ 0, α θα

 1 [eα − mα · vα + 21 τα vα2 ] ≥ 0, α θα



1 ≥ 0. θα (9.111) In the particular case where the constituents are subject to the same temperature θ these relations become 

α μα τα

≤ 0,



ˆα α [p

· uα + 21 τα uα2 ] ≤ 0,



(

α qα )

α qα

·∇

· ∇θ ≤ 0.

(9.112)

Some comments are in order. First,

α ∂ α ψα is the analogue of pα /ρα in fluids, with α instead of ρα . Accordingly, μα as defined by (9.109) can be viewed as the chemical potential,as for fluids, with a correction due to the dependence of ψα on ∇Xα α . Consistently, α τα μα /θα governs the evolution of chemical reactions where the constituents have different temperatures. The constitutive equation for the stress Tα is subject to the principle of objectivity which asserts that the constitutive equations be independent of the observer.19 We apply to any constituents the argument developed for a single body (Sect. 3.2). Under a change of observer, such that χα (Xα , t)

−→

Q χα (Xα , t)

the free energy ψα , as well as θα and ηα , is invariant while the deformation gradient is subject to Fα −→ Q Fα ,

19

See Sect. 3.2 and [428], Sects. 17–19.

604

9 Mixtures

for any α = 1, ..., n. Hence objectivity requires that ψα (θα , α , Fα , ∇Xα α ) = ψα (θα , α , Q Fα , ∇Xα α ). By the polar decomposition theorem, we can write Fα = Rα Uα , where Rα is the rotation of Fα and Uα is the stretch. Hence choose Q = RαT and observe RαT Fα = RαT Rα Uα to obtain ψα (θα , α , Fα , ∇Xα α ) = ψα (θα , α , Uα , ∇Xα α ). Alternatively, we might take ψα as a function of the αth right Cauchy–Green strain tensor Cα = Uα2 = FαT Fα . Since ∂Fα ψα (Cα ) = 2Fα ∂Cα ψα (Cα ), by (9.107) it follows that Tα = 2Fα ∂Cα ψα FαT . Hence, the stress in elastic constituents is symmetric. Remark 9.2 By analogy with the observations in Sect. 5.5, we claim that the dependence of ψα on ∇Xα θα and ∇Xα Fα is admissible provided kα , as well as the other constitutive functions, depend on θ`α and F` α . In that case, we would find that  ρα   θα ∂∇ F ψα FαT , Tα = ρα ∂Fα ψα − ∇Xα · ρα θ α Xα α Now that we have established the constitutive equations in the Lagrangian description, it is worth looking at corresponding approximate equations in the Eulerian description. We have

α = ρα Jα and Fα  ∂xα (xα + uα ) = 1 + ∂xα uα ,

Jα  1 + ∇ · uα ,

Hence, letting Hα  ∂xα uα ,

Eα  21 (Hα + HαT ),

α  ρα (1 + ∇ · uα ).

9.9 Immiscible Mixtures

605

we have Cα = 1 + HαT + Hα + HαT Hα  1 + 2 Eα , the approximation being reasonable inasmuch as |∂xα uα |  1. Since ∂Eα ψα = 2 ∂Cα ψα , in the linear approximation we have ψα = ψα (θα , ρα , Eα ) and Tα = ∂Eα ψα .

9.9 Immiscible Mixtures An immiscible mixture is one in which the individual constituents of the mixture remain physically separate on some scale. This local separation of the constituents can be incorporated into the construction of a theory by means of appropriate constitutive assumptions. In [139], the immiscibility postulate is considered so that the free energy of a given constituent is assumed to depend only upon constitutive variables associated with that constituent. Here this view is adopted for the constitutive functions that are not related to interaction terms (τα , mα , eα ) irrespective of the immiscibility property. To our mind, at the scale of interest, any region of the body contains separate parts of the constituents. The (second) mathematical property of immiscible mixtures, modelling the local microstructure, is that the constitutive equations include a dependence on the volume fraction [62]. Now there is a decisive difference between the model of incompressible constituents and that of compressible ones. Let the constituents be incompressible, which means that the true density ρ0α of any constituent is constant. By

α = ρα Jα = ρ0α ϕα Jα it follows that

α = ρ0α Jα ϕα

whence two variables among α , Jα , ϕα are independent. This means that for solid constituents we may use Fα , ϕα as independent variables. Moreover, because

606

9 Mixtures

J`α = Jα ∇ · vα ,

ρ`α = −ρα ∇ · vα + τα

then

`α = Jα τα .

(9.113)

Equation (9.113) shows that α is constant for non-reacting constituents and is a possible independent variable, in place of ϕα , for reacting constituents. The continuity equation for ρα = ϕα ρ0α results in ϕ` α = −ϕα ∇ · vα +

1 τα . ρ0α

(9.114)

Equation (9.114) is the evolution equation for ϕα and shows how ϕα is affected by the motion (via ∇ · vα ) and the chemical reaction (via τα ). Hence, for incompressible constituents, the evolution equation for ϕα is known and is equivalent to the continuity equation for the mass density ρα . Things are no longer easy for compressible constituents. Though we can write again ρα = ϕα ρ0α , the peculiar time derivative and use of the continuity equation yields ϕ` α = −ϕα ∇ · vα +

ρ`0 1 τα − ϕα α0 . 0 ρα ρα

(9.115)

Equation (9.115) is not operative in that we cannot determine ρ`0α . We might take the approximation that ϕα ρ`0α /ρ0α is negligibly small. In such a case (9.115) reduces to (9.114), the evolution equation for the volume fraction of incompressible constituents. In [62], immiscible compressible constituents are modelled by letting ϕ` α be given by a constitutive equation, ϕ` α = ϕˆ α (...), the dots representing the independent variables. If the free energy is taken in the form ψα = ψα (ϕα , ...) then, depending on the variables represented by the dots, the entropy inequality may lead to the reduced inequality 

α ρα ∂ϕα ψα

Hence ϕˆ α is required to be of the form

ϕˆ α ≤ 0.

9.10 Entropy Inequality and Models for the Whole Mixture

607

 ϕˆ α = − β αβ ρβ ∂ϕβ ψβ , αβ being a positive semi-definite tensor. This in turn indicates that the function ϕˆ α is related to the derivatives ∂ϕβ ψβ . We might then obtain ϕˆ α by an appropriate assumption on ∂ϕβ ψβ . However, the correct equation for ϕα is (9.115), which is of dynamic character and any constitutive equation may be viewed as an approximation for ϕ` α as, e.g. (9.114).

9.10 Entropy Inequality and Models for the Whole Mixture

Though the balance equations of a mixture are required to be the same as those for a single body [427], practical applications to constitutive models may show significant differences. It is worth investigating how different constitutive restrictions follow from different ways of accounting for the underlying structure (mixture) of the whole body.

9.10.1 Gibbs Equations and Restrictions on the Diffusion Flux Classical results follow from the assumption of the so-called Gibbs equation [61] in the form (9.85),  p θη˙ = ε˙I − 2 ρ˙ − α μα ω˙ α , ρ  where p = α pα is the pressure on the mixture and μα is the αth chemical potential. This equation amounts to assuming that η is a function of ε I , ρ, and {ωα } with the obvious relation for the partial derivatives. As shown in Sect. 9.5.4, the Gibbs equations follow from the relations ηα = −∂θ ψα ,

pα = ρ2α ∂ρα ψα ,

which are obtained as a consequence of the second-law inequality. Sometimes20 the internal energy ε is considered instead of the inner part ε I seemingly for technical convenience. By (9.54), the two energies are related by ε = εI +

2 1 α ωα uα , 2

the difference being due to a diffusion term. Hence, 20

See, e.g. [118].

608

9 Mixtures

ρη˙ =

 μα ρ ρ p ε˙ − (ωα uα2 )˙− ρ˙ − α ρω˙ α . θ 2θ α ρθ θ

(9.116)

We now show how entropy inequalities given in the literature follow from the Gibbs equation. As we see in a moment, a procedure is developed where the properties of the mixture as a whole (T, q, ε, j) are connected to properties of the constituents (ωα , hα , μα ). As to hα we observe that (9.69) might be regarded as a balance equation which expresses the derivative h˙ α in terms of hα and Tα , T, uα , bα , L, mα , τα v. Irrespective of specific constitutive assumptions we now examine the entropy inequality and look for sufficient conditions. Entropy inequalities follow from (9.59), ρη˙ + ∇ · j − ρs ≥ 0, by replacing ρη˙ from (9.116). Substitution of ρε˙ and ρω˙ α from (9.55) and (9.27) allows (9.59) to be written in the form  μα  μα 1 ρ 1 2 (T + p1) · D − 21 ∇ · hα − α τα − ∇ · q α (ωα uα )˙+ α θ θ θ θ θ ρr I 1 + + bα · hα + ∇ · j − ρs ≥ 0. θ θ α Upon some rearrangements, we obtain 1 ρ 1  rI 2 (T + p1) · D − 21 − s) α (ωα uα )˙+ ∇ · [j + ( α μα hα − q)] + ρ( θ θ θ θ 1  1 μα 1 +q · ∇ − α hα · (∇ − bα ) − μα τα ≥ 0. θ θ θ θ α (9.117) We first ignore



2 α (ωα uα )˙.

The entropy flux j is taken in the form j=

If we let ρs = (ρr I +



α bα

 μα 1 q − α hα . θ θ

· hα )/θ then the thermodynamic requirement is

(T + p1) · D + q · ∇

1 1  μα − α hα · ∇ − μα τα ≥ 0. θ θ θ α

(9.118)

Depending on the constitutive assumptions, inequality (9.118) allows for crosscoupling terms. Yet, sufficient conditions for the validity of (9.118) are given by the non-negative valuedness of each term. So (T + p1) · D ≥ 0 allows for viscosity effects whereas q · ∇(1/θ) ≥ 0 models heat conduction. Moreover, α μα τα ≥ 0 governs the evolution of the constituents in chemical reactions. The remaining inequality μα hα · ∇ ≤0 θ shows the requirement on the diffusion flux hα .

9.10 Entropy Inequality and Models for the Whole Mixture

609

If instead ρs = ρr I /θ then the requirement is hα · (∇

μα 1 − bα ) ≤ 0. θ θ

In both cases, the approach indicates for hα a Fick-like constitutive dependence hα = −κα (∇

μα 1 − bα ) θ θ

hα = −κα ∇

or

μα , θ

κα ≥ 0.

(9.119)

More involved relations follow if the term (ωα uα2 )˙ is taken into account. We observe that Eq. (9.119) commonly occurs in the literature though they are slightly contradictory in that hα is given by the differential equation (9.69).

9.10.2 Gibbs–Duhem Equation Restrict attention to homogeneous mixtures. Let n α be the number of moles of the αth constituent and let G, S, V, p be the Gibbs free energy, the entropy, the volume, and the pressure of the whole mixture. In classical thermodynamics, the chemical potential of the αth constituent is defined as the αth molal Gibbs function. To avoid formal contradictions denote by μα the chemical potential so defined. Let G(η, p, {n α }) be the Gibbs free energy function and define μα as μα = ∂n α G. This does not imply that (see [316], Chap. 11) G=



α n α μα

(9.120)

but this is so if μα is independent of the mole numbers {n α }. Now, G˙ = ∂θ G θ˙ + ∂ p G p˙ + and, in view of (5.15),

G˙ = −S θ˙ + V p˙ +



˙α α μα n



˙ α. α μα n

(9.121)

As a consequence of (9.120), we have G˙ = Hence we find that





˙ +

α n α μα



˙ α. α μα n

˙ = −S θ˙ + V p. ˙

α n α μα

(9.122)

610

9 Mixtures

Equation (9.122) is usually referred to as Gibbs–Duhem equation. It determines the evolution of the chemical potentials {μα } in terms of the time dependence of θ and p. Two remarks are in order about the Gibbs–Duhem equation. First, Eq. (9.122) is based on the assumption (9.120), which amounts to regarding μα as a function of θ and p, independent of the mole numbers. This assumption is motivated in [169] for dilute solutions, which means that the derivatives ∂n α G are evaluated at n α = 0. In general, we have to regard μα as dependent on the mole numbers and hence the function G in (9.120) is nonlinear with respect to the n α ’s. Yet, (9.122) looks as a reasonable approximation in that we can write V =



αnαvα,

S=



α nα ηα ,

where the molal volume v α and the molal entropy η α are regarded as functions of θ and p. Hence all terms in (9.122) are linear in the mole numbers n α . Second, divide (9.121) by the mass M of the mixture. Hence we have 1 1 μ n˙ α . φ˙ = −η θ˙ + p˙ + ρ M α α Now,

n α Mα ωα =  , β n β Mβ

where Mα is the αth mole mass. Upon differentiation,  n˙ α Mα n α Mα −  n˙ β Mβ . ω˙ α =  2 ( β n β Mβ ) β n β Mβ β

This proves the inequivalence of (9.87) and (9.121).

Chapter 10

Micropolar Media

There are materials with internal structure (e.g. composites, polymers, liquid crystals, soil, and bone) for which a reasonable model should view the points no longer as purely geometric in character but as rigid-body particles or even deformable continua. Models accounting for rigid-body particles are within the theories of micropolar media; those allowing for the deformation of particles are regarded as micromorphic media. The deformation of the micropolar continuum is described by ascribing, to each particle of the continuum, the position vector, and three orthonormal vectors (called directors) that describe the translations and the orientation changes of the particles. The balance laws of mass, linear momentum, angular momentum, and energy are derived by allowing for an orientational momentum, a body couple density, and a surface couple density. Next some constitutive models are developed, in particular some thermoviscous micropolar media. As a relevant topic, the micropolar model is applied to the description of liquid crystals, mainly the nematics. Mixtures of micropolar constituents are considered; the second law and the growth terms show interesting modelling subjects. As an application of the mixture theory, the model for nanofluids is established where a (solute) constituent has a natural microplar character.

10.1 Kinematics of Micropolar Media For continua with significant microstructure contribution the classical Cauchy’s theory of continuum mechanics is inadequate. At the end of the 19th century a generalization of the continuum model was set up by considering the rotational degrees of freedom of material particles. This material is named micropolar continuum; it is also named Cosserat continuum since the leading ideas are given in a monograph © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_10

611

612

10 Micropolar Media

of the brothers Eugéne and François Cosserat. After the book by E. and F. Cosserat [107], linear and nonlinear theories have been developed; [149, 212, 222, 313, 421] and the book [151] show various steps of the theory and a general presentation of the subject. A micropolar continuum is a continuous distribution of oriented material points (or particles). A material point is characterized by the position and the space orientation. As with any continuum, a point is identified by the position vector X in a chosen reference configuration R. The motion of a point is characterized by the position vector x, relative to a fixed origin, and three vectors ki , i = 1, 2, 3, rigidly attached to the point, as functions of time, x = χ(X, t),

ki = kˆ i (X, t),

X ∈ R, t ∈ R.

To avoid inessential difficulties we let the triad {ki } satisfy ki · k j = δi j

(10.1)

and hence the triad {ki } is an orthonormal corotational basis. The function χ is assumed to be invertible at any time t ∈ R. Moreover, χ and kˆ i are assumed to be twice continuously differentiable. A superposed dot denotes time differentiation holding X fixed; hence as in classical kinematics we say that χ ˙ is the velocity and χ ¨ is the acceleration. Let Ri H (X, t) = kˆ i (X, t) · kˆ H (X, 0) be the (passive) rotation matrix. For formal simplicity, let ki stand for kˆ i (X, t) and κ H for kˆ H (X, 0). Since (ki · κ H )(k j · κ H ) = ki · k j = δi j and (ki · κ H )(ki · κ J ) = κ H · κ J = δ H J then the matrix R is orthogonal, R R T = R T R = 1l. Hence R is invertible and R −1 = R T . Consequently, R˙ R T + R R˙ T = 0. Letting

we have

Oi j := R˙ i H R j H R˙ i H = Oi j R j H ,

and hence O is skew,

Oi j = R˙ i H R j H = −Ri H R˙ j H = −O ji O = −O T .

10.1 Kinematics of Micropolar Media

613

The matrix O is often referred to as gyration matrix. The definition of R allows us to write ki = Ri H κ H ,

κ H = Ri H ki .

We can view {κ H } → {ki } as a change of basis. The corresponding change of vector components {U H } → {u i } of a vector u is given by u i = u · ki = u · Ri H κ H = Ri H u · κ H ,

u i = Ri H U H .

Thus vector components and basis vectors obey the same transformation law. Time differentiation of ki , k˙ i = R˙ i H κ H , substitution for κ H , and the definition of O result in k˙ i = R˙ i H Rh H kh = Oi h kh . To each skew matrix we can associate a vector and vice versa. The axial vector, say1 ν, associated with O is defined by νi = − 21 i hk Ohk , and hence Ohk = −hki νi . In the literature, O is called the gyration tensor and ν is called the intrinsic angular velocity ([151], part I). Yet, in view of the results of Sect. 1.4, the angular velocity of the triad {ki }, and hence of the particle, is ω = −ν, namely Ohk = hki ωi . ωi = 21 i hk Ohk , By paralleling the procedure of Sect. 1.4, this definition of the angular velocity allows us to find the Poisson relation k˙ p = ω × k p ,

p = 1, 2, 3,

and hence the classical relation for any vector function u,  du  dt

space

=

 du  dt

body

+ ω × u,

which reduces to the Poisson relation if u is any vector constant with respect to the body (particle). The derivative (du/dt)space is usually denoted by the superposed dot. If u is a vector along a direction rigidly attached to the point, and |u| is constant, then (du/dt)body = 0 and u changes in time according to 1

The symbol ν for the axial vector in micropolar theories is customary in the literature.

614

10 Micropolar Media

u˙ = ω × u. The transformation {κ H } → {ki } can be described by the active rotation tensor R such that H = 1, 2, 3. RκH = kH , Then inner multiplying by κ J we find κJ · RκH = kH · κJ ,

RJ H = RH J ,

R = RT ,

R J H being the components in the fixed basis {κ H }. Under a change of frame (1.54) the vectors, k H say, are subject to Rκ H . k H = R κ H → k∗H = Qk H = QR Hence k∗H = R ∗ κ H implies that

R. R ∗ = QR

To compute the time derivative of R we consider the identity ˙ JH = R ˙ J L RK L RK H = A J K RK H , R whence

˙ = AR, R

A = −AT .

(10.2)

Moreover, by (1.21), ω = eL 21 L H J A J H . In connection with the objectivity principle, invariant strain measures are essential in the description of the constitutive functions. In this sense we consider the wryness tensor  defined by  K L = 21  K M N (∂ X L Rk M )Rk N , R. Hence in the fixed basis {ei }. Under a change of reference R → QR T Q k j R j N = K L .  K∗ L = 21  K M N Q kh (∂ X L Rh M Q k j R j N = 21  K M N (∂ X L Rh M )Q hk

˙ The invariance of  implies the invariance, and hence the objectivity, of . For later use it is worth computing the time derivative of the wryness tensor. Since ˙ k M )Rk N + (∂ X L Rk M )R ˙ k N )] ˙ K L = 21  K M N [(∂ X L R ˙ = A R we obtain and R

10.1 Kinematics of Micropolar Media

615

˙ k M Rk N + (∂ X L Rk M )R ˙ kN ] ˙ K L = 21  K M N [(∂ X L R = 21  K M N [(∂ X L Ak j )R j M Rk N + Ak j (∂ X L Rk M )Rk N + Ak j (∂ X L Rk M )R j N ]. Now, Ak j (∂ X L R j M )Rk N + Ak j (∂ X L Rk M )R j N = Ak j (∂ X L R j M )Rk N + A jk (∂ X L R j M )Rk N = Ak j [(∂ X L R j M )Rk N − (∂ X L R j M )Rk N ] = 0. As to the first term of ˙ K L we have  K M N R j M Rk N = i jk Ri K and hence 1  (∂ X L Ak j )R j M Rk N 2 K MN

Consequently we find

= Ri K ∂ X L ωi .

˙ K L = Ri K ∂ X L ωi .

(10.3)

Consider the strain measure named Cosserat deformation tensor C and defined by C K L = Rh K Fh L . Under a change of frame C is invariant, C∗K L = R∗h K Fh∗L = Q h j R j K Q hi Fi L = R j K Q iTh Q h j Fi K = C K L . ˙ = AR we find The time derivative is then invariant too. Since F˙ = LF and R C˙ K L = A j h Rh K F j L Rh K L h j F j L = R j K (Ah j + W j h + D j h )Fh L . Since A j h = − j hl ωl , we obtain

W j h = − j hl wl , w = 21 ∇ × v

C˙ K L = R j K D j h Fh L + h jl R j K Fh L (wl − ωl ).

(10.4)

It is worth remarking that while the angular velocity ω is non-objective both the difference w − ω (see 1.71) and the gradient ∇ω are objective. As to ∇ω, since ω ∗ = Qω + a it follows that

(∇ω)∗ = ∇ ∗ ω ∗ = (Q∇)(Qω + a ) = Q∇ωQT .

616

10 Micropolar Media

Hence ∇ω is an objective tensor. The referential gradient ∇R ω satisfies (∇R ω)∗ = ∇R ω ∗ = Q∇R ω

and hence is an objective vector.

10.1.1 Orientational Momentum The rigid-body nature of the particles allows us to ascribe to each particle a microinertia tensor density I, per unit mass, such that the orientational momentum, or intrinsic angular momentum, per unit mass σ is given by σ = I ω. In the corotational basis {k j }, I is represented by a constant symmetric matrix with entries Ii j , I = ki Ii j k j . Ii j = ki · Ik j , Hence, σ = Ii j ω j ki . If m is the mass of the particle then mI is the inertia tensor of the particle. The angular momentum, per unit mass, is then defined as r×v + σ and the kinetic energy as 1 2 (v 2

+ ω · σ).

For later convenience we evaluate the time derivative of σ and ω · σ. Relative to the corotational basis, since k˙ i = ω × ki it follows σ˙ = (Ii j ω j ki )˙ = Ii j ω˙ j ki + Ii j ω j ω × ki . Further, ω ˙ = ω˙ p k p + ω p k˙ p = ω˙ p k p + ω p ω × k p = ω˙ p k p + ω × ω p k p = ω˙ p k p and Iω ˙ = (ki Ii j k j )(ω˙ p k p ) = ki Ii j ω˙ j . Hence σ˙ = I ω ˙ + ω × I ω.

(10.5)

10.1 Kinematics of Micropolar Media

617

Thus it follows (ω · σ)˙ = ω ˙ · σ + ω · σ˙ = ω ˙ · Iω + ω · Iω. ˙ Owing to the symmetry of I, we have u · Iv = v · Iu for every pair of vectors u, v, and hence (ω · σ)˙ = 2ω · σ. ˙

(10.6)

It is worth pointing out that the micro-inertia matrix, with entries Ii j , relative to a generic (non-corotational) basis {ei }, satisfies a property usually referred to as conservation of micro-inertia [148], I˙j h − O jk Ihk − Ohk I jk = 0.

(10.7)

To show this, we start with the law of transformation of the components of I under a change of orthonormal triad, {κK } → {ei }, Ri K = ei · κ K . We have e j · Ieh = R j P Rh Q κ P · Iκ Q and hence we can write I j h = R j P Rh Q IˆP Q ,

IˆP Q := κ P · Iκ Q ,

as it happens for the components of any second-order tensor. Now, upon time differentiation we have I˙j h = R˙ j P Rh Q IˆP Q + R j P R˙ h Q IˆP Q . Since Ri P Rh Q IˆP Q = Ii h , R j P Rl Q IˆP Q = I jl , use of the relations R˙ j P Ri P = O ji ,

R˙ h Q Rl Q = Ohl ,

allows us to write I˙j h = O ji Ihi + Ohl I jl ,

h, j = 1, 2, 3.

Hence, in general, the rate Eq. (10.7) for the matrix I holds. If, however, the triad {ei } is a corotational basis then the entries Ii j are constants and (10.7) reduces to O ji Ihi + Ohl I jl = 0,

h, j = 1, 2, 3.

618

10 Micropolar Media

10.2 Balance Laws We let Pt be the spatial region that convects with the body so that Pt = χ(P, t) for some region P in the reference configuration. Reynold’s transport theorem holds and is applied in the analysis of balance laws.

10.2.1 Balance Laws in the Spatial Description The pertinent fields are described by functions of x, t, with x ∈ Pt ⊆  = χ(R, t).

Balance of Mass The mass density ρ is required to obey the continuity equation ∂t ρ + ∇ · (ρv) = 0.

Balance of Linear Momentum Let b be the body force, per unit mass, and t the force per unit area. The balance of linear momentum is expressed by saying that d ∫ ρ v dv = ∫ ρ b dv + ∫ t da. dt Pt Pt ∂Pt Cauchy’s theorem proves the existence of the stress tensor T, such that t(x, t, n) = T(x, t)n. The arbitrariness of the region Pt implies the validity of the local form ρ˙v = ∇ · T + ρ b.

(10.8)

Balance of Angular Momentum The internal structure of a micropolar body is made explicit by the orientational momentum (or intrinsic angular momentum) σ, per unit mass. Let r be the position vector of a point relative to a fixed origin. So, in general, r = x − d, d being a constant vector. We allow for contribution to couple terms by letting the occurrence of a body couple density l, per unit mass, and a surface couple density s, per unit area. Hence we state the balance of angular momentum in the form

10.2 Balance Laws

619

d ∫ ρ(r × v + σ)dv = ∫ ρ(r × b + l)dv + ∫ (r × t + s)da. dt Pt Pt ∂Pt Since t = Tn then

∫ r × t da = ∫ (ϒ + r × ∇ · T)dv Pt

∂Pt

where ϒ is the vector given by ϒi = i jk Tk j . Hence we have ∫ [ρ(r × v˙ + σ˙ − r × b − l) − ϒ − r × ∇ · T]dv = ∫ s da.

Pt

∂Pt

In view of the equation of motion (10.8) it follows ∫ [ρ(σ˙ − l) − ϒ]dv = ∫ s da.

Pt

∂Pt

The assumed boundedness of ρ(σ˙ − l) − ϒ allows us to apply Cauchy’s theorem and prove that there exists a tensor field S(x, t) such that s(x, t, n) = S(x, t)n. The tensor S is named couple stress tensor. Substitution for s and use of the divergence theorem eventually results in ρ σ˙ = ϒ + ρl + ∇ · S.

(10.9)

By analogy with rigid body dynamics we consider the inertia operator I of the particle so that I ω is the intrinsic angular momentum of the particle. If m is the mass of the particle then 1 ρ σ = N I ω, σ = I ω, m N = ρ/m being the number density of particles. Let I=

1 I; m

I is then the density of inertia tensor, per unit mass. Hence ρI =

ρ I = NI, m

σ = I ω.

By (10.5) the balance Eq. (10.9) can be written in the form ρI ω ˙ + ω × ρI ω = ϒ + ρl + ∇ · S.

(10.10)

620

10 Micropolar Media

Balance of Energy The micropolar character suggests that the kinetic energy consists of a contribution due to the intrinsic rotation, in addition to the kinetic energy associated with the macroscopic velocity field. Borrowing from the kinematics of rigid body motion we let ρ 21 (v2 + ω · σ) be the kinetic energy and observe that ω · σ = ω · I ω. We assume that l and s exert a power ρl · ω per unit volume and s · ω per unit area. In addition, an (internal) energy density ε, an energy supply r , per unit mass, and an energy flux h of non-mechanical character are allowed to occur. Hence we express the balance of energy in the form d ∫ ρ[ε + 21 (v2 + ω · σ)]dv = ∫ ρ(b · v + l · ω + r )dv + ∫ (t · v + s · ω + h)da. dt Pt Pt ∂ Pt

First we replace t with Tn and s with Sn. Since ∇ · (vT) = T · L + v · (∇ · T),

∇ · (ωS) = S · ∇ω + ω · (∇ · S),

where S · ∇ω = Si j ∂x j ωi , the divergence theorem and use of (10.6), (10.8), and (10.9) imply that ∫ [ρ ε˙ − ρr − T · L + ω · ϒ − S · ∇ω]dv = ∫ h da.

Pt

∂Pt

Again we apply Cauchy’s theorem and find that the scalar field h(x, t, n) depends linearly on n. Let h(x, t, n) = −q(x, t) · n. Substitution and use of the divergence theorem show that ∫ [ρ ε˙ − ρr − T · L + ω · ϒ − S · ∇ω + ∇ · q]dv = 0.

Pt

Hence the arbitrariness of Pt allows us to conclude that the balance of energy implies the local balance equation2 ρ ε˙ = T · L − ω · ϒ + S · ∇ω − ∇ · q + ρr. Since ϒi = ik j T jkskw ,

2

T jkskw = 21 k j p ϒ p ,

There are many different equations in the literature. Equation (10.11) coincides, e.g., with (2.2.47) of [149] once we identify the present T and q with TT and −q of [149].

10.2 Balance Laws

621

then T · W − ϒ · ω = ϒ · (w − ω),

w := 21 ∇ × v;

w is the axial vector of W and is viewed as the angular velocity of the macroscopic motion. Consequently, ϒ · (w − ω) is the power associated with the skew stress Tskw and w − ω is the relative angular velocity (among the fluid and the particle). Thus the balance of energy can be written in the form ρ ε˙ = T · D + ϒ · (w − ω) + S · ∇ω − ∇ · q + ρr.

(10.11)

Remark 10.1 Though w and ω are not objective, the difference w − ω and hence the power ϒ · (w − ω) are objective. Conceptually non-objective powers might occur in that the balance equations hold in inertial frames. However, the objectivity of w − ω allows us to consider w − ω among the independent variables of the constitutive functions.

Second Law of Thermodynamics Let η be the entropy density, per unit mass, and θ the absolute temperature, on  × R. We follow the statement of the second law as in Sect. 2.6. Hence we have the inequality 1 1 ρη˙ + (∇ · q − ρr ) − 2 q · ∇θ + ∇ · k ≥ 0. θ θ Upon substitution of ∇ · q − ρr from (10.11) we have 1 ρθη˙ − ρε˙ + T · D + ϒ · (w − ω) + S · ∇ω − q · ∇θ + θ∇ · k ≥ 0. θ By means of the Helmholtz free energy ψ = ε − θη we can write the inequality in the form ˙ + T · D + ϒ · (w − ω) + S · ∇ω − 1 q · ∇θ + θ∇ · k ≥ 0. − ρ(ψ˙ + η θ) θ (10.12) The analysis of the restrictions placed by the inequality (10.12) is developed by regarding b, r , and also l as quantities whose values make the balance of linear momentum, angular momentum, and energy hold for any v˙ , ω, ˙ ε. ˙ Analogous claims hold for the space and time derivatives. Hence b, r , and l are supposed to take the values that must be applied to support the pertinent process.

622

10 Micropolar Media

10.2.2 Balance Laws in the Material Description The pertinent fields are described by functions of X, t, with X ∈ P ⊆ R. For any field u R in R the divergence theorem reads ∫ u R · n R da R = ∫ ∇R · u R dv R .

∂P

P

Balance of Mass The mass density ρ is required to obey the continuity equation, ρ˙ + ρ∇ · v = 0, while the reference mass density ρ R is constant and ρ = ρ R /J .

Balance of Linear Momentum The balance equation d ∫ ρ v dv = ∫ ρ b dv + ∫ T n da dt Pt Pt ∂Pt in the reference configuration reads d ∫ ρ R v dv R = ∫ ρ R b dv R + ∫ T R n R da R , dt P P ∂P T R being the first Piola stress,T R = J TF−T . Hence, the arbitrariness of P implies that ρ R v˙ = ρ R b + ∇R · T R .

Balance of Angular Momentum From

d ∫ ρ(r × v + σ)dv = ∫ ρ(r × b + l)dv + ∫ (r × t + s)da, dt Pt Pt ∂Pt

since t = Tn and s = S n then d ∫ ρ(r × v + σ)dv = ∫ ρ(r × b + l)dv + ∫ (r × Tn + Sn)da. dt Pt Pt ∂Pt Now, ∫ Snda = ∫ S R n R da R ,

∂Pt

∂P

S R := J SF−T ,

10.2 Balance Laws

623

S R being the material couple stress tensor. Hence, in the reference configuration, ∫[ρ R (r × v˙ + σ˙ − r × b − l)]dv R = ∫ (r × T R n R + S R n R )da R . ∂P

P

Since [∇R · (r × T R )]i = i hk Fh Q TkRQ + (r × ∇R · T R )i and

= J i hk Tkh = J ϒi i hk Fh Q TkRQ = i hk J Fh Q Tkl Fl−T Q

then, in view of the equation of motion, ρ R v˙ = ρ R b + ∇R · T R , it follows ∫[ρ R (σ˙ − l) − J ϒ − ∇R · S R ]dv R = 0. P

The arbitrariness of P implies the local form of the balance of orientational momentum, ρ R σ˙ = ρ R l + J ϒ + ∇R · S R .

Balance of Energy The global form of the balance of energy in the reference region P becomes d ∫ ρ R [ε + 21 (v2 + ω · σ)]dv R = ∫ ρ R (b · v + l · ω + r )dv R dt P P + ∫ (v · T R n R + ω · S R n R − ∇R · q R )da R . ∂P

The divergence theorem, the equation of motion and the balance of angular momentum allow us to find that the balance of energy results in ∫[ρ R ε˙ − T R · F˙ − S R · ∇R ω + J ϒ · ω − ∇R · q R + ρ R r ]dv R = 0. P

Hence we get the local form ρ R ε˙ = T R · F˙ + S R · ∇R ω − J ϒ · ω − ∇R · q R + ρ R r. Observe that T R · F˙ = T R · (LF) = J T · D + J T · W,

T · W = ϒ · w.

Consequently the balance of energy can be given the form ρ R ε˙ = J T · D + S R · ∇R ω + J ϒ · (w − ω) − ∇R · q R + ρ R r.

624

10 Micropolar Media

Second Law of Thermodynamics The entropy inequality d ∫ ρηdv ≥ ∫ ρsdv − ∫ j · nda dt Pt Pt ∂Pt in the material description becomes d ∫ ρ R ηdv R ≥ ∫ ρ R sdv R − ∫ j R · n R da R dt P P ∂P where

j R = J j F−T

is the material entropy flux. We let s = r/θ and jR =

1 qR + kR , θ

k R = J k F−T ,

k R being the material extra-entropy flux. In the local form we have the entropy inequality  1 ρR r ρ R η˙ ≥ − ∇R · q R + k R . θ θ Hence

1 1 ρ R η˙ + (∇R · q R − ρ R r ) − 2 q R · ∇R θ + ∇R · k R ≥ 0. θ θ

Substitution of ∇R · q R − ρ R r from the balance of energy results in ˙ + T R · F˙ − J ϒ · ω + S R · ∇R ω − 1 q R · ∇R θ + θ∇R · k R ≥ 0 − ρ R (ψ˙ + η θ) θ (10.13) or ˙ + J T · D + ϒ · (w − ω) + S R · ∇R ω − 1 q R · ∇R θ + θ∇R · k R ≥ 0. −ρ R (ψ˙ + η θ) θ As with the spatial description, the analysis of the restrictions placed by the inequality (10.13) is developed by regarding b, r , and also l as quantities whose values make the balance of linear momentum, angular momentum, and energy hold for any v˙ , ω, ˙ ε. ˙ Analogous claims hold for the space and time derivatives. Hence b, r , and l are supposed to take the values that must be applied to support the pertinent process.

10.2 Balance Laws

625

10.2.3 Thermoviscous Micropolar Media Fluids As a preliminary simple model of micropolar medium we consider a micropolar fluid, that is a fluid where the particles, or some particles within it, have the structure of rigid body. We then describe the constitutive properties of the fluid by letting the response functions ψ, η, T, q, and k depend on the set of variables ρ, θ, D, ∇θ, R, w − ω, ∇ω. The possible dependence on R, w − ω, and ∇ω is peculiar to the microscopic structure; R may account for the orientation, w − ω for the rotational interaction between the particles and the macroscopic motion, ∇ω for the action of the couple stress tensor. Since ψ is invariant under changes of frame then ψ can depend on D, ∇θ, R, w − ω, and ∇ω through their invariants. Let ψ be differentiable. Moreover let T = − p1 + T , where T → 0 if L, ω, ∇θ → 0. Hence the entropy inequality (10.12) can be written as ˙ +∂ ψ·D ˙ ˙ i j + ∂(w−ω) ψ · (w ˙ + ∂R ψ R ˙ − ω) −ρ{(∂θ ψ + η)θ˙ + ∂∇θ ψ · ∇θ ˙ + ∂∇ω ψ · ∇ω} D ij +(ρ2 ∂ρ ψ − p)∇ · v + T · D + ϒ · (w − ω) + S · ∇ω −

1 q · ∇θ + θ∇ · k ≥ 0. θ

˙ w ˙ and θ˙ implies ˙ ∇θ, ˙ − ω, The arbitrariness of D, ˙ ∇ω, ∂D ψ = 0,

∂∇θ ψ = 0,

∂(w−ω) ψ = 0,

∂∇ω ψ = 0,

η = −∂θ ψ.

˙ = AR and A j h = − j hl ωl The model is consistent with the condition k = 0. Since R then it follows that p = ρ2 ∂ρ ψ. The inequality then reduces to 1 −ρ∂Ri j ψ (AR)i j + T · D + ϒ · (w − ω) + S · ∇ω − q · ∇θ ≥ 0. θ Observe −ρ∂Ri j ψ (AR)i j = −ρ∂Ri j ψ Ai p R pj = ρ∂Ri j ψ R pj i pl ωl = γ · ω, where

626

10 Micropolar Media

γl = i pl ρ ∂Ri j ψ RTjp . Hence the inequality can be written in the form 1 γ · ω + ϒ · (w − ω) + T · D + S · ∇ω − q · ∇θ ≥ 0. θ

(10.14)

Inequality (10.14) is exploited under the assumption that ϒ is linear in w − ω, ∇θ, ∇ω. Let ϒ = ϒ1 (ρ, θ)(w − ω) + ϒ2 (ρ, θ)∇θ + ϒ3 (ρ, θ)∇ × ω. As D, ∇ω, ∇θ vanish, inequality (10.14) reduces to γ · ω + ϒ1 (w − ω)2 ≥ 0. This inequality holds only if γ = 0,

ϒ1 ≥ 0.

For definiteness we now take T , S, and q in linear forms. For any vector u we let (u) be the skew tensor defined by  pq = − pqi u i . Hence we consider the constitutive equations T = 2μD + λ(tr D)1 + ∇ω + 21 ϒ1 (w − ω), S = α(∇θ) + β(w − ω) + δ(∇ · ω)1 + ν∇ω + σD, q = −κ∇θ + ξ(w − ω) + ζ∇ × ω. Inequality (10.14) then reads ϒ1 (w − ω)2 + 2μD · D + λ(tr D)2 + ( + σ)D · ∇ω + (α − ζ/θ)∇ × ω · ∇θ 1 1 +β∇ × ω · (w − ω) + δ(∇ · ω)2 + ν|∇ω|2 + κ|∇θ|2 − ξ(w − ω) · ∇θ ≥ 0. θ θ It follows that ξ, β = 0,  = −σ, α = ζ/θ, μ ≥ 0, 2μ + 3λ ≥ 0, ν ≥ 0, ν + 3δ ≥ 0, κ ≥ 0.

With these restrictions in mind, we can write the constitutive equations in the form T = 2μD + λ(tr D)1 − σ∇ω + 21 ϒ1 (w − ω), S = α(∇θ) + δ(∇ · ω)1 + ν∇ω + σD, q = −κ∇θ − αθ∇ × ω.

10.2 Balance Laws

627

The corresponding set of evolution equations follows from the balance equations. Observe ∂x j i j (w − ω) = −i jk ∂x j (wk − ωk ) = −[∇ × (w − ω)]i , (w − ω) · D = 0, ∂x j i j (∇θ) = −i jk ∂x j ∂xk θ = 0, α(∇θ) · ∇ω − ∇ · ζ∇ × ω = α∇θ · ∇ × ω − α∇θ · ∇ × ω = 0. Hence we have ρ˙ + ρ∇ · v = 0, ρ˙v = −∇ p + μv + (μ + λ)∇∇ · v − σω − 21 ϒ1 ∇ × (w − ω) + ρb, ρIω ˙ + ρω × Iω = τ1 (w − ω) + δ∇(∇ · ω) + νω + 21 σ(v + ∇(∇ · v)) + ρl, ρε˙ = − p∇ · v + 2μD · D + λ(tr D)2 + δ(∇ · ω)2 + ν|∇ω|2 + κθ + ρr.

Solids A micropolar solid is modelled so that elastic and thermo-viscous properties are allowed. We follow the material description and let the response functions of ψ, η, T R , S R , q R , depend on the set of variables F, θ, D, ∇R θ, R, ∇R R, ∇R ω. The free energy can depend only on invariants. Accordingly we consider the dependence on F, R, ∇R R via the Cosserat deformation tensor C and the wryness tensor . The dependence on the Cosserat deformation tensor C is devised to model the interaction between deformation and micromotion. The dependence on the wryness tensor  is considered so as to describe possible effects of the inhomogeneous rotation field. As we see in a moment, the dependence of ψ on D, ∇R θ, and ∇R ω is ruled out by the entropy inequality. Hence we let C, θ, D, ∇R θ, , ∇R ω) ψ = ψ(C and suppose ψ is differentiable. Upon computation of ψ˙ and substitution in the entropy inequality (10.13) it follows ˙ − ρ R ∂∇R θ ψ · ∇R θ˙ − ρ R ∂ ψ · ˙ − ρ R ∂∇R ω ψ · ∇R ω −ρ R (∂θ ψ + η)θ˙ − ρ R ∂C ψ · C˙ − ρ R ∂D ψ · D ˙ 1 +T R · F˙ − J ϒ · ω + S R · ∇R ω − q R · ∇R θ + θ∇R · k R ≥ 0. θ

˙ ∇R ω, ˙ ∇R θ, In view of (10.4) and (10.3), the arbitrariness of D, ˙ θ˙ implies that ∂D ψ = 0,

∂∇R θ ψ = 0,

∂∇R ω ψ = 0,

η = −∂θ ψ.

628

10 Micropolar Media

The inequality reduces to 1 −ρ R ∂C ψ · C˙ − ρ R ∂ ψ · ˙ + T R · F˙ − J ϒ · ω + S R · ∇R ω − q R · ∇R θ + θ∇R · k R ≥ 0. θ

We now investigate the mathematical structure of the powers in this inequality. By (10.4) we find −ρ R ∂C ψ · C˙ = −ρ R R j K ∂C K L ψ Fh L D j h − ρ R R j K ∂C K L ψ Fh L  j hl (wl − ωl ). Let

R, Tˆ := ρF∂C ψR

Hence

ϒˆ l := h jl Tˆ j h .

ˆ · (w − ω). −ρ R ∂C ψ · C˙ = −J Tˆ · D − J ϒ

By (10.3) we obtain −ρ R ∂ ψ · ˙ = −ρ R ∂ K L ψRi K ∂ X L ωi = −Sˆ R · ∇ R ω where

Sˆ R := ρ R R ∂ ψ.

Moreover

T R · F˙ = (T R FT ) · L = J T · D + J ϒ · w.

We can then write the entropy inequality in the form ˆ · (w − ω) + (S R − Sˆ R ) · ∇ R ω − 1 q R · ∇R θ + θ∇R · k R ≥ 0. ˆ · D + J (ϒ − ϒ) J (T − T) θ

Again, no significant generality is lost by letting k R = 0 in that we disregard the case k R = k R (θ). Apart from cross coupling terms we have ˆ = α(w − ω), ϒ −ϒ

T − Tˆ = 2μD + λ(tr D)1 + 21 α(w − ω),

S R − Sˆ R = ξ∇ R ω + ζ(∇R · ω)1, q R = −κ∇R θ, where α ≥ 0, μ ≥ 0, 2μ + 3λ ≥ 0, ξ ≥ 0, ξ + 3ζ ≥ 0, κ ≥ 0.

10.3 Liquid Crystals

629

ˆ induced by the dependence of ψ on C , and a The stress T is the sum of a term T, dissipative term; 2μD + λ(tr D)1 is the classical viscous stress while 21 α(w − ω) is the interaction between the rotation of the single particle, ω, and the macroscopic motion, w = 21 ∇ × v. Likewise the couple stress S R is the sum of a term Sˆ R , induced by , and a dissipative part related to ∇R ω. If ψ is dependent on F also via C = FT F then T has an additional term 2F ∂C ψ FT which is a symmetric tensor.

10.3 Liquid Crystals Liquid crystals are matter in a state that has properties between those of standard liquids and those of solids. The molecules (mesogens) are given a preferred direction, whose unit vector is called director. While a liquid crystal may flow like a liquid, in certain conditions the directors point along a common axis as in a solid crystal. Different types of liquid crystals can be distinguished by their different optical properties. Liquid crystals (LCs) are divided in thermotropic, lyotropic and metallotropic phases. Thermotropic and lyotropic LCs consist of organic molecules. Thermotropic LCs exhibit a phase transition as temperature is changed whereas lyotropic LCs undergo phase transitions depending on the temperature but also on the concentration of the LC in a solvent. Metallotropic LCs consist of both organic and inorganic molecules. The various liquid-crystal phases (mesophases) can be characterized by the type of ordering. One can distinguish positional order and orientational order. In the nematic phase the rod-shaped molecules have no positional order but they self-align to have long-range directional order with their long axes roughly parallel. Most nematics are uniaxial in that they have one axis that is longer and preferred, with the other two being equivalent. Some liquid crystals are biaxial nematics in that in addition to orienting their long axis they also orient along a secondary axis. The smectic phase, which occurs at lower temperatures than the nematic, form well-defined layers that can slide over one another. The smectics are thus positionally ordered along one direction. In the smectic A phase the molecules are oriented along the layer normal, whereas in the smectic C phase they are tilted away from the layer normal. The chiral nematic, or cholesteric, phase exhibits a twisting of the molecules perpendicular to the director. The chirality induces a finite azimuthal twist from one layer to the next thus producing a spiral twisting of the molecular axis along the layer normal. The chiral pitch is the distance over which the molecules undergo a full twist. The pitch changes when the temperature is altered or when other molecules are added to the liquid crystal host.

630

10 Micropolar Media

10.3.1 Modelling of Thermotropic Liquid Crystals Thermotropic liquid crystals are modelled as micropolar media. The molecules (mesogens) are given a position x and an orientation; the orientation is characterized by an orthonormal principal triad (k1 , k2 , k3 ) rigidly attached to the molecule. We let k1 be the director. We take the principal moments of inertia as j1 ≤ j2 ≤ j3

or

j1 ≥ j2 ≥ j3 ,

according as the liquid crystal consists of rod-like or disk-like molecules. If the spatial mean k1 of the director is nonzero we can define the unit vector n as n = k1 /| k1 |. The scalar parameter s = | k1 | is called the degree of orientation and we have

k1 = s n. Nematics can be easily aligned by an external magnetic or electric field. In smectics the molecules are arranged in (parallel) layers; let N be the normal to the layers. In smectic A LCs the molecular orientation n is perpendicular to the layers and hence |n · N| = 1. In smectic C LCs the director n is tilted so that |n · N| < 1. In cholesterics the orientation n is constant within layers but n · N takes different values also relative to adjacent layers. In isotropic LCs there are patches with transient positional as well as orientational order (as with smectics) and those with only transient orientational order (as with nematics). In these clusters n, and possibly N, are defined and vary as a function of space and time. The limit case of isotropic LCs is that where s = 0 while n and N are undefined. The degree of orientation s can then be considered to describe the transition between isotropic phases and nematics.

Balance Equations Owing to the micropolar character of LCs and the occurrence of electromagnetic fields we establish the balance equations on the basis of Sects. 2.16 and 10.2. For generality, liquid crystals are allowed to be compressible. Hence the balance of mass is considered in the standard form ρ˙ + ρ∇ · v = 0. The balance of linear momentum is taken in the form

10.3 Liquid Crystals

631

ρ˙v = ∇ · T + ρbmech + ρbem , where, by (2.72), ρbem = (P · ∇)E + v × (P · ∇)B + ρπ˙ × B + μ0 (M · ∇)H E + J × B, ˙ × D + qE −v × (M · ∇)D − μ0 ρm while bmech is the body force of mechanical character. The balance of angular momentum (moment of momentum) is based on the view that a mechanical couple stress S may occur. The body couple may consist of a mechanical term lmech and an electromagnetic one lem ; by (2.73) we have ρlem = P × E + μ0 M × H. Hence the orientational momentum σ is subject to ρσ˙ = ϒ + ρlmech + ∇ · S + P × E + μ0 M × H, where ϒ is twice the axial vector of T, ϒi = i jk Tk j . Likewise we observe that the power ρl · ω splits into ρl · ω = (ρlmech + P × E + μ0 M × H) · ω while, by (2.75), an additional electromagnetic power E · π˙ + ρμ0H · m ˙ E · J + ρE occurs. It follows that3 E · π˙ + ρμ0H · m ˙ + T · D + ϒ · (w − ω) + S · ∇ω − ∇ · q + ρr. ρε˙ = E · J + ρE (10.15) Moreover the electromagnetic fields are required to satisfy Maxwell’s equations while E = E + v × B,

H = H − v × D,

J = J − N qv,

N = ρ/m.

LCs are then characterized by adding the constitutive equations. To this end we consider the electromagnetic fields, at the rest frame at the pertinent point x and time t, so that E = E, H = H, J =J. Hence, starting with the standard entropy inequality In suffix notation T · L + S · ∇ω = Ti j ∂x j vi + Si j ∂x j ωi . Moreover D is the stretching tensor, not the electric displacement.

3

632

10 Micropolar Media

ρη˙ + ∇ ·

q ρr − +∇ ·k ≥0 θ θ

and replacing ∇ · q − ρr from the balance of energy (10.15) we find ˙ + T · D + ϒ · (w − ω) + S · ∇ω − ρθη˙ − ρ˙ε + E · J + ρE · π˙ + ρμ0 H · m

1 q · ∇θ + θ∇ · k ≥ 0. θ

For technical convenience we consider the function φ = ε − θη − E · π − μ0 H · m. In terms of φ the second-law inequality can be written in the form ˙ + E · J + T · D + ϒ · (w − ω) + S · ∇ω − 1 q · ∇θ + θ∇ · k ≥ 0. − ρφ˙ − ρη θ˙ − P · E˙ − μ0 M · H θ

(10.16)

10.3.2 Director Field, Energy, and Objectivity The energy ε of LCs might depend on the direction n of the director (of the mesogen) via the coupling with external fields. Also, the orientation N of possible layers might affect the energy via the coupling with the direction n, through n · N, and both ∇N and ∇(n · N). The inner product n · N is an invariant, and hence objective, scalar. We examine the objectivity of the dependence on n and ∇n (as well as on N and ∇N). Concerning the dependence on n, ∇n, as well as N, ∇N, we have the following result. A function ε(n, ∇n) is objective if and only if  ji p (∂n j ε n i + ∂n j,k ε n i,k + ∂n k, j ε n k,i ) = 0,

p = 1, 2, 3.

(10.17)

Proof Observe that a function ε(n, ∇n) is objective if ε(n, ∇n) = ε(Qn, Q∇Qn).

(10.18)

We define the vector function n(t) via the tensor function Q(t) associated with the change of frame, ¯ n(t) = Q(t)n,

¯ (∇n)(t) = Q(t)∇Q(t)n.

Hence the objectivity requirement may also be stated in the form ε(t) ˆ := ε(n(t), ∇n(t)) = constant.

10.3 Liquid Crystals

633

Consequently ε(n, ∇n) is objective if and only if d ε/dt ˆ = 0 whence ∂n j ε n˙ j + ∂n j,k ε n ˙j,k = 0.

(10.19)

˙ T and recall that  ∈ Skw, QQT = QT Q = 1. Now, upon As usual we let  = QQ some rearrangements we find n˙ j = Q˙ ji n¯ i = Q˙ ji Q pi Q ph n¯ h =  j p n p , n ˙j,k = Q kq˙Q ji ∂xq n¯ i = Q˙ kq Q lq Q lp ∂x p Q ji n¯ i + Q kq ∂xq Q˙ ji Q hi Q hl n¯ l = kl n j,l +  j h n h,k .

Hence we have ∂n j ε  j p n p + ∂n j,k ε(kl n j,l +  j h n h,k ) = 0. Since  ∈ Skw then we can represent  via a vector,4 say u,  j p =  j pq u q . Consequently we can write ∂n j ε  j pq u q n p + ∂n j,k ε(klq u q n j,l +  j hq u q n h,k ) = 0. The arbitrariness of u implies  j pq ∂n j ε n p + klq ∂n j,k ε n j,l +  j hq ∂n j,k ε n h,k = 0, q = 1, 2, 3. 

Relabeling the summed indices we find (10.17).5 The simplest energy function involves ∇n in the form ε =∝ |∇n|2 . We find the expression (n i,k ∂n j,k + n k,i ∂n k, j )(n p,q n p,q ) = 2(n i,k n j,k + n k,i n k, j ) which is symmetric relative to the indices i, j. Hence  ji p (∂n j,k ε n i,k + ∂n k, j ε n k,i ) = 0,

p = 1, 2, 3

and the requirement (10.17) is verified. Among the possible functions ε(n, ∇n) is the energy function 4 5

Here u is the opposite of the axial vector of . The requirement (10.17) traces back to Ericksen [145]; that proof shows only the necessity.

634

10 Micropolar Media

εOF = κ1 (∇ · n)2 + κ2 (n · ∇ × n)2 + κ3 [(n · ∇)n]2 + κ4 [(∇n) · (∇n)T − (∇ · n)2 ] (10.20) usually referred to as Oseen-Frank energy density for a nematic [174, 351]. Upon a direct check it follows that the function (10.20) satisfies the objectivity requirement (10.17). Indeed, each term in (10.20) satisfies (10.17) and hence (10.20) is objective for any choice of the invariants κ1 , κ2 , κ3 , κ4 , possibly dependent on the temperature θ and the mass density ρ. Two functions depend only on ∇n while two depend on n and ∇n. We compute the left hand side of (10.17) for (∇ · n)2 , (∇n) · (∇n)T , (n · ∇)n, n · ∇ × n)2 to obtain  ji p [∂n j (∇ · n)2 n i + ∂n j,k (∇ · n)2 n i,k + ∂n k, j (∇ · n)2 n k,i ] = 2(∇ · n) ji p (n i, j + n j,i ) = 0,

 ji p {∂n j,k [(∇n) · (∇n)T ]n i,k + ∂n k, j [(∇n) · (∇n)T ]n k,i } = 2 ji p (n k, j n i,k + n j,k n k,i ) = 0,

 ji p {∂n j [(n · ∇)n]2 n i + ∂n j,k [(n · ∇)n]2 n i,k + ∂n k, j [(n · ∇)n]2 n k,i } = 2 ji p [n q n k,q n i n k, j + n q n k,q n j n k,i + n q n j,q n k n i,k ] = 0,  ji p [∂n j (n · ∇ × n)2 n i + ∂n j,k (n · ∇ × n)2 n i,k + ∂n k, j (n · ∇ × n)2 n k,i ] = 2 n · ∇ × n  ji p (n i  jqr n r,q + n r r k j n i,k + n r r jk n k,i ) = 0; in three cases the vanishing is apparent in that  ji p multiplies a symmetric quantity in i and j, in the last case a direct calculation shows that the whole expression vanishes. The director n is well-defined for nematics and cholesterics. A rotation matrix R is defined when a triad rigidly attached to the particles is available. Correspondingly, ˙ ki Rhi , relative to by (1.21) the angular velocity of the particle is given by ωl = 21 lhk R the fixed basis {el }. As with any direction rigidly attached to the particle (mesogen), the unit vector n changes in time according to n˙ = ω × n. In isotropic LCs n is undefined and R is undefined too.

10.4 Nematics We now address attention to nematics where the director field n is defined, |n| = 1. Moreover we let nematics be subject to an electric field E. For simplicity we let the LC be non-magnetizable,6 M = 0. Hence we let the constitutive functions depend on the set of variables7 6

In case of magnetizable bodies, the procedure for M and B is the dual of the present one for P and E. 7 In principle, the dependence on w − ω is admissible in that w − ω is objective.

10.4 Nematics

635

ρ, θ, E, D, ∇θ, n, ∇n, w − ω, ∇ω, where D is the stretching tensor. The entropy inequality (10.16) becomes ˙ − ρ∂ φ · ∇n ˙ − ρ∂ ˙ ˙ − ρ∂∇θ φ · ∇θ −ρ(∂θ φ + η)θ˙ − (ρ∂E φ + P) · E˙ − ρ∂D φ · D w−ω φ · w − ω ∇n 1 ˙ − ρ∂ φ · n˙ + ϒ · (w − ω) + (T + ρ2 ∂ φ1) · D + S · ∇ω − q · ∇θ + θ∇ · k ≥ 0. −ρ∂∇ω φ · ∇ω n ρ θ

˙ w− ˙ θ, ˙ ω, ∇ω, ˙ E˙ implies ˙ ∇θ, The arbitrariness of D, ∂D φ = 0, ∂∇θ φ = 0, ∂w−ω φ = 0, ∂∇ω φ = 0, η = −∂θ φ, P = −ρ∂E φ. A thermodynamically consistent model is now derived where k = 0. The entropy inequality simplifies to ˙ − ρ∂ φ · n˙ + (T + ρ2 ∂ φ1) · D + ϒ · (w − ω) + S · ∇ω − 1 q · ∇θ ≥ 0. −ρ∂∇n φ · ∇n n ρ θ

˙ = ∂ φ · (∇ n˙ − LT ∇n). In suffix notation, since n˙ = ω × n we Look at ∂∇n φ · ∇n ∇n have ∂ni, j φ n i,˙ j = ∂ni, j φ(∂x j n˙ i − L k j n i,k ) = ∂ni, j φ[i pq (ω p, j n q + ω p n q, j ) − L k j n i,k ] while ∂n φ · n˙ = ∂n φ · ω × n = ω · n × ∂n φ. Hence we have ˙ = ξ ω + A ω − G (D + W ), ρ∂n ψ · n˙ + ρ∂∇n ψ · ∇n k k ki k,i ik ik ik where ξk = ρ kr s [n r ∂n s φ + n r, j ∂n s, j φ],

Aki = ρ ks j n s ∂n j,i φ, G ik = ρ n j,i ∂n j,k φ.

Moreover, let gi = i pq G q p ,

1 G skw jk = 2 k j p g p ,

1 skw jk = 2 k j p ξ p ;

observe g = 2aG , aG being the axial vector of Gskw . We find that skw = 1 (∂ skw 1 G ik 2 n j,k φ n j,i − ∂n j,i φ n j,k ), ik = 2 (n k ∂n i φ + n k, j ∂n i, j φ − n i ∂n k φ − n i, j ∂n k, j φ).

We can then write the Clausius-Duhem inequality in the form

636

10 Micropolar Media

− (ϒ + ξ) · ω + (S − A) · ∇ω + (Tsym + Gsym + ρ2 ∂ρ φ1) · D + (ϒ + g) · w −

1 q · ∇θ ≥ 0. θ

(10.21)

For definiteness, we let S, symT, q be independent of w − ω and consider two cases about the vector ϒ. First we let ϒ depend on w − ω so that ϒ → 0 as w − ω → 0. Hence inequality (10.21) holds only if g · w − ξ · ω ≥ 0; the arbitrariness of w and ω requires that g = 0, ξ = 0, and hence Gskw = 0. These conditions in turn imply ∂n j,i φ n j,k − ∂n j,k φ n j,i = 0,

n i ∂n k φ + n i, j ∂n k, j φ − n k ∂ni φ − n k, j ∂ni, j φ = 0 (10.22) ∀i, k = 1, 2, 3. The Clausius-Duhem inequality then reduces to 1 ϒ · (w − ω) + (S − A) · ∇ω + (Tsym + Gsym + ρ2 ∂ρ φ1) · D − q · ∇θ ≥ 0. θ (10.23) Second, let ϒ be independent of w − ω. Hence inequality (10.21) holds only if (ϒ + g) · w − (ϒ + ξ) · ω ≥ 0. The arbitrariness of w and ω requires that g = −ϒ = ξ. The vector condition g = ξ results in skw = 1 [∂ skw 1 G ik 2 n j,k φ n j,i − ∂n j,i φ n j,k ] = 2 [n k ∂n i φ + n k, j ∂n i, j φ − n i ∂n k φ − n i, j ∂n k, j φ] = ik

(10.24) ∀i, k = 1, 2, 3. The Clausius-Duhem inequality then implies ϒ = −g = −2aG and reduces to 1 (S − A) · ∇ω + (Tsym + Gsym + ρ2 ∂ρ φ1) · D − q · ∇θ ≥ 0. θ

(10.25)

¯ Tˆ ∈ Sym and assume As to Tsym and S we let T, ¯ ˆ θ, n, ∇n) + T, Tsym = T(ρ,

¯ θ, n, ∇n) + S, ˆ S = S(ρ,

where Tˆ and Sˆ approach zero as D, ∇θ, and ∇ω approach zero. As a consequence, (10.23) implies the relations T¯ = −ρ2 ∂ρ φ1 − Gsym ,

S¯ = A,

10.4 Nematics

637

and the dissipation inequality ˆ · D − 1 q · ∇θ ≥ 0. ϒ · (w − ω) + Sˆ · ∇ω + T θ

(10.26)

Instead, if (10.24) holds then S¯ = A,

T¯ = −ρ2 ∂ρ φ1 − Gsym , and

1 Sˆ · ∇ω + Tˆ · D − q · ∇θ ≥ 0. θ

In both cases the constitutive equations Sˆ = α∇ω + β(∇ · ω)1 + γ∇θ ⊗ ∇θ, Tˆ = 2μD + λ(∇ · v)1 + ν∇θ ⊗ ∇θ, q = −κ∇θ − ν∇θD − γ∇θ∇ω, involving non-linear cross-coupling terms, are allowed if α, α + 3β, μ, 2μ + 3λ, κ are non-negative. Moreover the dependence ϒ = ζ(w − ω),

ζ ≥ 0,

is consistent with (10.26). A joint dependence of Tˆ on n and D is allowed in the form Tˆ = 2μD + λ(∇ · v)1 + α1 (n ⊗ Dn + Dn ⊗ n) + α2 (n · Dn)n ⊗ n where α1 , α2 are functions of ρ, θ, E subject to α1 ≥ 0, α2 ≥ −α1 . This is so because Tˆ · D = 2μ|D0 |2 + (λ + 23 μ)(∇ · v)2 + α1 |Dn|2 + α2 (n · Dn)2 . We do not consider an additive stress of the form8 ◦



n ⊗ n + n ⊗ n, ◦

in that the additive contribution 2 n ·Dn to Tˆ · D would be indefinite. The thermodynamic requirements (10.22) follow from (10.24) when both G and  vanish. Hence (10.24) is the general thermodynamic requirement on the dependence of φ on n and ∇n. For definiteness we now look at the function φ in the additive ◦ 8n



denoting the corotational derivative of n, i.e. n= n˙ − Wn; such dependence is considered e.g. in [82].

638

10 Micropolar Media

form φ = φ0 (ρ, θ, E) + σ f (n, ∇n), where σ may depend on ρ, θ, E. Hence (10.22) is a requirement on f . First let f = |∇n|2 . Since ∂n p, j |∇n|2 = 2 n p, j then both requirements (10.22) hold and hence g = ξ = 0. We now investigate the functions occurring in the Oseen-Frank energy density (10.20). Let f = (∇ · n)2 . Then both sides of (10.24) equal σ∇ · n(n k,i − n i,k ); (10.24) holds and G ik = 2ρσ ∇ · n n k,i . Let f = ∇n · (∇n)T . Then both sides of (10.24) equal σ(n j,i n k, j − n i, j n j,k ); (10.24) holds and G ik = 2ρσn j,i n k, j . Let f = |(n · ∇)n|2 . We find that both sides of (10.24) equal σn r n j,r (n k n j,i − n i n j,k ); again (10.24) holds while G ik = 2ρσn k n j,i (n · ∇)n j . Now let f = (n · ∇ × n)2 . Since ∂n j,k f n j,i = 2(n · ∇ × n)n j,i n p  pk j , n k ∂ni f + n k, j ∂ni, j f = 2(n · ∇ × n)[n k i pq n q, p + n k, j n r r ji it follows that (10.24) does not hold. For the thermodynamically consistent functions f so examined the stress G = T¯ + p1, where p = ρ2 ∂ρ φ, and the couple stress A = S¯ take the following forms. f |∇n|2 (∇ · n)2 (∇n) · (∇n)T |(n · ∇)n|2

G ik G ik G ik G ik

G = 2ρσn j,i n j,k = 2ρσn h,h n k,i = 2ρσn j,i n k, j = 2ρσn q n j,q n k n j,i

Consequently f = |∇n|2

=⇒

Aik Aik Aik Aik

A = 2ρσks j n s n j,i = 2ρσisk n s = 2ρσis j n s n k, j = 2ρσis j n s n p n j, p n k

¯ = 0, skwT

10.4 Nematics

639

f = (∇ · n)2 f = (∇n) · (∇n)T f = |(n · ∇)n|2

=⇒

(skw T)ik = ρ σ(∇ · n)(n k,i − n i,k ),

=⇒ =⇒

(skw T)ik = ρ σ(n k, j n j,i − n i, j n j,k ),

¯ ik = ρσn q n j,q (n k n j,i − n i n j,k ). (skw T)

Incidentally, if f = |∇n|2 then ϒ = 0 while if f = (∇ · n)2 it follows that ϒ = 2ρσ(∇ · n)∇ × n. The molecules may have a permanent electric dipole. Otherwise an electric field produces rearrangements of charges in the molecules such that an electric dipole results. In any case the polarization P depends on the orientation of the director n relative to the electric field. Moreover the polarization is invariant under the change n → −n. Hence, in the approximation that P depends linearly on the electric field E we have P = 0 [χ⊥ 1 + (χ − χ⊥ )n ⊗ n]E whence P · n = 0 χ E · n,

P × n = 0 χ⊥ E × n.

Accordingly, χ , χ⊥ are the electric susceptibilities in the direction of n and the orthogonal plane. In this case φ = (ρ, θ) −

1 0 E · [χ⊥ 1 + (χ − χ⊥ )n ⊗ n]E + σ f (n, ∇n). 2ρ

Dynamic Equations We now establish the evolution equations of nematics as a continuous distribution of molecules (mesogens). Since the nematic phase is allowed to be compressible the mass density ρ is required to satisfy the standard continuity equation, ρ˙ + ρ∇ · v = 0. Let m be the molecular mass. Hence the mass density ρ and the number density N are related by ρ = N m. For definiteness we let the net charge density and the magnetic moment of molecules be zero but let p be the nonzero electric dipole moment. Hence electromagnetic considerations suggest that ρb = N p · ∇E,

l = N p × E.

640

10 Micropolar Media

An additional term might be considered for l to model the tendency of molecules to align to the external field E. Hence we assume l = N p × E − ζN 2 p × (p × E),

ζ > 0.

For simplicity we let Sˆ = 0 and Tˆ = 2μD + λ(∇ · v)1 so that T = − p1 + 2μD + λ(∇ · v)1 + G,

S = A.

Moreover ϒ, which is twice the axial vector of T, is given by ϒi = i jk G k j . As shown in the previous table, the expressions of G and A depend on the selected free-energy dependence of φ on ∇n. Let I = mI be the inertia tensor of a molecule and then I is the inertia tensor density, per unit mass. Moreover observe that P = Np is just the polarization (per unit volume). The parameters μ, λ, γ are regarded as constants. Since Tik = − pδik + 2μDik + λ(∇ · v)δik + G ik we can write the equation of motion (10.8) in the form ρv˙i = Pk E i,k − p,i + 2μvi,kk + (λ + 2μ)vk,ki + G ik,k . The balance equation for the orientational momentum (10.10) can be given the form ρ(I ω ˙ + ω × I ω) = ϒ + P × E − ζP × (P × E) + ∇ · S. (10.27) The energy balance equation can be written as ρε˙ = − p∇ · v + 2μD · D + λ(∇ · v)2 + ϒ · (w − ω) + S · ∇ω − ∇ · q + ρr, where w = 21 ∇ × v. In general the vector ϒ can be given in the form ϒi = i jk (G k j + Tˆk j ),

10.4 Nematics

641

where G is the stress tensor characterized by the dependence of the free energy φ on n and ∇n while Tˆ may involve appropriate dependences on ∇ω and ∇θ.

10.4.1 Relation to Other Models Involving the Director Field Among the first continuum models of LCs we mention those of Ericksen [146] and Leslie [277]. In these schemes the orientation of molecules is modelled via a director n, |n| = 1, and the form of molecules is described by a parameter, S, called degree of orientation. The main points of these approaches are summarized as follows. The fluid is assumed to be incompressible and the equation of motion is taken in the standard form. Next a generalized scalar momentum P is considered subject to the balance equation P˙ = ∇ · t + f I + f E , where f I and f E represent internal and external body forces. To determine n it is assumed a vector equation π = ∇ · T + gI + gE . Next [146] the balance of energy is assumed in the form d ∫ ρ(e + 21 v2 )dv = ∫ [v · T + n˙ · ϒ + S˙ T − q] · nda + ∫ b · v + f E S˙ + f E · n˙ + ρr ]dv, dt   ∂

where e is said to include the rotational kinetic energy. Hence it follows ρ e˙ = T · L + T · ∇ n˙ + ∇ S˙ · t + ( P˙ − f I ) S˙ + (π˙ − g I ) · n˙ − ∇ · q + ρr. Another approach (see, e.g., [277]) is based on balance laws for linear and angular momentum without appeal to generalized forces and moments associated with the director N. Yet thermal and inertial terms are omitted, which simplifies the whole scheme. Here the essential points are outlined. The nematic is assumed to be incompressible and the balance of linear momentum takes the standard form, ρ v˙ = ∇ · T + ρ b. In our notation, the balance of angular momentum is assumed in the form d ∫ ρ r × v dv = ∫[ρ r × b + τ ]dv + ∫ (r × t + s)da dt   ∂ whence it follows τ + ϒ + ∇ · S = 0.

(10.28)

Hence, neglecting the occurrence of the orientational momentum results in an equilibrium (or constraint) equation for τ , ϒ, S. Correspondingly, the balance of energy is assumed in the form

642

10 Micropolar Media

d ∫ ρ(e + 21 v2 )dv = ∫ [t · v + s · ω]da + ∫[ρ b · v + τ · ω − D]dv, dt   ∂ where D is said to be the rate of viscous dissipation. Thus it follows that ρ e˙ = T · L + S · ∇ω − ϒ · ω − D. Two comments are in order. The form of the mechanical power, T · L + S · ∇ω − ϒ · ω, is consistent with other approaches (and our’s). Yet, the occurrence of (10.28) as an equilibrium condition, instead of the balance of orientational momentum, deprives the scheme of an equation for the time dependence of ω. Further, by arguing with the function e(n, ∇n) it follows that dissipative terms in the balance of energy arise9 subject to ˜ · ω = D ≥ 0. T˜ · L + S˜ · ∇ω − ϒ So, the occurrence of D conceptually remedies the absence of the entropy inequality. Borrowing from [146], the dynamic equations of nematic LCs have been considered [82] by assuming |n| = 1, the incompressibility condition, ∇ · v = 0, the occurrence of an order parameter field s, and the evolution equations ◦

n × n = −γn × Dn,

ρ˙v = ∇ · T, where





T = − p1 + αD + α2 (n ⊗ Dn + Dn ⊗ n + 2α1 (n · Dn)n ⊗ n + γ2 (n ⊗n + n⊗ n) + 2β1 s˙ n ⊗ n,

the coefficients γ, α, α2 , α1 γ2 , β1 being functions of s. This scheme is consistent with thermodynamics if φ is a function of s while T is a function of s and D; the entropy inequality simplifies to −ρφ s˙ + T · D ≥ 0, that is ◦

−ρφ s˙ + α|D|2 + 2α2 |Dn|2 + 2α1 (n · Dn)2 + 2γ2 n ·Dn + 2β1 s˙ n · Dn ≥ 0. Since



n ·n = n˙ · n − n · Wn = 0

n˙ · n = 0, ◦

then the evaluation of n × (n × n) results in ◦

n= γ[Dn − (n · Dn)n].

9

See, e.g., Eq. (3.12) of [277].

10.5 Smectics and Cholesterics

643



Upon substitution of n the inequality can be written as (−ρφ + 2β1 n · Dn)˙s + α|D|2 + 2(α2 + γ2 γ)|Dn|2 + 2(α1 − γ2 γ)(n · Dn)2 ≥ 0. The inequality holds if α1 , α2 > 0, while s satisfies

|γ2 γ| < α1 , α2

s˙ = −ρφ + 2β1 n · Dn.

This equation can be viewed as the evolution equation for the order parameter s. The Landau-de Gennes model describes liquid crystals not via the director n but via an order parameter tensor Q, whose entries are interpreted as the suitably normalized covariance matrix of the probability distribution in space  of the unit N (ni ⊗ vector n [48, 146] or the space average of a traceless dyadic product, i=1 1 ni − 3 1)/N [13]. Otherwise, the order tensor Q is defined by analogy with the magnetic susceptibility ([117], p. 57).

10.5 Smectics and Cholesterics The common feature of smectics and cholesterics is the layered structure; let N be the normal to the plane of the layer. In smectics the molecules have a common orientation within any layer; in smectic A and B LCs the molecules are parallel to N, in smectic C LCs the molecules form a common angle with the normal within any layer and among the layers. In smectic A the molecular axes are not ordered within the planes, in smectics B the axes are packed in a hexagonal form. In cholesteric LCs the molecules form a common angle with the normal within any layer but the angle changes in passing from a layer to another. As a result, the mesogens show a helical structure. To model smectics and cholesterics we keep denoting by n the director of the molecules. In smectics N · n is constant; |N · n| = 1 in smectics A and B, |N · n| < 1 in smectics C. Let α be the angle between N and n, so that N · n = cos α; α = 0 in smectics A and B. To set up any constitutive model we need a view on the energy. While the OseenFrank function εOF in (10.20) is a reference model of energy, smectic crystals require a more involved description. The Landau-de Gennes model is established by an assumed analogy between superconductors and smectics [116]. By this analogy, a complex-valued density modulation (x) is introduced with the view that the absolute value is the amplitude of the smectic order and the phase describes the layer displacement. With this view the additional energy density is

644

10 Micropolar Media

εd G = |(i∇ + qn)|2 + μ(−||2 + 21 ||4 ), where μ is a positive constant and, by [116], (x) = ρ1 exp(iqn · x), where q is the characteristic smectic layer number, 2π/q is the layer thickness. Various energy functions are referred to as Landau-de Gennes models. One involves the symmetric matrix Q in the form ε(Q) = f e (Q) + f L d G (Q), where10 f e (∇Q) = 21 L 1 |∇Q|2 + 21 L 2 |∇ · Q|2 + 21 L 3 ∇Q · ∇QT , f L d G (Q) = a tr Q2 + 23 b tr Q3 + 21 c (tr Q2 )2 . Another one is based on the view that the energy density is the sum of three terms, ε = εlayer + εcoupling + εOF , where εlayer is a function of ||2 while εcoupling is a function of ∇. Here we follow a scheme of continuum mechanics though in a unusual way. The smectic and cholesteric LCs are viewed as a sequence of thin parallel layers within an isotropic fluid. Each layer is viewed as a two-dimensional elastic continuum. The normal N to the layers is then constant, say N = e3 . Let x1 , x2 describe the position within the layer. For technical convenience we adopt the Lagrangian description and denote by X 1 , X 2 the pertinent coordinates for the layer, the third coordinate X 3 being a label for the layer. Hence the restriction of F and C = FT F to the first minor 2 × 2 describes the deformation in a layer. Moreover let α be the angle between N and n. The layered structure allows us to consider α(X 1 , X 2 ) parameterized by X 3 . The restriction e.g. to α(X 1 , X 3 ) is expected to show the helical structure. By the assumed elastic properties of the layers we let εlayer be given in the form εlayer = fl (∇R u, ∇R α), where u is the displacement. While εcoupling involves |∇|2 in a Landau-de Gennes model, we simply let εcoupling be a function of ∇ρ. The Oseen-Frank energy is regarded as elastic energy. Here the elastic energy is considered via the dependence on N × ∇R u. Consequently we let the free energy ψ and the other constitutive functions depend on the set of variables θ, E, ∇R α, 10

In suffix notation, ∇Q · ∇QT = Q ik, j Q i j,k .

10.6 Mixtures of Micropolar Constituents

645

where E is the Green-St Venant strain, E = 21 (∇R u + ∇R uT + ∇R uT ∇R u). We assume the stress T is symmetric and hence Tˆ R R = J F−1 TF−T is symmetric too. The molecules in a layer may rotate, the angle α describing the rotation in the ˙ 2 . Since T = TT then τ = 0. The material couple stress plane (X 1 , X 3 ); let ω = αe −T tensor S R = J SF allows us to write J S · ∇ω = S R · ∇R ω = SiRK ∂ X K ωi . Thermodynamic restrictions are derived by means of the entropy inequality in the material form (10.13). For formal simplicity we neglect heat conduction effects and hence investigate the entropy inequality ˙ + T R R · E˙ + S R · ∇R ω + θ∇R · k R ≥ 0. −ρ R (ψ˙ + η θ) Upon substitution of ψ˙ we have −ρ R (∂θ ψ + η)θ˙ − ρ R (∂E ψ + T R R ) · E˙ − ρ R ∂∇R α ψ · ∇R α˙ + S R · ∇R ω ≥ 0. Since ω = e2 α˙ then

˙ S R · ∇R ω = e2 S R · ∇R α.

˙ E, ˙ ∇R α˙ implies that The arbitrariness of θ, η = −∂θ ψ,

T R R = ρ R ∂E ψ,

e2 S R = ρ R ∂∇R α ψ.

10.6 Mixtures of Micropolar Constituents There are cases where the occurrence of different constituents and the micropolar character indicate that a more appropriate model is that of a mixture of micropolar constituents rather than that of a single micropolar medium. For generality we now look at a mixture of micropolar continua. Later on we simplify the model by regarding only one constituent as strictly micropolar in character, which is the natural model for suspensions of particles in fluids. We consider a mixture of n diffusing and chemically-reacting bodies Bα . The balances of mass and linear momentum are the same as those for non-polar constituents, ∂t ρα + ∇ · (ρα vα ) = τα , ∂t (ρα vα ) + ∇ · (ρα vα ⊗ vα ) = ∇ · Tα + ρα bα + mα , in the Eulerian description, or

646

10 Micropolar Media

ρ`α + ρα ∇ · vα = τα , ρα v` α = ∇ · Tα + ρα bα + mα − τα vα , in the Lagrangian description. Of course, the growths {τα }, {mα } are subject to 

α τα

= 0,



α mα

= 0.

The balance of angular momentum is assumed in the form 

 ∫ ρα (r × vα + σ α )dv ` = ∫ r × (Tα n)da + ∫ [r × (ρα bα + mα ) + ρα lα + cα ]dv,

Pαt

∂ Pαt

Pαt

where σ α is the orientational momentum, lα is the body couple density and cα is the growth of angular momentum, due to the action of the other constituents. We might also account for a surface couple, which then would be represented by a couple stress tensor applied to the normal n, but this is neglected for formal simplicity. Since {cα } represents the set of exchanges we require that 

α cα

= 0.

We recall that a balance equation of the form 

 ∫ ρα φα dv ` = ∫ ξα dv

Pαt

Pαt

implies that ρα φ` α = ξα − τα φα ,

∂t (ρα φα ) + ∇ · (ρα φα vα ) = ξα .

Hence, since r` means the derivative at Xα fixed, r` = vα and the balance of angular momentum results in ρα (r × v` α + σ` α ) = r × (∇ · Tα + ρα bα + mα ) + ρα lα + ϒ α + cα − τα (r × vα + σ α )

and ∂t [ρα (r × vα + σ α )] + ∇ · [ρα (r × vα + σ α ) ⊗ vα ] = r × (∇ · Tα + ρα bα + mα ) + ρα lα + ϒ α + cα ,

where

ϒ α = i jk Tkαj ei .

Hence we find ρα σ` α = ρα lα + ϒ α + cα − τα σ α .

(10.29)

10.6 Mixtures of Micropolar Constituents

647

It is worth remarking that we might have postulated the balance of angular momentum by considering cˇ α = cα + r × mα as the growth of angular momentum. Since 

α cα



α mα



= 0,

ˇα αc

= 0 then the requirements =0

are equivalent. The use of cˇ α , instead of cα , leads to the local balance of orientational momentum in the (equivalent) form ρα σ` α = ρα lα + ϒ α − r × mα + cˇ α − τα σ α . It looks more suggestive to have the form (10.29) for the balance of orientational momentum. As to the Eulerian description, we observe that ∇ · [ρα (r × vα ) ⊗ vα ]|i = ∂xk [ρα (r × vα )i vkα ] = ∂xk (ρα i j p r j v αp vkα ) = i j p r j ∂xk (ρα v αp vkα ) + ρα i j p v αp vkα ∂xk r j = r × ∇ · (ρα vα ⊗ vα )|i + i j p (vkα ∂xk r j )ρα v αp and hence ∂t [ρα (r × vα )] + ∇ · [ρα (r × vα ) ⊗ vα ] = r × ∂t (ρα vα ) + ρα ∂t r × vα +r × ∇(ρα vα ⊗ vα ) + (vα · ∇)r × ρα vα . Moreover, since ∂t r + vα · ∇r = vα then ρα (∂t r + vα · ∇r) × vα = ρα vα × vα = 0. As a consequence, we find that ∂t (ρα σ α ) + ∇ · (ρα σ α ⊗ vα ) = ρα lα + ϒ α + cα .

(10.30)

Borrowing from the kinematics of rigid bodies we let Iα be the symmetric inertia tensor (per unit mass) and assume that σ α = Iα ω α . We also let ρI =



α ρα Iα

define the mean inertia tensor I. The vectors {kαj } attached to the particles of the αth constituent evolve in time according to

648

10 Micropolar Media

k` αj = ω α × kαj . Hence we have ` α + ω α × Iα ω α . σ` α = Iα ω As a consequence, (σ α · ω α )` = 2 σ` α · ω α . The balance of energy is based on the view that the energy per unit mass, of any constituent α, comprises an internal energy εα , a kinetic energy vα2 /2, and an orientational energy σ α · ω α /2. Hence we postulate the balance of energy in the form 

 ∫ ρα (εα + 21 vα2 + 21 σ α · ω α )dv ` = ∫ (vα Tα − qα ) · n da

Pαt

∂Pαt

+ ∫ [ρα vα · bα + ρα lα · ω α + ρα rα + eα ]dv, Pαt

where qα is the heat flux, rα is the external energy supply and eα is the growth of energy. As any set of growths, the set {eα } is required to satisfy 

α eα

= 0.

In view of the equation of motion and of the balance of orientational momentum (10.29) we find the local form of the balance of energy in the form ρα ε`α = Tα · Lα − ∇ · qα + ρα rα − ϒ α · ω α + εˆα ,

(10.31)

where εˆα = eα − τα εα − mα · vα + 21 τα vα2 − cα · ω α + 21 τα ω α · Iα ω α .

(10.32)

The energy supply εˆα is the energy exchanged, per unit volume, by the αth constituent, with the other constituents, relative to the observer at the αth constituent. It is apparent how it consists of three contributions, related to internal energy, linear momentum and orientational momentum. As with micropolar continua, we have Tα · Lα − ϒ α · ω α = Tα · Dα + Tα · Wα − ϒ α · ω α . By means of the axial vector wα of Wα , Wiαj = k ji wkα , we find that

wα = 21 ∇ × vα ,

10.6 Mixtures of Micropolar Constituents

649

Tα · Wα − ϒ α · ω α = ϒ α · (wα − ω α ), which may be viewed as the power of the skew part of the stress Tα on the constituent α. It is of interest that τα εα and τα 21 (vα2 + ω α · Iα ω α ) occur in the energy supply εˆα with different signs. If τα > 0 then τα εα is associated with a decrease of internal energy. Instead, τα 21 (vα2 + ω α · Iα ω α ) produces an increase of internal energy in that mechanical kinetic energy results in internal energy. By (10.31) we obtain the Eulerian description of the energy balance equation in the form ∂t (ρα εα ) + ∇ · (ρα εα vα ) = Tα · Lα − ∇ · qα + ρα rα − ϒ α · ω α + eα −mα · vα + 21 τα vα2 − cα · ω α + 21 τα ω α · Iα ω α .

10.6.1 Second Law Inequality The statement of the second law parallels that for mixtures in Sect. 9.3. The balance of entropy in the Lagrangian and the Eulerian descriptions is given by ρα η`α + τα ηα + ∇ · jα − ρα sα = σα , and ∂t (ρα ηα ) + ∇ · (jα + ρα ηα vα ) − ρα sα = σα , where jα is the entropy flux and sα is the entropy supply. The entropy growths {σα } enter the statement of the second law as follows. The inequality  (10.33) α σα ≥ 0 must hold, at any point x ∈  and for any time t ∈ R, for any set of functions on  × R compatible with the balance equations. Again, the entropy supply sα is identified with rα /θα and the entropy flux jα is taken in the form qα + kα , jα = θα the (unknown) vector kα representing the extra-entropy flux. The analysis of compatibility of constitutive assumptions with the entropy inequality (10.33) is more direct in the Lagrangian description. Hence we write the entropy inequality (10.33) in the form  1 1 [ρα θα η`α + τα θα ηα + ∇ · qα − ρα rα + θα ∇ · kα − qα · ∇θα ] ≥ 0. α θα θα

650

10 Micropolar Media

Substitution of ∇ · qα − ρα rα from (10.31) yields 

α{

1 1 [ρα θα η` α + τα θα ηα − ρα ε` α + Tα · Lα − ϒ α · ω α + εˆ α − qα · ∇θα + θα ∇ · kα ]} ≥ 0. θα θα

In terms of the Helmholtz free energy ψα = εα − θα ηα , the inequality can be written in the form 

α{

1 1 [−ρα (ψ` α + ηα θ` α ) + Tα · Lα − ϒ α · ω α + εˆ α + τα θα ηα − qα · ∇θα + θα ∇ · kα ]} ≥ 0. θα θα

(10.34)

10.7 Nanofluids Nanofluids are envisioned to describe suspensions of (solid) nanoparticles (nominally 1-100 nm in size) in conventional base fluids such as water, oils, or glycols. They are fundamentally characterized by the fact that Brownian agitation overcomes any settling motion due to gravity. The term nanofluid then indicates a mixture where the properties of both the nanoparticles and the base fluid contribute to the whole body. In this sense it is natural to view nanofluids as micropolar mixtures with two constituents: one constituent consists of the nanoparticles, the other one is the base fluid. Hence the nanoparticles are a micropolar constituent, the base fluid is a nonpolar constituent. The non-polar character of the base fluid is made formal by letting the orientational momentum σ and the body couple (or torque) density l be zero. Moreover the constituents are supposed to be non-reacting, τα = 0. Denote by the subscripts s and f the quantities pertaining to solid nanoparticles and base fluid. We let bs = b f = g be the gravity acceleration. Hence the balance equations are ρ`s + ρs ∇ · vs = 0, ρ` f + ρ f ∇ · v f = 0, ρs v` s = ∇ · Ts + ρs g + ms , ρ f v` f = ∇ · T f + ρ f g + m f , ρs σ` s = ρs ls + ϒ s + cs , 0 = ϒ f + cf,

10.7 Nanofluids

651

ρs ε`s = Ts · Ls − ∇ · qs + ρs rs − ϒ s ω s + es − ms · vs − cs · ω s , ρ f ε` f = T f · L f − ∇ · q f + ρ f r f + e f − m f · v f , subject to the constraints ms + m f = 0,

cs + c f = 0,

es + e f = 0.

The two equations of balance of orientational momentum imply that ρs σ` s = ρs ls + ϒ s + ϒ f , which means that the rotation of nanoparticles is governed by the (external) body couple density ls and the skew part of the total stress, ϒ s + ϒ f . Due to the dispersion of the nanoparticles in the fluid, it is reasonable to assume that the nanoparticles are an inviscid, micropolar fluid while the base fluid is viscous. Since ω f is undefined it is consistent to let ϒ f = 0, which follows also as a thermodynamic requirement from (10.34) in view of the arbitrariness of w f . In turn it follows cs = 0. c f = 0, Though ϒ s = 0 would be allowed we assume ϒ s = ζ(ws − ω s ). Hence we let Ts = − ps 1 + Tskw s , ϒ s = ζ(ws − ω s ),

T f = − p f 1 + 2μ f D f + λ f tr D f 1,

where D f is the stretching of the fluid while ps and p f are the partial pressures. Hence we can write the balances of energy in the forms ρs ε`s = − ps ∇ · vs + ζ(ws − ω s )2 − ∇ · qs + ρs rs + es − ms · vs , ρ f ε` f = − p f ∇ · v f + T f · D f − ∇ · q f + ρ f r f − es − m f · v f . It is reasonable to regard ms and m f as interaction forces modelling the friction between the two constituents and then we let ms = ν(v f − vs ) = −m f . Hence it follows that ρs ε` s = − ps ∇ · vs + ζ(ws − ω s )2 − ∇ · qs − ν(v f − vs ) · vs + ρs rs + es , ρ f ε` f = − p f ∇ · v f + 2μ f D f · D f + λ f (tr D f )2 − ∇ · q f + ν(v f − vs ) · v f + ρ f r f − es .

This model is compatible with thermodynamics. Let

652

10 Micropolar Media

ψα = ψα (θα , ρα ) and recall that, since τα = 0, then εˆα simplifies to εˆα = eα − mα · vα . Further, −ms · vs − m f · v f = ν(v f − vs )2 . Hence inequality (10.34) yields ηα = −∂θα ψα . Let θs = θ f = θ. The entropy inequality reduces to −ρs ∂ρs ψs ρ`s − ρ f ∂ρ f ψ f ρ` f − ps ∇ · vs − p f ∇ · v f + 2μ f D f · D f + λ f (tr D f )2 1 +ζ(ws − ω s )2 + ν(v f − vs )2 − (qs + q f ) · ∇θ + θ∇ · k ≥ 0. θ Replace ρ`s and ρ` f with −ρs ∇ · vs and −ρ f ∇ · v f . The linearity of ∇ · vs and ∇ · v f implies that the standard relation has to hold for the partial pressures, ps = ρ2s ∂ρs ψs ,

p f = ρ2f ∂ρ f ψ f .

The reduced inequality holds if k = 0 and μ f ≥ 0,

2μ f + 3λ f ≥ 0,

ν ≥ 0,

qs + q f = −κ∇θ,

κ ≥ 0;

ζ ≥ 0,

and μ f , λ f may depend on θ, ρ f , ∇θ, D f while ν, κ may depend on θ, ρs , ρ f , ∇θ, D f and ζ may depend on θ, ρs . Evolution problems consist in the determination of the fields ρs , ρ f , vs , v f , ω s , θ, es .

10.7.1 Brownian Diffusion and Thermophoresis Brownian diffusion results from continuous collisions between the nanoparticles and the molecules of the base fluid [72]. Correspondingly, the nanoparticle mass flux due to Brownian diffusion is modelled by letting a term of the form

10.7 Nanofluids

653

hs = −ρD∇ϕ, where ϕ is the volume fraction of the nanoparticles. This term is merely the standard model of diffusion possibly replacing the volume fraction with the mass fraction. In nanofluids it is customary to observe the thermophoretic effect, namely the appearance of a force on the particles when the temperature of the surrounding medium (gas) is not uniform. For particles whose diameters are smaller than the gas mean free path, the thermophoretic force depends on the temperature gradient in the surrounding gas molecules. Since a higher temperature means a larger kinetic energy of the molecules then the greater momentum received from the region at higher temperature results in a movement of the nanoparticles toward the cooler region. For particles whose diameters are larger than the gas mean free path the mechanism is more complicated because a temperature gradient is established within the particle. This in turn increases heat conduction and hence the temperature gradient in the gas decreases. In any case, the thermophoretic effect is modelled by letting the force, acting on the particles, be proportional to the temperature gradient in the gas. On the whole, the diffusion flux hs is taken to be of the form hs = −ρωs ω f D  ∇θ − ρD∇ωs , where D  is the thermal diffusion coefficient or thermophoretic mobility and ωs , ω f are the mass fractions [72, 118]. Within the theory of mixtures the diffusion flux is required to be consistent with Eq. (9.68) and hence a model for the thermophoretic effect should be established via the interaction force ms . However, we cannot merely let ms = λ∇θ because then we should let m f = −λ∇θ and account for the second law inequality. Instead, we can consider the standard model of the velocity differences and set ms = Ms f ρs ρ f (v f − vs ). As shown in Sect. 9.4.2, in binary mixtures we have the direct relation ms = −Ms f ρhs . By (9.71) we have

h˙ s + Ms f ρhs = ∇ · Ts − ωs ∇ · T.

In stationary conditions (h˙ s = 0) we have Ms f ρhs = −∇ ps − ωs ∇ · T  −∇ ps ,

654

10 Micropolar Media

the approximation being reasonable for small values of ωs . Hence if ps = ps (ρs , θ) and ρ = ρs + ρ f is (assumed to be) constant then11 ∇ ps = ∂ρs ps ρ∇ωs + ∂θ ps ∇θ. Accordingly hs = −

1 (ρ∂ρs ps ∇ωs + ∂θ ps ∇θ). Ms f ρ

Upon the assumption of negligible values of ∇ωs the corresponding thermophoretic velocity us is then established as us =

1 1 ∂θ p s hs = − ∇θ. ρs Ms f ρρs

These relations for the mass flux of nanoparticles and the thermophoretic velocity are found within the approximation of stationary fluxes in diluted nanofluids.

10.7.2 Model Approximations and Nanofluid Properties In the current literature the properties of nanofluids are often determined by regarding the base fluid as incompressible while the particles have a finite extent. Hence, we let T f = − p f 1 + 2μ f D f , and say that, outside the particles, v f is subject to ∇ · vf = 0 ρ f v` f = −∇ p f + μ f v f . The interaction force m f is taken to be zero in the region occupied by the fluid. The interaction force and torque occur only at the surface of the particles. Hence the equation of motion for the particle is ρs Vs v` s = ∫ T f n da + ρs Vs g, ∂Vs

where Vs is the region occupied by the particle, with volume Vs , g is the gravity acceleration, ρs Vs g is the weight of the particle. The rotational motion, relative to the centre of mass x0 , is governed by 11

This approach parallels the argument given in connection with the Soret effect, see (9.102).

10.7 Nanofluids

655

ρs Vs σ` s = ∫ (x − x0 ) × T f n da + ρs Vs ls . ∂Vs

By the divergence and mean theorems we have ∫ T f n da = ∫ ∇ · T f dv = Vs ∇ · T f (˜x), Vs

∂Vs

x˜ being a point in Vs . Dividing by Vs and letting the diameter of Vs approach zero we find the local equation ρs v` s = ∇ · T f + ρs g. Likewise, ∫ (x − x0 ) × T f n da = ∫ [ϒ f + (x − x0 ) × ∇ · T f ]dv. Vs

∂Vs

The components of ϒ f + (x − x0 ) × ∇ · T f are Vs times appropriate values in Vs . Dividing by Vs and letting the diameter of Vs approach zero we find the expected relation ρs σ` s = ϒ f + ρs ls . For a finite domain Vs we might apply also forces and torques related to v f − vs . We now go back to the mixture approach. If the fluid is incompressible then the properties of the nanofluid are consistently described by the volume fraction ϕ of the particles, and 1 − ϕ of the fluid. Letting ρ0f , ρ0s be the constant intrinsic mass densities we have ρ f = (1 − ϕ)ρ0f . ρs = ϕρ0s , However, while ∇ · v f = 0 in the current literature in the fluid region, where ϕ = 0, in the mixture model we have to regard ϕ ∈ [0, 1] then v f and vs need not be solenoidal. The continuity equations are given the form ∂t ϕ + ∇ · (ϕvs ) = 0, −∂t ϕ + ∇ · ((1 − ϕ)v f ) = 0, whence ∇ · [ϕvs + (1 − ϕ)v f ] = 0. The equations of motion read ρ0s ϕ`vs = −∇ ps + ϕρ0s g + ms , ρ0f ϕ`v f = −∇ p f + μ f (v f + ∇∇ · v f ) + (1 − ϕ)ρ0s g − ms ,

656

10 Micropolar Media

where ps = ps (θ, ϕρ0s ),

p f = p f (θ, (1 − ϕ)ρ0f ).

The balance of orientational momentum is written as ` s + ω s × I s ω s = ls . Is ω Finally, look at the balance of energy under the assumption that the mixture has a single temperature θ. Regard εs and ε f as functions of θ and the corresponding densities ρs and ρ f . Since ps = ρ2s ∂ρs ψs we have ρs ∂ρs εs ρ`s = ρs ∂ρs (ψs + θηs )(−ρs ∇ · vs ) = − ps ∇ · vs − ρ2s θ∂ρs ηs ∇ · vs , the analogue being true for ε f and ρ f . Hence summation of the energy equations and the constraint es + e f = 0 yield ∂θ (ρs εs + ρ f ε f )θ˙ + ∂θ (ρs εs )us · ∇θ + ∂θ (ρ f ε f )u f · ∇θ = ρ2s θ∂ρs ηs ∇ · vs + ρ2f θ∂ρ f η f ∇ · v f −∇ · (qs + q f ) + ρ f r f + ρs rs + 2μ f D f · D f + 2λ(ω ˆ f − ω s )2 + ν(v f − vs )2 ,

where ρs = ϕρ0s and ρ f = (1 − ϕ)ρ0f . One of the two balances of energy provides the energy growth es needed to keep the temperatures θs , θ f at the common value θ. The equation for εs results in ρs ∂θ εs (∂t + vs · ∇θ) + ρ2s ∂ρs θηs ∇ · vs = −∇ · qs + ρs rs + es + ν(vs − v f ) · vs + 2λ(ω s − ω ˆ f ) · ωs .

We have then a system of partial differential equations for the unknowns ϕ, vs , v f , ω s , θ, es . The density of the nanofluid ρn f is taken as ρn f = ϕρs + (1 − ϕ)ρ f . Now, apart from the terms ∂θ (ρs εs )us · ∇θ, ∂θ (ρ f ε f )u f · ∇θ arising from diffusion, the common value of the temperature, which is kept by an appropriate field es of energy growth, shows that the balance of energy holds as though the mixture had an equivalent internal energy ρs εs + ρ f ε f .

10.7 Nanofluids

657

We can then say that the heat capacity ρc = ρ∂θ ε of the nanofluid is (approximately) given by12 ρn f cn f = ϕρ0s cs + (1 − ϕ)ρ0f c f . Numerical values of the shear viscosity μn f of the nanofluid are taken to be given by the shear viscosity μ f of the fluid by, e.g., the relation [67] νn f =

νf (1 − ϕ)2.5

The thermal conductivity of the base fluid (e.g. water, oil) is very low. Usually the nanoparticles (e.g. Cu, Al2 O3 , TiO2 ) have a very high conductivity, which results in an improved heat conduction even with low nanoparticle concentration. For definiteness, the thermal conductivity is approximated by the Maxwell-Garnett formula ([306], p. 364) κs + 2κ f + 2ϕ(κs − κ f ) , κ = κf κs + 2κ f − ϕ(κs − κ f ) valid for spherical particles. Heat conductivity may be further improved by letting the particles be anisotropic and controlling the orientation by means of the body couple density τ s . The literature shows a wide use of a very simple model where the nanofluid is viewed as an appropriate single fluid where micropolar effects are ignored ([72, 106]). Let v be the velocity of the nanofluid. The balance of mass and linear momentum are taken as in the case of a viscous fluid ∂t ρ + ∇ · (ρv) = 0,

∂t (ρv) + ∇ · (ρv ⊗ v) = ∇ · T + ρb,

where b is identified with the acceleration gravity while T = − p1 + 2μn f D + 2 μ ∇ · v1. The balance of energy is simplified to 3 nf c[∂t (ρs θ) + ∇ · (ρs vθ)] = ∇ · (k∇θ) where c is the specific heat and k is the heat conductivity, of the nanofluid. The mixture character of the nanofluid is made formal by the diffusion equation in the form ∂t (ρω) + ∇ · (ρωv) = −∇ · J p , where J p = −ρn f (D B ∇ω + DT ∇θ) is the nanoparticles diffusion flux and D B and DT are the Brownian and thermophoretic diffusion coefficients. In this way a system is available for the unknowns ρ, v, θ, and ω. Observe that ˙ ∂t (ρω) + ∇ · (ρωv) = ω[∂t ρ + ∇ · (ρv)] + ρ[∂t ω + v · ∇ω] = ρω. 12

See, e.g., [254].

658

10 Micropolar Media

Hence the essential approximations of this system are ε = cθ, q = −k∇θ, J p = −ρ(D B ∇ω + DT ∇θ). A further approximated model is sometimes used where the fluid is assumed to be incompressible and the convection steady. The balance of mass and linear momentum are written in the form ∇ · v = 0,

ρ˙v = −∇ p + 2μ∇ · D,

where p stands for the nanofluid pressure. The condition ∇ · v = 0 is questionable; even with incompressible constituents it is ∇ · v = 0. The balance of energy is based on the balance Eq. (9.58); only the nanoparticle flux is considered and hence the balance is taken in the form ρε˙ = −∇ · (q I + χs hs ), where χs = εs + ps /ρs is the enthalpy of the particles. Moreover it is assumed that ρε˙ = ρcθ˙ and q = q I + χs hs . q I = −k∇θ, Upon substitution and the further approximation that ∇ · (χs hs )  hs · ∇χs the balance of energy is written as [7, 72] ρcθ˙ = ∇ · (k∇θ) − hs · ∇χs . Assuming χs is a function of θ completes the scheme.

Chapter 11

Porous Materials

A porous material (medium) is a substance that contains pores (voids), or spaces, between solid materials through which liquid or gas can flow. Examples of naturally occurring porous media include sand, soil, and some types of stone. Sponges, ceramics, and reticulated foam are also manufactured for porous materials. A porous material is characterized by its porosity, or the size of its pores. Materials with low porosity are less permeable and have smaller pores, making it more difficult for gas and/or liquid to flow through them. It is natural to model porous materials as mixtures of a deformable solid and some fluids (gas, liquid). Also, the solid may be viewed as incompressible or compressible. With this in mind, the present approach to the modelling of porous materials is based directly on the theory of mixtures. Emphasis is given to the model consisting of a solid (skeletal material) and the remaining constituents as fluids. Next Darcy’s law is shown to follow by thermodynamic considerations if stationary conditions hold and the Darcy–Forchheimer equation is reviewed. Within the theory of mixtures special models are established: materials with voids, materials with double porosity, and materials with a thermoelastic solid.

11.1 Porous Materials as Mixtures To model porous materials we refer to a mixture of n constituents some of which are solids and the remaining ones are fluids. Hence, the kinematic scheme developed in Sect. 9.8 is adopted for the n constituents. The dependence on (x, t) ∈  × R is often understood and not written. Let ρα represent the mass of the αth constituent per unit volume of the mixture. We refer to ρα as the bulk density. Meanwhile, we call the true density of the αth constituent, denoted by ρ0α , the mass of the αth constituent per unit volume of the αth © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_11

659

660

11 Porous Materials

constituent. Let ϕα be the volume fraction occupied by the αth constituent. Hence the two mass densities are related by ρα = ϕα ρ0α . Since ϕα represents the volume of the αth constituent, per unit volume of the mixture, then  α ϕα ≤ 1. If equality holds we say that the mixture is saturated. Otherwise a void fraction occurs and the mixture is said to be unsaturated. If one of the constituents, say α = 1, is a solid and the other constituents are fluids then f =

n

α=2 ϕα

is the porosity. Fluid mixtures are miscible if they form homogeneous solutions. Fluid mixtures are immiscible if the constituents rest separately also locally (like oil and water). Mathematically, we say that the mixture is miscible if the constitutive properties of the mixture are independent of the volume fractions. Instead we say that the mixture is immiscible if the properties are affected by the volume fractions ϕα . The αth constituent is incompressible if the true density ρ0α is a constant whereas ρα need not be a constant in that is affected by the motion of the constituent. The mixture is said to be incompressible whenever the true densities ρ01 , ρ02 , ..., ρ0n are constants. Yet the mixture as a single body need not be incompressible even though ρ01 , ρ02 , ..., ρ0n are constants. It may be convenient to formulate the balance equations in terms of material coordinates of one of the constituents. For definiteness let the solid be the first constituent (α = 1) and state the balance of mass relative to the observer following the motion of the solid. By the balance of mass ∂t ρα + ∇ · (ρα vα ) = τα we can write ∂t ρ1 + ∇ · (ρ1 v1 ) = τ1 and, for β = 2, 3, ..., n, ∂t ρβ + ∇ · (ρβ v1 ) + ∇ · [ρβ (vβ − v1 )] = τβ . The operator D1 = ∂t + v1 · ∇ is the peculiar time derivative relative to the first constituent. Hence we have

11.2 Constitutive Models of Porous Materials

661

D1 ρβ + ρβ ∇ · v1 + ∇ · [ρβ (vβ − v1 )] = τβ . Multiplying by J1 = det F1 > 0 and observing that D1 J1 = J1 ∇ · v1 we can write D1 (ρβ J1 ) + J1 ∇ · [ρβ (vβ − v1 )] = J1 τβ . For any vector field h(x, t) the identity J1 ∇ · h = ∇X1 · (J1 F1−1 h)

(11.1)

holds where ∇X1 is the gradient relative to the reference coordinates X1 of the first constituent. Hence we have D1 (J1 ρβ ) − ∇X1 · [J1 ρβ F1−1 (vβ − v1 )] = J1 τβ .

(11.2)

Now, J1 ρβ is the mass of the βth constituent per unit volume in the reference configuration of the first constituent. The quantity J1 ρβ F1−1 (vβ − v1 ) is the mass flux of the βth constituent into a material element of the first constituent. The proof of (11.1) is based on a direct application of Nanson formula. For any region P ⊂  we can apply the divergence theorem to obtain ∫ h · n da = ∫ ∇ · h dv = ∫ J1 ∇ · h d V1 , P

∂P

P1

where P1 = χ−1 1 (P). Moreover, by using Nanson formula and the divergence theorem we find ∫ h · n da = ∫ J1 h · F1−T n R d A1 = ∫ ∇X1 · J1 F1−1 h d V1 .

∂P

Consequently

∂P1

P1

∫ J1 ∇ · h d V1 = ∫ ∇X1 · (J1 F1−1 h) d V1 .

P1

P1

The arbitrariness of P1 provides the result.

11.2 Constitutive Models of Porous Materials For definiteness we let the first constituent (skeletal material) be a solid and the remaining constituents be fluids. The mixture is immiscible and hence we allow the constitutive properties to be affected by the volume fractions. Denote by the subscript

662

11 Porous Materials

s the quantities pertaining to the solid (first constituent). Let s = ρ0s be the reference mass density of the solid, s = ρs Js . Hence we let ψs = ψs (θs , s , Fs , ϕs , ∇θs , ∇Fs ),

ψβ = ψβ (θβ , ρβ , ϕβ , ∇θβ , Dβ ),

and the like for Ts , Tβ , qs , qβ , where β = 2, 3, ..., n. The interaction terms τα , mα , eα are allowed to depend on all of the volume fractions and the velocity differences. Hence mα = mα (θs , s , Fs , ∇θs , ∇Fs , θβ , ρβ , ϕβ , ∇θβ , Dβ , vα − vβ ) and the like for τα and eα . Substitution in (9.42) results in 

α {−

ρα ρα ` − ρβ ∂ ψ · D ` α + ...} ≥ 0, (∂θα ψα + ηα )θ`α − ∂∇θα ψα · ∇θ α D β θα θα θβ β

` ,D ` β . The linear dependence the dots representing quantities independent of θ`α , ∇θ α ` ` β , at any point x and time t, imply that the entropy and arbitrariness of θ`α , ∇θα , D inequality holds only if the corresponding coefficients vanish, whence ∂Dβ ψβ = 0, Since

∂∇θα ψα = 0,

ηα = −∂θα ψα .

(11.3)

Ls = F` s Fs−1 ,

the remaining inequality can be written as −

ρs ` ) + 1 (T F−T ) · F` + ... ≥ 0 (∂s ψs `s + ∂Fs ψs · F` s + ∂∇Fs ψs · ∇F s s s s θs θs

` . Since the dots representing quantities independent of F` s and ∇F s `s = Js τs then ∂s ψs `s has to be examined along with the other terms involving τs . Moreover, ` = ∇ F` − LT ∇ F . ∇F s s s s The arbitrariness of ∇ F` s implies that ∂∇Fs ψs has to vanish. Moreover, the arbitrariness of F` s implies that (11.4) Ts = ρs ∂Fs ψs FsT .

11.2 Constitutive Models of Porous Materials

663

Granted (11.3) and (11.4), we now restrict the analysis of the remaining properties to the case where each constituent has the same temperature, say θ. Since ψs = ψs (θ, s , Fs , ϕs ),

ψβ = ψβ (θ, ρβ , ϕβ ),

in view of (11.3) and (11.4) we can write the entropy inequality (9.42) as  −ρs ∂ϕs ψs ϕ` s + nβ=2 Tβ · Lβ − ρβ (∂ρβ ψβ ρ`β + ∂ϕβ ψβ ϕ` β ) − ρs ∂s ψs Js τs  1 + nα=1 [−τα (ψα − 21 vα2 ) + eα − mα · vα − qα · ∇θα + θα ∇ · kα ] ≥ 0. θα  Since α eα = 0 then we let 

1 2 α [−τα 2 vα

+ mα · vα ] =: ({vα })

as in the general case of mixtures. The formal novelty of the modelling of porous media is the occurrence of the time derivative of the volume fractions ϕα . Very often the volume fractions are taken to obey a rate law as with materials with internal variables. The assumption then might be ϕ` α = ϕˆ α (θ, s , Fs , ∇θ, ϕs , ∇Fs , ρβ , ϕβ , Dβ , vα − vβ ). Let

Tβ = − pβ (θ, ρβ , ϕβ )1 + T˜ β ,

T˜ β approaching zero as Lβ approaches zero. Substitution of ρ`β = −ρβ ∇ · vβ + τβ yields   − α ρα ∂ϕα ψα ϕˆ α − β ( pβ − ρ2β ∂ρβ ψβ )∇ · vβ − τs ψs − s ∂s ψs τs    − β τβ (ψβ + ρβ ∂ρβ ψβ ) −  + β T˜ β · Lβ − α (qα · ∇θ − θ∇ · kα ) ≥ 0 For definiteness, let ∇ · kα and  be independent of the velocity gradients. Hence the inequality holds only if pβ = ρ2β ∂ρβ ψβ , Let μα =

β = 2, 3, ..., n.

 ψ +  ∂ ψ , α = 1, s s s s ψβ + pβ /ρβ , α = β = 2, 3, ..., n.

The remaining inequality can be written as −



α (ρα ∂ϕα ψα

ϕˆ α + μα τα ) −  +



˜ · Lβ −

β Tβ



α (qα

· ∇θ − θ∇ · kα ) ≥ 0. (11.5)

664

11 Porous Materials

Simple models arise by letting each term in (11.5) be non-negative. Such is the case if kα = 0, T˜ β and qα are given as in the Navier–Stokes–Fourier model,  is non-positive, and  α μα τα ≤ 0, 

ˆα α ρα ∂ϕα ψα ϕ

≤ 0.

(11.6)

11.2.1 Darcy’s Law Darcy’s law describes the flow of a fluid through a porous medium. The law was established by H. Darcy on the basis of experiments on the flow in stationary conditions. In homogeneous rectilinear flows, if p(xb ) − p(xa ), p(xb ) < p(xa ), is the pressure difference across a cylinder of section A and length L the mass Q flowing per unit time, from xa to xb , is Q=

kA ( pb − pa ); μL

k is the permeability and μ is the viscosity of the fluid. In local form the law for the flux, or relative velocity,1 h is assumed in the form k h = − ∇ p. μ

(11.7)

We now look for a derivation of Darcy’s law within continuum mechanics. The equation of motion for any (fluid) constituent reads ρβ v` β = ∇ · Tβ + ρβ bβ + mβ − τβ vβ . We specialize the model by letting τβ = 0 (non-reacting fluid) and letting Tβ be a pressure tensor, Tβ = − pβ 1. In stationary conditions, we have 0 = −∇ pβ + ρβ bβ + mβ . The standard model of the growths mβ , mβ = satisfies

1





β mβ

α Mβα (vα

= 0,

− vβ ),

≤0

In many contexts of application of Darcy’s law the flux is denoted by q.

(11.8)

11.2 Constitutive Models of Porous Materials

665

provided each tensor Mβα is symmetric and positive definite. This is so because Mβα (vα − vβ ) is skew in α, β and hence 

β mβ

=



α,β Mβα (vα

− vβ ) = 0.

Moreover, 

β mβ

· uβ =



  · Mαβ (uα − uβ ) = α,β uβ · Mβα uα − 21 α,β uβ · Mβα uβ   − 21 α,β uα · Mβα uα = − 21 α,β (uα − uβ ) · Mβα (uα − uβ ).

α,β uβ

Hence  ≤ 0 if and only if each tensor Mαβ is positive semi-definite. The next step is the assumption that Mβα = 0 only if α = s = 1. Consequently mβ = Mβs (vs − vβ ), whence

and, by (11.8),

−1 mβ , vβ − vs = −Mβs

−1 [−∇ pβ + ρβ bβ ]. vβ − vs = −Mβs

It is reasonable to let Mβs be proportional to the volume fraction ϕβ , Mβs = ϕβ β . Hence, we can write vβ − vs = −

β [∇ pβ − ρβ bβ ]. ϕβ

(11.9)

The tensor β is called the mobility of phase β. The local form of Darcy’s law follows by neglecting the body force and letting β = ϕβ (k/μ)1 while h = vβ − vs .

11.2.2 Darcy-Like Equations In modelling the unsaturated groundwater flow, it is common to apply Darcy’s law along with some approximations. It is assumed that water and air flow within the soil but the flow equation for air is neglected and the air pressure is taken to be equal to the atmospheric pressure at the surface, p A . The volume fraction of water, say ϕw , is

666

11 Porous Materials

also called the soil’s moisture content (or water content). Let ρ0w be the true2 mass density of water so that ρw = ϕw ρ0w is the mass density. Look at the continuity equation ∂t (ϕw ρ0w ) = −∇ · (ϕw ρ0w vw ). The pressure head P in the water is defined by P=

pw − p A . ρ0w g

Hence ∇ pw − ρw bw = g[∇(ρ0w P) + ϕw ρ0w ∇z]. Upon identifying vw − vs with vw and then substituting vw from Darcy’s law (11.9), we have ∂t (ϕw ρ0w ) = ∇ · {ρ0w gw [∇(ρ0w P) + ϕw ρ0w g∇z]}. Letting ρ0w be taken as a constant, we have ∂t ϕw = ∇ · [K(∇P + ϕw ∇z)]. The tensor K = ρ0w gw is called the hydraulic conductivity of the soil. To get an equation in which P is the unique unknown it is assumed that ϕw is a function of P so that C(P)∂t P = ∇ · [K(P)(∇P + ϕw (P)∇z)], where C(P) = dϕw /dP is the specific moisture capacity. The one-dimensional version is known as Richards’ equation [4, 378]. Equation (11.7) is improved to get a better fit in applications. In geological engineering, the flow is described as h=−

k eff ∇ p, μ

k eff = k(1 + b/ p),

b being known as the Klinkenberg parameter; it shows that the flow decreases as the (mean) pressure p increases. 2

Or intrinsic.

11.3 Special Models of Porous Materials

667

Nonlinear effects are modelled by the Darcy–Forchheimer equation. Qualitatively the idea is to allow for a nonlinear term in q. In general the equation reads μ ∇ p = − h + ρb − ck −1/2 |h|h, k the body force b depending on the domain of application. Two further equations are used in the literature. One is just τ h˙ + h = −(k/μ)∇ p, with the same form of the Maxwell–Cattaneo equation for the heat flux. The other is Brinkman equation −βh + h = −(k/μ)∇ p, where β is a viscosity parameter. It is justified by viewing the relative velocity h as proportional to the divergence of the stress − p1 + (βμ/k)∇h.

11.3 Special Models of Porous Materials Though the constraint (11.6) on the volume fractions looks quite general, operative models require that we specify the functions ϕˆ α . For definiteness we restrict attention to incompressible constituents. As before, the first constituent is a solid. In incompressible constituents the true density ρ0α is constant and this allows us to establish a simple evolution equation for the volume fraction ϕα . Since ρ0α ϕα = ρα =

α , Jα

by (9.13) we have ϕ` α = −ϕα ∇ · vα +

1 τα . ρ0α

(11.10)

Equation (11.10) is remarkable in that it shows how ϕα changes as a consequence of the chemical reactions, via τα , and the motion, via ∇ · vα . In incompressible constituents, α = ρ0α Jα ϕα is a constant and hence only two variables among α , Jα , and ϕα are independent. Let the remaining constituents, β = 2, 3, ..., n, be incompressible fluids. We allow the constitutive properties to be affected by the volume fractions and let3 3

The model is developed in [326].

668

11 Porous Materials

ψs = ψs (θs , Fs , ϕs , ∇θs , ∇Fs ),

ψβ = ψβ (θβ , ϕβ , ∇θβ , Dβ ),

and the like for Ts , Tβ , qs , qβ , where β = 2, 3, ..., n. The interaction terms τα , mα , eα are allowed to depend on all of the volume fractions and the velocity differences. Hence mα = mα (θs , Fs , ϕs , ∇θs , ∇Fs , θβ , ϕβ , ∇θβ , vα − vβ ) and the like for τα and eα . By arguing as in the previous section we find that, because of the entropy inequality (9.42), the free energy functions simplify to ψs = ψs (θs , Fs , ϕs ),

ψβ = ψβ (θβ , ϕβ )

while ∂θs ψs = −ηs , ∂θβ ψβ = −ηβ . The entropy inequality then becomes −

  ρβ 1 1 ρs (∂Fs ψs · F` s + ∂ϕs ψs ϕ` s ) − nβ=2 ∂ϕβ ψβ ϕ` β + Ts · Ls + nβ=2 Tβ · Lβ θs θβ θs θβ  1 1 + nα=1 (τα θα ηα − qα · ∇θα + εˆα + θα ∇ · kα ) ≥ 0. θα θα

In view of (11.10) and the relation F` s = Ls FsT , we obtain  1 1 (Ts − ρs ∂Fs ψs FsT + ρs ϕs ∂ϕs ψs 1) · Ls + nβ=2 (Tβ + ρβ ϕβ ∂ϕβ ψβ 1) · Lβ θs θβ  1 1 + nα=1 (ζα τα − qα · ∇θα + εˆα + θα ∇ · kα ) ≥ 0, θα θα where ζα = θα ηα − ϕα ∂ϕα ψα . Now, Ts is independent of Ls whereas Tβ is allowed to depend on ∇θβ and Dβ . For definiteness, we assume that Tβ is expressed by v Tβ = Tnv β (θβ , ϕβ ) + Tβ (θβ , ϕβ , ∇θβ , Dβ ),

where Tvβ is subject to Tvβ → 0 as Dβ → 0. Moreover let Tˆ s = Ts − ρs ∂Fs ψs FsT + ρs ϕs ∂ϕs ψs 1. Consequently we find   1 ˆ 1 1 1 + Tvβ ) · Lβ + nα=1 (ζα τα − qα · ∇θα + εˆ α + θα ∇ · kα ) ≥ 0. Ts · Ls + nβ=2 (Tnv θs θβ β θα θα

11.3 Special Models of Porous Materials

669

This inequality holds if Tˆ s = 0, Tnv β = −ρβ ϕβ ∂ϕβ ψβ 1, and n n   1 v 1 1 T · Lβ + (ζα τα − qα · ∇θα + εˆα + θα ∇ · kα ) ≥ 0. θβ β θ θ α α=1 α β=2

In particular, we can find the Navier–Stokes–Fourier model Tvβ = 2νβ Dβ + λβ ∇ · vβ 1, qα = −κα ∇θα , where νβ ≥ 0, 2νβ + 3λβ ≥ 0, κα ≥ 0 and n

α=1

1 (ζα τα + εˆα + θα ∇ · kα ) ≥ 0. θα

To sum up, we can model the skeletal porous solid and the n − 1 fluids by εs = ψs (θs , Fs , ϕs ) − θs ∂θs ψs (θs , Fs , ϕs ), εβ = ψβ (θβ , ϕβ ) − θβ ∂θβ ψβ (θβ , ϕβ ), Ts = ρs ∂Fs ψs FsT − ρs ϕs ∂ϕs ψs 1, Tβ = −ρβ ϕβ ∂ϕβ ψβ 1 + 2νβ Dβ + λβ ∇ · vβ 1,

qα = −κα ∇θα .

Both in solids and fluids we can view pα := ρα ϕα ∂ϕα ψα as the (isotropic) pressure. However, in fluids this view is classical in that ρα ϕα ∂ϕα ψα = ρ2α ∂ρα ψα . In solids, instead, the view of ρs ϕs ∂ϕs ψs as the pressure is purely formal because the additive term ρs ∂Fs ψs FsT may comprise an isotropic tensor. Look now at the case where each constituent has the same temperature, say θ. Hence, we have n α=1 (ζα τα + εˆα + θα ∇ · kα ) ≥ 0. Substitution of εˆα from (9.25) results in

670

11 Porous Materials

n

α=1 [(ζα

− εα )τα − pˆ α · uα − 21 τα uα2 + θα ∇ · kα ] ≥ 0.

Since θα = θ we can write ζα − εα = −ψα − ϕα ∂ϕα ψα , whence ζα − εα = −(ψα + pα /ρα ) =: −μα . We may regard μα as the chemical potential. Since kα may be taken to be zero we obtain the standard form of the reduced inequality, n

α=1 (μα τα

+ pˆ α · uα + 21 τα uα2 ) ≤ 0.

11.4 Materials with Voids By analogy with the previous scheme of porous materials, we now look at porous solids in which the skeletal material is elastic and incompressible whereas the interstices are void of material. Owing to the voids, only the solid is a material constituent and hence we use a superposed dot to denote the time derivative following the motion of the solid. The solid is regarded as incompressible so that ρ = ϕρ0 , the true mass density ρ0 being constant.4 We assume that the interaction terms between solid and voids are zero, namely, τ , m, e are zero. Hence it follows that ρ and ϕ satisfy the same equation, ρ˙ = −ρ∇ · v,

ϕ˙ = −ϕ∇ · v.

The remaining balance equations are formally the classical ones for a single body, say ρ˙v = −∇ · T + ρb, ρε˙ = T · D − ∇ · q + ρr. The effects of the voids on the solid are modelled by the constitutive equations. Let  = (θ, F, ϕ, ∇θ, ∇ϕ, ϕ) ˙ 4

In this section, the subscript s is understood and not written.

11.4 Materials with Voids

671

be the set of variables so that ψ, η, ε, T, q, k are functions of . The dependence on the time derivative is considered to allow for inertial properties in the evolution of ϕ. Let ψ, q, k be differentiable and the remaining functions be continuous. The second-law inequality can be written as ˙ − ρ∂ ψ · ∇ϕ ˙ − ρ∂ ψ ϕ¨ −ρ(∂θ ψ + η)θ˙ − ρ∂F ψ · F˙ − ρ∂ϕ ψ ϕ˙ − ρ∂∇θ ψ · ∇θ ∇ϕ ϕ˙ 1 +T · D − q · ∇θ + θ∇ · k ≥ 0. θ ˙ and ∇ϕ ˙ to obtain By means of (9.32) we can replace ∇θ − ρ(∂θ ψ + η)θ˙ − ρ∂F ψ · F˙ − ρ∂ϕ ψ ϕ˙ − ρ∂∇θ ψ · ∇ θ˙ − ρ∂∇ϕ ψ · ∇ ϕ˙ − ρ∂ϕ˙ ψ ϕ¨ 1 +ρ∂∇θ ψ · LT ∇θ + ρ∂∇ϕ ψ · LT ∇ϕ + T · D − q · ∇θ + θ∇ · k ≥ 0. θ (11.11) Time differentiation of ϕ˙ = −ϕ∇ · v and use of (9.33) yield ϕ¨ = −ϕ∇ ˙ · v − ϕ∇ · v˙ + ϕ(LT ∇) · v. Now, at any point x ∈  and time t, ∇ · v˙ can take any arbitrary value, independent of  and ∇ ϕ, ˙ merely because ∇ · b is regarded to be given by (the divergence of) the equation of motion (9.24)2 . Likewise θ˙ and ∇ θ˙ can take arbitrary values because r (x, t) and ∇r (x, t) are regarded to be given by the energy balance equation (9.24)3 . ˙ ∇ θ, ˙ and ϕ, Consequently, in view of the linear dependence on θ, ¨ inequality (11.11) holds only if ∂∇θ ψ = 0, ∂ϕ˙ ψ = 0. ∂θ ψ + η = 0, Divide the remaining inequality by θ. Upon some rearrangements we can write 1 ρ [−ρ∂F ψ FT + ρϕ∂ϕ ψ 1 − ϕθ∇ · ( ∂∇ϕ ψ)1 + ρ∇ϕ ⊗ ∂∇ϕ ψ + T] · L θ θ 1 ρ − 2 q · ∇θ + ∇ · (k − ∂∇ϕ ψ ϕ) ˙ ≥ 0. θ θ This suggests that we take k=

ρ ∂∇ϕ ψ ϕ˙ θ

and hence ρ 1 [−ρ∂F ψ FT + ρϕ∂ϕ ψ 1 − ϕθ∇ · ( ∂∇ϕ ψ)1 + ρ∇ϕ ⊗ ∂∇ϕ ψ + T] · L − q · ∇θ ≥ 0. θ θ

The symmetry of T and the linearity of L imply that

672

11 Porous Materials

−ρ∂F ψ FT + ρ∇ϕ ⊗ ∂∇ϕ ψ ∈ Sym, ρ T = ρ∂F ψ FT − ρϕ∂ϕ ψ 1 + ϕθ∇ · ( ∂∇ϕ ψ)1 − ρ∇ϕ ⊗ ∂∇ϕ ψ θ

(11.12)

and q · ∇θ ≤ 0, whence q = −κ()∇θ,

(11.13)

κ() being a non-negative function. It is apparent that a dependence on D from the outset would allow us to describe also viscosity effects. This simple model accounts for the dependence on the volume fraction ϕ of the solid, 1 − ϕ being the porosity, and the volume fraction of the voids. Such dependence results in various effects. The stress tensor comprises −ρϕ∂ϕ ψ 1 as with elastic solids where the dependence on ϕ is allowed. The dependence on ∇ϕ produces additive terms, ρ ϕθ∇ · ( ∂∇ϕ ψ)1 − ρ∇ϕ ⊗ ∂∇ϕ ψ, θ which in principle may occur also in the model of single bodies. It is likely, though, that such dependence is more influential in materials with voids. In a dynamic problem, the unknown functions are the volume fraction ϕ (or the mass density ρ), the velocity v, and the temperature θ. They are determined by the differential equations ϕ˙ = −ϕ∇ · v,

ρ˙v = −∇ · T + ρb,

ρε˙ = T · D − ∇ · q + ρr,

the constitutive equations (11.12), (11.13), and ε = ε(θ, F, ϕ, ∇ϕ), along with appropriate initial-boundary-value data.

11.5 Porous Media with Double Porosity Most frequently the flow in porous media is investigated by letting the body consist of a system of similar-sized pores connected by a single pore network as is done in the previous sections. Yet many geo-materials such as aggregated soils or fissured rocks show two or more dominant pore scales connected by multiple pore networks that display different orders of volume fractions. Here the porous medium is treated as a mixture of n constituents. The role of constituent refers to both the porous solid and the pore network. The modelling via the mixture theory makes it natural to describe

11.5 Porous Media with Double Porosity

673

the interacting processes. In particular, the mixture is reacting in that there can be a mass transfer between the pore networks. We keep following the standard notation for mixtures and, for ease of reading, we review the balance equations. The local form of the balance of mass, linear momentum, and energy of the αth constituent can be written in the form (9.24), i.e. ρ`α + ρα ∇ · vα = τα , ρα v` α − ∇ · Tα − ρα bα = pˆ α , ρα ε`α − Tα · Lα + ∇ · qα − ρα rα = εˆα . In terms of the supplies τα , pˆ α , εˆα , the overall balance for the mixture becomes 

α τα

= 0,



ˆα αp

+ τα vα = 0,



α εˆα

+ pˆ α · vα + τα (εα + 21 vα2 ) = 0.

As a simplifying assumption, we let Tα be symmetric so that Tα · Lα = Tα · Dα . Moreover, we have in mind to investigate constitutive models where the extra-entropy fluxes {kα } vanish. Hence, the entropy inequality can be written as 

1 1 1 2 ` ` α { θ [−ρα (ψα + ηα θα ) − τα (ψα − 2 vα ) + eα − mα · vα + Tα · Dα − θ qα · ∇θα ]} ≥ 0, α α

where {eα } and {mα } are subject to 

α eα

= 0,



α mα

= 0.

For definiteness and simplicity, let all the constituents have the same temperature, θα = θ. Hence, the entropy inequality can be given in the form 

` + ηα θ`α ) − τα ψα − pˆ α · uα − 1 τα u2 + Tα · Dα − 1 qα · ∇θ]} ≥ 0, α 2 θ (11.14) where, by (9.18), pˆ α = mα − τα vα . We now specify the properties of the porous material. α [−ρα (ψα

11.5.1 Porous Material with Undeformable Solid There are two pore networks and the skeleton is a rigid material; since the solid is rigid the corresponding balance of linear momentum is trivially satisfied. The constituents 1 and 2 are the two pore networks. To fix ideas, the constituents 1 and 2 are the macro-pore and micro-pore networks. The porosities (or volume fractions) in the macro- and micro-pore networks are denoted by ϕ1 and ϕ2 . The true (or seepage) velocity vector fields are denoted by v1 and v2 whereas v˜ α = ϕα vα ,

α = 1, 2,

674

11 Porous Materials

are the discharge (or Darcy) velocities. If ρ0 is the true mass density of the fluid then the mass densities in the macro- and micro-networks are ρα = ϕα ρ0 ,

α = 1, 2.

The fluid is incompressible and hence ρ0 is constant. Since the skeleton is rigid the geometry of the pore networks is fixed and hence ∂t ϕα = 0,

α = 1, 2.

Hence the continuity equation ∂t ρα + ∇ · (ρα vα ) = τα can be written as ρ0 ∇ · (ϕα vα ) = τα . Moreover, ϕ` α = vα · ∇ϕα and then each continuity equation results in (11.10), ϕ` α =

τα − ϕα ∇ · vα . ρ0

It is worth pointing out that in a mixture of incompressible materials the velocity fields need not be divergence free. Irrespective of the value of τα , a nonzero value of ∇ · vα is made possible by ϕ` α = 0. It is customary to assume that the fluid can be exchanged between the two pore networks if there exists a sufficient difference of pressure between the networks. Formally, the mass transfer is often modelled by the relation β τ1 (x) = − ρ0 ( p1 (x) − p2 (x)), μ

(11.15)

where μ is the viscosity while β is a dimensionless characteristic of the porous medium. This relation traces back to Barenblatt et al. [29] and was derived by means of a dimensional analysis argument. Thermodynamic restrictions on the constitutive equations are now derived, via the second-law inequality, by assuming that ψα , ηα , Tα , qα are functions of θ, ϕα , Dα , ∇θ, while pˆ α and τα may depend on quantities pertaining to both constituents. Moreover, we let Tˇ α = O(Dα ). Tα = − pα (θ, ϕα )1 + Tˇ α ,

11.5 Porous Media with Double Porosity

675

By a routine argument we can show that ψα is independent of Dα and ∇θ. To save writing we then start with the constitutive assumption ψα = ψα (θ, ϕα ) and observe ψ` α = ∂θ ψα (∂t θ + vα · ∇θ) + ∂ϕα ψα ϕ` α . Upon substitution in the second-law inequality (11.14), we have 

α [−ρα (∂θ ψα

τα + ηα )θ`α − ρα ∂ϕα ψα ( 0 − ϕα ∇ · vα ) − τα ψα − pα ∇ · vα ] + ζ ≥ 0, ρ

where θ`α = ∂t θ + vα · ∇θ and ζ=



ˆα α [−p

1 · uα − 21 τα uα2 + Tˇ α · Dα − qα · ∇θ]. θ

The inequality holds if and only if ηα = −∂θ ψα , and



pα = ϕα ρα ∂ϕα ψα ,

α τα (ψα

+

pα ) − ζ ≤ 0. ρα

If, further, τ1 (= −τ2 ) is independent of {vα }, {Dα } and ∇θ then the reduced inequality holds if and only if  pα )≤0 (11.16) α τα (ψα + ρα and ζ ≥ 0. Inequality (11.16) is the standard relation governing the exchanges of mass, ψα + pα /ρα being the α-th chemical potential. If pα /ρα prevails on ψα then we can take the approximate relation 

whence τ1 (

α τα

pα ≤ 0, ρα

p1 p2 − ) ≤ 0. ρ1 ρ2

Consequently, τ1 has the general form τ1 = −

ν p1 p2 ( − ), 0 ρ ϕ1 ϕ2

(11.17)

ν being any positive-valued function of the variables involved. The result (11.17) has a similarity and a difference to the Barenblatt-et-al relation (11.15). The similarity is due to the dependence of τ1 (and τ2 ) on the pressures in the fluid constituents. In this

676

11 Porous Materials

sense ν of (11.17) can be identified with βρ0 /μ of (11.15). The difference is in the quantity p1 /ϕ1 − p2 /ϕ2 rather than p1 − p2 . An insight in favour of (11.17) is that pα ∝ ρα ∝ ϕα . Hence it is the specific quantity pα /ϕα that governs the exchange of mass. As for the positive definiteness of ζ, the classical Navier–Stokes–Fourier model is allowed,  Tα = − pα 1 + 2μDα , α qα = −κ∇θ, where μ and κ are positive. A non-negative value of pair of vectors u1 , u2 follows by letting pˆ α = M



β (uβ



ˆα α [−p

· uα − 21 τα uα2 ] for any

− uα ) − τα uα + 21 τα (u1 + u2 ),

 M being any (symmetric) positive-definite tensor. It follows that α pˆ α + τα vα = 0 and  ˆ α · uα − 21 τα uα2 ] = −(u1 − u2 ) · M(u1 − u2 ) ≤ 0. α [−p

11.5.2 Porous Material with a Thermoelastic Solid The skeletal solid is now allowed to be deformable and, for definiteness, is supposed to be thermoelastic. The porous material is then modelled as a mixture with three constituents, 1 and 2 being the macro-pore and micro-pore fluids while the third constituent is the thermoelastic skeleton. The fluid is still incompressible and hence it is convenient to let ρα = ϕα ρ0 , α = 1, 2. Moreover, Eq. (11.10) holds formally unchanged while now ∂t ϕα = 0. Here we let the subscript s denote the quantities pertaining to the solid (constituent 3). The balance and constitutive equations are significantly affected by the constraints. To begin with, suppose the porous material is saturated. The saturation constraint ϕs +



α ϕα

=1

implies that 

ϕ` s = ∂t ϕs + (vs · ∇)ϕs = −



α [∂t ϕα + vs · ∇ϕα ] = −

` α + (vα − vs ) · ∇ϕα ]. α [ϕ

In addition to be mathematically cumbersome, this constraint seems to be quite unrealistic from the physical point of view. Instead it seems of interest to let the true density of the solid be constant. Hence by ρs = ϕs ρ0s , where ρ0s is constant, it follows that ϕs has to satisfy ϕ` s = −ϕs ∇ · vs .

11.5 Porous Media with Double Porosity

677

Thermodynamic restrictions The constituents 1 and 2 are viscous fluids and hence their state variables are θ, ϕα , Dα , ∇θ,

α = 1, 2.

The solid is thermoelastic and then the state variables of the solid are θ, ϕs , Fs , ∇θ. As in the previous section, we let the free energies ψα and ψs be independent on Dα and ∇θ, ψs = ψs (θ, ϕs , Fs ). ψα = ψα (θ, ϕα ), α = 1, 2; Upon substitution in inequality (11.14) we obtain −ρs (∂θ ψs + ηs )θ` s +



τα + ηα )θ` α − ρα ∂ϕα ψα ( 0 − ϕα ∇ · vα ) − pα ∇ · vα ] ρ  +ρs ∂ϕs ψs ϕs ∇ · vs + (Ts − ρs ∂Fs ψs FsT ) · Ls − α τα ψα − pˆ s · us   ˇ α · Dα ] − 1 ( + α [−pˆ α · uα − 21 τα uα2 + T α qα + qs ) · ∇θ ≥ 0. θ α [−ρα (∂θ ψα

By the standard argument about arbitrariness it follows that ηs = −∂θ ψs ,

ηα = −∂θ ψα ,

Consequently ρα ∂ϕα ψα

pα = ρα ϕα ∂ϕα ψα .

τα pα = τα . 0 ρ ρα

The inequality then simplifies to ρs ∂ϕs ψs ϕs ∇ · vs + (Ts − ρs ∂Fs ψs FsT ) · Ls −



α τα (ψα

+

pα ) − pˆ s · us ρα

 1  + α [−pˆ α · uα − 21 τα uα2 + Tˇ α · Dα ] − ( α qα + qs ) · ∇θ ≥ 0. θ By the arbitrariness of Ls and disregarding cross-coupling terms (Dα ∇θ, uα ∇θ and so on) it follows (11.18) Ts + ρs ϕs ∂ϕs ψs 1 − ρs ∂Fs ψs FsT = 0, 

α τα (ψα

+

pα ) ≤ 0, ρα pˆ s · us +

( 



α qα

ˆα α (p

+ qs ) · ∇θ ≤ 0,

· uα + 21 τα uα2 ) ≤ 0.

Tˇ α · Dα ≥ 0, (11.19)

678

11 Porous Materials

Equation (11.18), namely, Ts = −ρs ϕs ∂ϕs ψs 1 + ρs ∂Fs ψs FsT , means that the stress tensor Ts of the solid is the sum of two terms, one is a pressure tensor, formally the pressure being ρs ϕs ∂ϕs ψs , the other is the standard elastic stress. If ψs depends on Fs through Cs = FsT Fs then ∂Fs ψs FsT = 2Fs ∂Cs ψs FsT . It is apparent that Fs ∂Cs ψs FsT ∈ Sym. If, further, we observe that F = 1 + H = 1 + εF = 1 + ε(1 + εF), then in the linear approximation with respect to ε we can write Cs − 1 (1 + εs )(1 + εs ) − 1 2εs . Hence we have 2Fs ∂Cs FsT ∂εs ψs . Furthermore, if ψs = μs εs · εs + 21 λs (tr εs )2 , then ∂εs ψ = 2μs εs + λs (tr εs )1. The result (11.16) holds here unchanged and this is consistent with the fact that now the solid is elastic but does not exchange mass with the fluid constituents. Hence Eq. (11.17) still follows. Inequality (11.19) holds if the supplies of linear momentum {pˆ β }, β = 1, 2, 3, are given by  (11.20) pˆ β = γ Mβγ (uγ − uβ ) − τβ uβ + 21 τβ (u1 + u2 ), where any tensor Mβγ = Mγβ ,

β, γ = 1, 2, 3

is (symmetric) positive definite while τ3 = 0 and hence τ1 + τ2 = 0. To check the consistency of (11.20), we first observe that 

β mβ =



ˆ β + τβ v β = βp



β,γ Mβγ (uγ − uβ ) +



1 β τβ (vβ − uβ ) + 2 β τβ (u1 + u2 ).

 The sum β,γ Mβγ (uγ − uβ ) vanishes because Mβγ is symmetric, relative to β, γ,  whereas uγ − uβ is skew. The other two sums are trivially zero. Hence β mβ = 0 as is required. Now, to show that (11.19) holds, we compute

11.5 Porous Media with Double Porosity

679

  2 + 1  τ (u + u ) · u . ˆ β · uβ + 21 τβ uβ2 ] = β,γ uβ · Mβγ (uγ − uβ ) − 21 α τα uα α 2 β [p 2 α α 1



Next we observe that 

β,γ uβ

  · Mβγ (uγ − uβ ) = 21 [ β,γ uβ · Mβγ (uγ − uβ ) + β,γ uγ · Mβγ (uβ − uγ )]  = − 21 β,γ (uγ − uβ ) · Mβγ (uγ − uβ ) ≤ 0.

The remaining terms provide a zero net result, 2 2 2 1 1 1 2 α τα (u1 + u2 ) · uα − 2 α τα uα = 2 [τ1 (u1 + u2 ) · (u1 − u2 ) − τ1 (u1 − u2 )] = 0.

If, as is the case in some models exhibited in the literature, pˆ s = 0, the present scheme is valid except 2 for formal changes; (11.20) is nonzero for β = 1, 2 and hence still 3 β=1 becomes β=1 . Evolution equations We now set up the evolution equations. Since we let the fluid and the solid be incompressible it follows that the convenient unknown functions are the volume fractions ϕ1 , ϕ2 , ϕs and the velocities v1 , v2 , vs . Moreover, the temperature θ is a further unknown if the motion is not supposed to be isothermal. The volume fractions are governed by the continuity equations ϕ` α + ϕα ∇ · vα =

τα , α = 1, 2, ρ0

ϕ` s + ϕs ∇ · vs = 0.

(11.21)

The equations of motion take the form ρ0 ϕα v` α = ∇ · Tα + pˆ α + ρ0 ϕα bα , α = 1, 2,

ρ0s ϕs v` s = ∇ · Ts + pˆ s + ρ0s bs . (11.22)

For a linear viscous fluid Tα = − pα 1 + 2μα Dα + λα (tr Dα )1, and for a linear elastic solid Ts = − ps 1 + 2μ¯ α εα + λ¯ α (tr εs )1. The literature shows quite different forms of evolution equations. If ∂t ϕα = 0, ∂t ϕs = 0 then Eq. (11.21) becomes ∇ · (ϕα vα ) =

τα , ρ0

∇ · (ϕs vs ) = 0;

680

11 Porous Materials

these equations coincide, e.g. with the continuity equations determined in [238]. Yet, more often, significantly different equations are considered.5 First, τ1 /ρ0 = −τ2 /ρ0 is supposed to be a constant times the pressure difference p1 − p2 , most likely because of the Barenblatt-et-al equation (11.15). Moreover, ϕ` α (or perhaps ϕ˙ α ) is taken to equal −γα p˙ α − βα ∇ · vs , where γα is said to be the compressibility of pores or fissures while βα is a measure of the change of porosities. Finally, ∇ · vα is evaluated via Darcy’s law vα = −

kα ∇ pα μα

so that ∇ · vα = −(kα /μα )pα . Concerning the equations of motion, an effective stress concept is involved so that the pressures p1 , p2 occur in Ts as the pressure tensor (−β1 p1 − β2 p2 )1 [109, 406]. Moreover, the forces pˆ α of the present approach are disregarded or are taken to be a scalar times ϕ2α Kα−1 vα , Kα being called the permeability [238]. This is questionablein relation to the consistency with the principle of objectivity and the constraint on α pˆ α .

5

See, e.g. [172, 403, 406] and references therein.

Chapter 12

Electromagnetism of Continuous Media

This chapter is devoted to electromagnetism in deformable media. As with any model of material behaviour, the constitutive laws are required to be consistent with the second law of thermodynamics and to comply with the objectivity requirements. Electromagnetism in the matter is obviously based on Maxwell’s equations as the fundamental set of equations governing electromagnetic phenomena. Maxwell’s equations are complemented by the balance equations where the electromagnetic fields enter the balance of linear momentum, angular momentum, and energy. Constitutive laws are needed to express the properties of the material and describe the interaction between the field and matter. Since the electromagnetic fields are dependent on the frame of reference, an appropriate approach to the modelling of electromagnetic media should involve the fields at rest with the body at the point under consideration. Hence attention is mainly restricted to the fields at the frame at rest, at the pertinent point. The models developed in this chapter describe various material properties and applied electromagnetic fields. The general subject of electroelasticity allows us to investigate piezoelectric materials and deformable dielectrics. Magnetoelasticity allows the modelling of magnetic refrigeration. Micromagnetics is viewed as a particle-like scheme leading to rate-type equations for the magnetization. Ferrofluids are modelled as a mixture where the ferromagnetic particles are a micropolar constituent. A magnetoelastic material allows us to describe the effects of an applied magnetic field on the propagation of electric wave fields. Plasmas, i.e. ionized gases with free electrons and ions, are viewed as both binary mixtures and one-fluid thus leading to the magneto-hydrodynamic equations. Further chiral media are investigated through classical models of the literature to explain the phenomenon of optical activity. Ferrites are described as magnetic materials acted upon by a constant field; a superposed circularly polarized wave is found to propagate at different speeds depending on the direction of propagation.

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_12

681

682

12 Electromagnetism of Continuous Media

12.1 A Thermodynamic Setting for Electromagnetic Solids A general scheme is set up for the interaction between electromagnetic fields and elastic deformations in polarizable materials. For generality, the scheme allows for electric conduction, heat conduction, and viscosity. Specific models are then established in the next sections1 at the frame at rest, at the pertinent point. The balance equations of linear momentum, angular momentum, and energy can be written in the form (Sect. 2.16.1) ρ˙v = ∇ · T + ρb + (P · ∇)E + v × (P · ∇)B + ρπ˙ × B ˙ × D + qE + J × B, +μ0 (M · ∇)H − μ0 v × (M · ∇)D − μ0 ρm skwT = skw(P ⊗ E + μ0 M ⊗ H), ˙ + T · L − ∇ · q + ρr, ρε˙ = E · J + ρE · π˙ + ρμ0 H · m

(12.1)

where b is the mechanical body force, per unit mass. Owing to the choice of the co-moving observer, E = E , J = J . Models of electromagnetic media are framed within thermodynamics by requiring that the constitutive equations be compatible with the second law expressed by the standard inequality 1 ρθη˙ + ∇ · q − ρr − q · ∇θ + θ∇ · k ≥ 0. θ Upon substitution of ∇ · q − ρr we obtain ˙ · H + T · L + θ∇ · k ≥ 0. ρθη˙ − ρε˙ + E · J + ρπ˙ · E + ρμ0 m Using the Helmholtz free energy ψ = ε − θη we can write the entropy inequality in the form 1 ˙ + E · J + ρπ˙ · E + ρμ0 m ˙ · H + T · L − q · ∇θ + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) θ For technical convenience we consider the free energy φ defined by φ = ψ − π · E − μ0 m · H. Upon substitution of ψ˙ we obtain the entropy inequality in the form

To avoid obvious ambiguities, in this chapter D denotes the electric displacement (vector) and D the stretching tensor. 1

12.1 A Thermodynamic Setting for Electromagnetic Solids

683

˙ + E · J − P · E˙ − μ0 M · H ˙ + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. − ρ(φ˙ + η θ) θ (12.2) Quite a general model of electromagnetic medium might involve a set of constitutive equations where, e.g. φ, η, J, P, M, T, q, k are functions of F, θ, E, H, ∇θ and their space and time derivatives. For instance, F˙ may occur to model viscous effects, ∇θ to model heat conduction, ∇E and E˙ to describe spatial interaction and memory. Also, q may occur among the independent variables if the constitutive equation for q is a rate-type equation. The free energy φ, whose time derivative occurs in (12.2), is required to be Euclidean invariant and hence can depend on the independent variables through their invariants. In the analysis of the degrees of freedom allowed for the electromagnetic fields, Maxwell’s equations play the role of constraints. In this connection it is worth remarking that, since E, B, J, P, M are relative to the co-moving observer, Maxwell’s equations take the form ∇ · D = q,

∇ · B = 0,



∇ × H = J+ D,



∇×E = − B .

For later use we observe that Maxwell’s equations for the co-moving observer 







can be given the form ∇ × H = J + 0 E + P, ∇ × E = −μ0 (H + M), in addition  to ∇ · (0 E + P) = q, ∇ · (H + M) = 0. Since E= E˙ − LE + (∇ · v)E, and the like for H, appropriate values of ∇ × H an ∇ × E allow us to say that arbitrary prescribed ˙ and E, ˙ at any point x ∈ Rt and any time t, are consistent with Maxwell’s values of H ¨ = (∇ × H − J + 0 LE − 0 (∇ · v)E)˙, then arbitrarequations. Moreover, since 0 E ¨ are allowed, at any point x ∈ Rt and any time t, by ily prescribed values of E˙ and E requiring that ∇ × H and (∇ × H)˙take appropriate values. Remarkably, for undeformable bodies, the recourse to the Poynting vector E × H as the electromagnetic energy flux allows us to state the balance of energy in the form ρε˙ = −∇ · (q + E × H) + ρr. Since, by Maxwell’s equations, ˙ · E, −∇ · (E × H) = −(∇ × E) · H + ∇ × H · E = B˙ · H + J · E + D then Eq. (12.1) is recovered provided we re-define the internal energy, ρε → ρεnew = ρε − 21 (0 E2 + μ0 H2 ).

684

12 Electromagnetism of Continuous Media

12.2 Electroelasticity The body is viewed as a deformable dielectric solid. We then assume the magnetization and the electric current are zero. Hence the electromagnetic fields E, P, H, D, B are B = μ0 H. D = 0 E + P, A wide range of material behaviours is established beyond the standard models of electroelasticity. For instance, flexoelectricity indicates that the dependence on the second gradient of deformation is allowed. By analogy with models involving the dependence on the polarization the dependence on the gradient of the electric field is also considered. Heat conduction might be described via a scheme as in Fourier-type models where ∇θ is an independent variable and q is given by a constitutive function. To get a model compatible with discontinuity wave propagation it is natural to describe heat conduction via a rate-type equation. Here, though, for formal simplicity we ignore heat conduction. Hence the electroelastic solid might be modelled by letting ˙ ∇E, E˙ F, θ, E, ∇F, F, ˙ E˙ is be the set of independent variables. The dependence on the time derivatives F, motivated by requirements of thermodynamic consistency. As to the free energy φ = ψ − π · E − μ0 m · H, the constitutive equation is objective if the dependence is through invariants such as C = FT F, ∇E · ∇E, E = FT E. For definiteness we let ˙ ∇RE , E˙ ). φ = φ(C, θ, E , ∇R C, C, Observe that FT E = EF and hence E = E R , E R being the electric field in the material description. Since J, q, and M are assumed to vanish then the entropy inequality (12.2) simplifies to ˙ − P · E˙ + T · L + θ∇ · k ≥ 0. −ρ(φ˙ + η θ) Upon computation of φ˙ and substitution it follows ˙ − ρ∂C˙ φ · C ˙ − ρ∂E φ · E˙ − ρ∂∇R C φ · ∇R C ¨ −ρ(∂θ φ + η)θ˙ − ρ∂C φ · C −ρ∂∇R E φ · ∇RE˙ − ρ∂E˙ φ · E¨ − P · E˙ + T · L + θ∇ · k ≥ 0. ¨ E¨ , and θ˙ implies The arbitrariness of C,

12.2 Electroelasticity

685

∂C˙ φ = 0,

∂E˙ φ = 0,

η = −∂θ φ.

Hence the inequality reduces to ˙ − ρ∂E φ · E˙ − ρ∂∇ C φ · ∇R C ˙ − ρ∂∇ E φ · ∇R E˙ − P · E˙ + T · L + θ∇ · k ≥ 0. −ρ∂C φ · C R R

Multiply by J/θ and observe J ρ = ρR ,

J ∇ · k = ∇R · k R ,

k R := J F−1 k.

We can then write −

ρR ˙ − ρ R ∂∇ C φ · ∇R C ˙ − J P · E˙ + J T · L + ∇R · k R ≥ 0, ˙ − ρ R ∂∇ E φ · ∇R E ˙ − ρ R ∂E φ · E ∂C φ · C θ θ θ R θ R θ θ

whence −

ρR ˙ − ρ R δE φ · E˙ − J P · E˙ + J T · L + ∇R · (k R − ρ R ∂∇ C φ C ˙ − ρ R ∂∇ E φ E˙ ) ≥ 0, δC φ · C θ θ θ θ θ R θ R

where δ C φ = ∂C φ −

 ρR  θ ∇R · ∂∇R C φ , ρR θ

δE φ = ∂E φ −

 ρR  θ ∇R · ∂∇R E φ ρR θ

can be viewed as the variational derivatives of φ with respect to C and E . This indicates that we let ρR ˙ + ∂∇R E φ E˙ ). k R = (∂∇R C φ C θ The identities

˙ = 2(F  FT ) · D , ˙ =  · (F˙ T F + FT F) ·C

˙ ξ · E˙ = (Fξ) · E˙ + (E ⊗ ξ) · F,

(E ⊗ ξ) · F˙ = [E ⊗ (Fξ)] · L,

hold for any symmetric tensor  and vector ξ. Applying these relations to  = δC φ,

ξ = δE φ

˙ − ρδE φ · E˙ − P · E˙ + T · L ≥ 0 in we can write the reduced inequality −ρδC φ · C the form [T − 2ρF δC φ FT − ρE ⊗ (FδE φ)] · D + [T − ρE ⊗ (FδE φ)] · W − (P + ρFδE φ) · E˙ ≥ 0.

The arbitrariness of W implies skw[T − ρE ⊗ (FδE φ)] = 0.

(12.3)

686

12 Electromagnetism of Continuous Media

Let P = P + ρFδE φ.

T = T − 2ρF δC φ FT − ρE ⊗ (FδE φ),

By (12.3) it follows that T ∈ Sym. The reduced inequality then reads T · D − P · E˙ ≥ 0

(12.4)

while T = 2ρF δC φ FT + ρE ⊗ (FδE φ) + T ,

P = −ρFδE φ + P .

˙ and E˙ and are subject to (12.4). We can view T Hence, T and P may depend on C and P as the dissipative parts of the stress and the polarization. If, as is customary T = 0, P = 0) then it follows in electroelasticity, we neglect dissipative effects (T P = −ρFδE φ,

(12.5)

T = 2ρF δC φ FT − E ⊗ P

(12.6)

skw[T + E ⊗ P] = 0.

(12.7)

while (12.3) becomes

The constraint (12.7) coincides with the balance equation of angular momentum. This shows that the dependence on E via the vector FT E is appropriate to satisfy the balance equation. We wonder whether other dependencies might be consistent with the balance of angular momentum. For definiteness let φ depend on the electric field via the invariant scalars ˙ 2. σ = |∇E|2 , = |E| κ = E2 , So we let

˜ φ = φ(C, θ, κ, σ, ).

Computation of φ˙ and substitution in the entropy inequality yields ˙ − 2ρ∂ φ˜ E˙ · E ˜ · E˙ − 2ρ∂σ φ∇E ˜ ¨ · ∇E −ρ(∂θ φ˜ + η)θ˙ − 2ρ(F ∂C φ˜ FT ) · D − 2ρ∂κ φE −P · E˙ + T · L + θ∇ · k ≥ 0.

¨ and θ˙ implies The arbitrariness of E ∂ φ˜ = 0, ˙ = ∇ E˙ − (LT ∇)E then Since ∇E

˜ η = −∂θ φ.

12.2 Electroelasticity

687

˙ = ∂ E ∂ E˙ − ∂ E L ∂ E = ∇E · ∇ E˙ − (∇E ⊗ ∇E) · D . ∇E · ∇E xi j xi j xi j ki xk j Upon substitution we obtain the inequality in the form ˜ · E˙ − 2ρ∂σ φ∇E ˜ · ∇ E˙ [T − 2ρ F ∂C φ˜ FT + 2ρ∇E ⊗ ∇E] · D − 2ρ∂κ φE −P · E˙ + T · W + θ∇ · k ≥ 0 Divide by θ and observe 2ρ ˜ 2ρ ˜ ˜ ˙ − [∇ · ( 2ρ ∂σ φ∇E)] ˙ ∂σ φ∇E · ∇ E˙ = ∇ · ( ∂σ φ∇E E) · E. θ θ θ Hence we have 1 1 ˜ − θ∇ · ( 2ρ ∂σ φ∇E)] ˜ [T − 2ρ F ∂C φ˜ FT + 2ρ∇E ⊗ ∇E] · D − [P + 2ρ∂κ φE · E˙ θ θ θ 2ρ ˜ 1 ˙ ≥ 0. ∂σ φ∇E E) + T · W + ∇ · (k − θ θ

We then let k=

2ρ ˜ ˙ ∂σ φ∇E E. θ

Moreover the arbitrariness of W implies skwT = 0. Hence letting ˜ = T − 2ρ F ∂C φ˜ FT + 2ρ∇E ⊗ ∇E, T

˜ − θ∇ · ( 2ρ ∂σ φ∇E), ˜ P˜ = P + 2ρ∂κ φE θ

we conclude that ˜ T = 2ρ F ∂C φ˜ FT − 2ρ∇E ⊗ ∇E + T,

˜ + θ∇ · ( 2ρ ∂σ φ∇E) ˜ ˜ P = −2ρ∂κ φE + P, θ

˜ ∈ Sym and P˜ are subject to where T T˜ · D − P˜ · E˙ ≥ 0. This inequality is satisfied if T˜ depends linearly on D , as with viscous materials, and ˙ Yet in electroelasticity the inequality is trivially satisfied by letting P˜ is parallel to E. ˜ ˜ T = 0, P = 0. The symmetry of T, skwT = 0, looks inconsistent with the balance of angular momentum, skw(T + E ⊗ P) = 0.

688

12 Electromagnetism of Continuous Media

In fact the dependence on κ produces a term parallel to E while the dependence on σ results in 2ρ ˜ 2ρ ˜ · ∇E + ( ∂σ φ)E. ∇( ∂σ φ) θ θ Thus, in this scheme, consistency happens if φ depends on E only through κ = E2 , not through σ = |∇E|2 . Consistency holds in general in terms of E by substitution of E˙ = −LT E + F−T E˙ and σ = |∇RE |2 (see Sect. 15.1).

12.3 Electroelastic Materials Piezoelectric materials Piezoelectricity means electricity resulting from mechanical stress. In a normal dielectric the polarization is zero in the absence of an electric field. In a piezoelectric solid, instead, a surface electric charge develops when the solid is subject to a mechanical stress, even in the absence of an electric field. This is called the direct piezoelectric effect. The effect is reversible and the inverse piezoelectric effect consists in a change of shape as a consequence of an applied electric field. There are solids, called pyroelectric solids, where a change of temperature induces an electric polarization (change). Opposite effects on polarization are obtained by increasing or decreasing temperature. Pyroelectric crystals are a subset of piezoelectrics; all pyroelectric crystals are piezoelectrics but not all piezoelectrics show pyroelectricity. The piezoelectric effect is understood as the linear electromechanical interaction between the mechanical and the electric behaviours. Hence, piezoelectricity is the combined effect of the linear electrical behaviour D =  E,  being the second-order permittivity tensor, and the linear stress–strain law T = C S, C being the elasticity fourth-order tensor and S being the infinitesimal strain tensor,2 S = sym∇u. Piezoelectricity is then described by letting T = C S +  E, To avoid any ambiguities with the electric field E and the permittivity , here the infinitesimal strain is denoted by S and not by ε.

2

12.3 Electroelastic Materials

689

D =  S +  E, ,  being third-order tensors. First we ascertain the thermodynamic consistency of this description. As in the general scheme of electroelasticity, the material is taken to be non-conducting (J = 0, q = 0) and non-magnetizable (M = 0). Moreover, for simplicity we first neglect effects described by the gradients ∇C and ∇E. The dependence (of the free energy) on F, θ, E can be modelled via C, θ, FT E or C, θ, E2 , ˆ φ = φ(C, θ, E ),

˜ φ = φ(C, θ, κ),

where E = FT E and κ = E2 . We then find that ˆ P = −ρF∂E φ, or

ˆ T − E ⊗ P, T = 2ρF∂C φF

P = −2ρ∂κ φ˜ E,

Furthermore,

T = 2ρ F∂C φ˜ FT .

˜ η = −∂θ φˆ = −∂θ φ,

(12.8)

(12.9)

k = 0.

Equations (12.8) and (12.9) are nonlinear constitutive relations for P and T in electroelasticity. To model piezoelectricity we look at a linearized version and it is natural to choose the reference configuration as that around which we describe the (linearized) piezoelectric properties; as we see in a moment, the same scheme accounts also for pyroelectricity. Equations (12.9) are inadequate as the starting point for the model of piezoelectricity. The dependence on E2 , rather than on FT E or E, is reasonable if the material is isotropic, which is not the case of piezoelectricity. We assume P → 0 as E and C − 1 approach zero. Hence the stress E ⊗ P has to be neglected in a linear scheme of piezoelectricity. Since C = 1 in the reference configuration, we regard C − 1 as the convenient mechanical variable and observe C − 1 = ∇ R u + (∇ R u)T + o(|∇ R u|) = ∇u + (∇u)T + o(|∇ R u|). Hence, in the linear approximation C − 1  2 S,

E = FT E  E.

Let ˆ E + 21  · γE E + ζ(θ) ·  =: φ(C, φ = g(θ) + 18 (C − 1) · α(C − 1) + 21 (C − 1) · βE θ, E )

where α, β, γ are fourth-order, third-order, and second-order tensors. Moreover ζ(θ) is a temperature-dependent vector. Thus by (12.8) we have

690

12 Electromagnetism of Continuous Media

E], T = ρF[ 21 α(C − 1) + βE

E] − ρFζ, P = −ρF[ 21 β T (C − 1) − γE

the term E ⊗ P in T being ignored because of linearization. Hence, in the linear approximation we have T = ρ R αS + ρ R βE,

P = −ρ R β T S − ρ R γE − ρ R ζ.

In the suffix notation, Ti j = ρ R αi j hk Shk + ρ R βi jk E k ,

Pk = −ρ R βi jk Si j − ρ R γkh E h − ρ R ζk .

Since D = 0 E + P then we can write Dk = −ρ R βi jk Si j + (0 δkh − ρ R γkh )E h − ρ R ζk . We then identify ρ R α with the elasticity tensor C and 0 1 − ρ R γ with the permittivity . Moreover we let P = −ρ R ζ,  = ρ R β, so that we can write T = C S +  E,

D = −T S +  E + P

or Ti j = Ci jln Sln + i jl El ,

Di = −kmi Skm + i j E j + Pi .

By definition, and consistent with (12.8), it follows that α enjoys major and minor symmetry properties, αi j hk = α ji hk = αi jkh = αhki j , the same then being true for C. The permittivity  is symmetric too. The vector P provides the polarization P when the applied electric field E and the stress T are zero. The variation of P on heating is reversed on cooling and this is consistent with P˙ = ∂θP θ˙ when T and E are zero. Indeed, ∂θP can be viewed as the pyroelectric vector, the change of polarization per unit of variation of temperature. The scheme so outlined shows that, in models of piezoelectric materials, the electric field should enter the free energy as a vector (FT E) and not merely via scalars such as, e.g. κ = E2 . This is so because, in a quadratic free energy, no coupling would be possible between deformation and electric field. In a E2 -dependent free energy we can write φ = g(θ) + 18 (C − 1) · α(C − 1) + λE2

12.3 Electroelastic Materials

691

and hence T = CS,

D = E,

without any piezoelectric and pyroelectric effect. Mandel–Voigt notation When dealing with symmetric tensors, Voigt notation allows us to reduce the order and correspondingly simplify the matrix form of calculations. We first observe that a symmetric tensor is characterized by 6 entries which can be viewed as the components of a 6-dimensional vector. Following the Mandel–Voigt notation, the entries of S = ˜ T˜ ∈ R6 in the form S T , T = T T ∈ R3×3 are put in correspondence with S, S˜ = (S1 , S2 , S3 , S4 , S5 , S6 ) := (S11 , S22 , S33 ,



2 S23 ,



2 S13 ,



2 S12 )

and the like for T . Moreover we let ⎤ ⎡ 111 112 113 ⎢ 221 222 223 ⎥ ⎥ ⎢ ⎥ ⎢ 331  √ 332 √333 ⎥ , ˜ νi ] = ⎢ √ [ ⎢ 2 231 2 232 2 233 ⎥ ⎥ ⎢√ √ √ ⎣ 2 131 2 132 2 133 ⎦ √ √ √ 2 121 2 122 2 123 ⎡

C1111 C1122 C1133 ⎢ C C2222 C2233 2211 ⎢ ⎢ C C ⎢ 3311 3322 √ √C3333 [C˜ μν ] = ⎢ √ ⎢ √2 C2311 √2 C2322 √2 C2333 ⎢ ⎣ 2 C1311 2 C1322 2 C1333 √ √ √ 2 C1211 2 C1222 2 C1233

√ √2 C1123 √2 C2223 2 C3323 2 C2323 2 C1323 2 C1223

√ √2 C1113 √2 C2213 2 C3313 2 C2313 2 C1313 2 C1213

⎤ √ 2 C 1112 √ ⎥ √2 C2212 ⎥ ⎥ 2 C3312 ⎥ ⎥. 2 C2312 ⎥ ⎥ 2 C1312 ⎦ 2 C1212

Hence we can write the linear equations in the form ˜ μl El , T˜μ = C˜ μν S˜ν + 

T ˜ ˜ iν Di = − Sν + i j E j .

In addition to the constitutive equations for T and D as (linear) functions of S and E we can change the independent variables based on suitable assumptions. In matrix notation we write ˜ E, T˜ = C˜ S˜ + 

˜ T S˜ +  E. D = −

(12.10)

To obtain the corresponding equations with T˜ and E as independent variables we ˜ and hence assume C˜ is invertible. Then C˜ −1 T˜ = S˜ + C˜ −1 E

692

12 Electromagnetism of Continuous Media

˜ S˜ = C˜ −1 T˜ − C˜ −1 E,

˜ T C˜ −1 T˜ + ( +  ˜ ˜ T C˜ −1 )E. D = −

(12.11)

˜ T C˜ −1  ˜ ∈ R3×3 be invertible. Consequently we can write Let A :=  +  ˜ T C˜ −1 )T˜ − C˜ −1 A ˜ −1 D. ˜ −1  S˜ = C˜ −1 (1l − A (12.12) Finally, let  be invertible. By (12.10) we find ˜ T C˜ −1 T˜ , E = A−1 D + A−1 

˜ T ) S˜ +  ˜ −1 D, ˜ −1  T˜ = (C˜ + 

˜ T S˜ + −1 D. E = −1 

(12.13)

˜ , and  +  ˜ ˜ T C˜ −1  The full equivalence of Eqs. (12.10)–(12.13) holds provided C, are invertible. Now, invertibility of the elasticity matrix C˜ and the permittivity matrix  ˜ is reasonable inasmuch ˜ T C˜ −1  is a customary assumption. The invertibility of  +  ˜ as a matrix with suitably small entries. as we can regard  To account for pyroelectricity, relations (12.10) to (12.13) hold with the formal change Di → Di − Pi . According to the thermodynamic restrictions, in the linear approximation, ∂E φˆ = −

1 P, ρR

∂S φˆ =

1 T. ρR

Hence it follows that ∂S P = −(∂E T)T . Similar (Maxwell) relations follow from the linear equations (12.10)–(12.13). It is apparent that ˜ E), ˜ E) = −(∂ S˜ D)T ( S, ∂ E T˜ ( S,

˜ T˜ , E) = (∂T˜ D)T (T˜ , E), ∂ E S(

˜ T (T˜ , D), ∂T˜ E(T˜ , D) = −(∂ D S)

˜ D). ˜ D) = (∂ S˜ E)T ( S, ∂ D T˜ ( S,

˜ E, to the other To change the independent variables from the original set, here S, ones we could have performed appropriate Legendre transformations. Yet the direct procedure adopted here gives evidence to the assumption in passing from one set to another one. Flexoelectricity and electrostriction Flexoelectricity occurs in dielectrics and shows up as an electrical polarization induced by a strain gradient [127, 227, 394]. From the modelling viewpoint, flexoelectricity is closely related to piezoelectricity. Yet while piezoelectricity accounts for polarization due to a (uniform) strain, flexoelectricity models the occurring of polarization due to a non-uniform strain field.

12.3 Electroelastic Materials

693

Electrostriction occurs in all dielectrics; it describes the strain as a quadratic effect produced by the electric field (E) or the polarization (P). The electrostrictive effect is generally small when compared with that induced by piezoelectricity. However, there are single crystals/ceramics in which a high electrostrictive strain occurs because of the high dielectric response. These materials are referred to as relaxor ferroelectrics or relaxors. While the normal ferroelectrics3 have a hysteresis loop that, at zero electric field, retains a large polarization, in the relaxor this zero field polarization is significantly smaller so that it is reasonable to regard relaxors as nonlinear paraelectrics with high values of permittivity [53, 383]. A description of flexoelectricity and electrostriction requires a scheme where both deformation and deformation gradient are among the independent variables. Hence we let (C, ∇R C, θ, E) be the set of independent variables. Moreover, as with piezoelectricity, we let the free energy φ depend on E via the Euclidean invariant E = FT E. The thermodynamic analysis developed for the general scheme of electroelasticity applies as a particular case. If φ = (C, θ, E , ∇R C) it follows that η = −∂θ , k=

P = −ρF∂E , ρ ˙ ∂∇ C  C, θ R

and T = 2ρF δC  FT − E ⊗ P, where δ C  = ∂C  −

θ ρR ∇R · ( ∂∇R C ). ρR θ

˙ and to obey the It is worth remarking that to allow for a dependence, of k R , on C ˙ as the set of E rule of equipresence, we should have started with C, ∇R C, θ, , and C ˙ and independent variables. We would have shown that  is in fact independent of C then we would have arrived to the same conclusions. Also to adhere to the literature, we now consider T and P in the linear approximation. Since F  1,

C − 1  2 S,

J  1,

ρ  ρR ,

we obtain T  ρδS  − E ⊗ P, while 3

See Chap. 15.

∇R  ∇,

694

12 Electromagnetism of Continuous Media

P  −ρ ∂E . For definiteness we let ρ, θ be uniform and assume ρφ(S, θ, E, ∇S) = 21 S · CS − S · E − ∇S · E − 21 E · γE − 21 ∇S · ∇S, or, in suffix notation, ρ φ = 21 Si j Ci j hk Shk − Si j i j h E h − (∂xh Si j )i j hk E k − 21 E i γi j E j − 21 (∂xi S jk )i jkpqr (∂x p Sqr ).

Further, assume that C, , γ, and  are tensors parameterized by ρ and θ and that C, γ, and  are symmetric. Hence we obtain the constitutive equations Ti j = Ci j hk Shk − i j h E h + i j hk ∂xh E k + ki j pqr ∂xk ∂x p Sqr − E i P j , Pi = Shk hki + k j hi ∂xh Sk j + γi j E j . In compact notation T = CS − E + ∇E − ∇∇S − E ⊗ P,

(12.14)

P = γE + T S +  T ∇S.

(12.15)

The dependence of P on ∇S characterizes the flexoelectric effect [233, 343]. The dyadic product E ⊗ P is neglected in linear approximations. The model for piezoelectricity follows in the event  = 0,  = 0. If  and  vanish then the interaction terms between E and S in the Gibbs free energy do not occur and ρφ takes the more familiar form ρφ = 21 S · CS − 21 E · γE − 21 ∇S · ∇S, The literature4 shows also the use of the version with P and S as independent variables. Since E = γ −1 [P − T S −  T S] then substituting this expression of E in (12.14) we deduce the function of T in terms of S, P, ∇S, ∇∇S. Often the equation for P is given in terms of T and ∇S (see, e.g. [343]). As the simplest stress–strain relation we take S = C−1 T. Upon substitution we obtain the familiar equation of flexoelectricity,

4

See, e.g. [408].

12.4 Dielectrics with Polarization Gradient

695

P = γE + T C−1 T + ∇S, where γE is the classical dielectric term. Quite naturally we wonder whether electrostriction is described by Eqs. (12.14)– (12.15). Since electrostriction denotes the occurrence of a strain as a (quadratic) effect of the electric field, we restrict attention to (12.14)–(12.15) in uniform conditions (∇E = 0, ∇S = 0). If further the tensor  produces negligible effects then we can write T = CS − E ⊗ P, P = γE. Since electrostriction is an effect of the electric field, not of the stress, we look at stress-free conditions, T = 0, and find that S = C−1 E ⊗ γE or Si j = Mi j pr E p Er ,

Mi j pr := Ci−1 j pq γqr .

This result describes the electrostrictive effect and shows that the electrostrictive coefficients Mi j pr are given by the matrix product C−1 and γ. Hence we see that electrostriction is favoured by small values of the elastic coefficients and large values of the permittivity. This result in turn is consistent with the standard observation that high electrostrictive strains occur because of high dielectric permittivities.

12.4 Dielectrics with Polarization Gradient Similar equations hold for dielectrics where the polarization gradient is involved to describe properties of a crystal lattice arising from the discrete structure.5 We assume P transforms as an objective vector, that is P → QP under a change of frame with rotation matrix Q. Hence, to account for the dependence on the polarization, and the polarization gradient, we observe that both F−1 P and FT P are objective. This is so because F−1 P FT P

→ →

(QF)−1 QP = F−1 Q−1 QP = F−1 P, (QF)T QP = FT QT QP = FT P.

For a non-magnetizable and non-conducting body the second-law inequality can be written in the form 5

From the point of view of lattice theory of crystals, the polarization gradient is regarded to model, in the long-wave approximation, the shell-shell and core-shell interactions between atoms [17]; see also [311] and [404].

696

12 Electromagnetism of Continuous Media

˙ + ρπ˙ · E + T · L + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) where π = P/ρ. Multiply the inequality by J/θ and observe J ∇ · k = ∇R · k R , to obtain −

k R = J F−1 k,

ρR ˙ ˙ + ρ R π˙ · E + J (TF−T ) · F˙ + ∇ R · k R ≥ 0. (ψ + η θ) θ θ θ

For definiteness look first at  = F−1 π =

1 −1 1 F P= J F−1 P; ρ ρR

to within the scalar 1/ρ R , the vector  equals the (invariant) polarization P R in the reference configuration. Moreover we have ˙ −1 π. ˙ = F−1 π˙ − F−1 FF  Hence, for any vector function w we can write ˙ ˙ = (F−T w) · π˙ − (F−T w) ⊗ (F−1 π) · F. w·

(12.16)

Let (C, θ, , ∇R ) be the set of independent variables. Computation of ψ˙ and substitution in the entropy inequality results in −

ρR ρR ρR ˙ ˙ − ρ R ∂C ψ · C (∂θ ψ + η)θ˙ + π˙ · E − ∂ ψ ·  θ θ θ θ ρR J ˙ + ∇R · k R ≥ 0. + (TF−T ) · F˙ − ∂∇R  ψ · ∇R  θ θ

The arbitrariness of θ˙ implies the classical relation η = −∂θ ψ. Now observe −

ρR ˙ + [∇R · ( ρ R ∂∇R  ψ)] · . ˙ ˙ = −∇R · ( ρ R ∂∇R  ψ ) ∂∇  ψ · ∇R  θ R θ θ

Moreover, applying the identity (12.16) with w = ∇R · (ρ R ∂∇ R  ψ/θ) and w = ∂ ψ we eventually can write the remaining entropy inequality in the form

12.4 Dielectrics with Polarization Gradient

697

ρR ρR 1 [E − F−T δ ψ] · π˙ + [ TF−T − 2F∂C ψ + (F−T δ ψ) ⊗ (F−1 π)] · F˙ θ θ ρ ρR ˙ ≥ 0. +∇R · (k R − ∂∇R  ψ ) θ ˙ = 2(F∂C ψ) · F˙ are used and where the identities T · L = (TF−T ) · F˙ and ∂C ψ · C δ ψ = ∂ ψ − The inequality holds if kR = E = F−T δ ψ,

θ ρR ∇R · ( ∂∇R  ψ). ρR θ

ρR ˙ ∂∇  ψ  θ R

1 −T TF = 2F∂C ψ − (F−T δ ψ) ⊗ (F−1 π). ρ

(12.17)

Some comments on these results are in order. The dependence on the polarization gradient is compatible with thermodynamics if the extra-entropy flux is nonzero. Moreover, the free energy ψ is a potential for the electric field E through the joint dependence on  and ∇R . The tensor TF−T /ρ is just 1/ρ R times the first PiolaKirchhoff stress T R . Hence we can say that T R and T follow from the potential ψ via the joint contributions of differentiations with respect to C, , ∇R . Indeed, upon right multiplication by ρFT we have T = 2ρ F ∂C ψFT − ρ(F−T δ ψ) ⊗ π. The observation that ρπ = P and F−T δ ψ = E allows us to write T in the form T = 2 ρ F∂C ψFT − E ⊗ P. Hence we can view T as the sum of two tensors. One is the standard symmetric tensor 2 ρ F∂C ψFT of elasticity. The other one is the dyadic product −E ⊗ P. Consequently skwT = −skwE ⊗ P = skwP ⊗ E as it must be in view of (2.73) and therefore confirming the same result (12.6) obtained ˆ within the model given by φ = φ(C, θ, FT E). Things are quite similar if we replace  by P = FT π =

1 J FT P. ρR

We parallel the previous procedure. Observe ˙ w · P˙ = (Fw) · π˙ + (π ⊗ w) · F.

698

12 Electromagnetism of Continuous Media

Take ψ = ψ(C, θ, P , ∇R P ), compute ψ˙ and substitute in the entropy inequality. It follows that the inequality holds if η = −∂θ ψ and kR = E = FδP ψ,

ρR ∂∇ P ψ P˙ , θ R

T = 2 ρ F∂C ψ FT + P ⊗ E.

where δP ψ = ∂P ψ − (θ/ρ R )∇R · ((ρ R /θ)∂∇R P ψ). Often the literature describes the polarization effects by means of (C, θ, π, ∇R π) as the independent variables. The essential difference is due to the direct dependence on π rather than on FT π. Upon computation of ψ˙ and substitution in the entropy inequality we have −ρ(∂θ ψ + η)θ˙ + ρ(E − ∂π ψ) · π˙ − ρ∂∇ R π ψ · (∇ R π) ˙ + (TF−T − 2ρF∂C ψ) · F˙ + θ∇ · k ≥ 0.

The arbitrariness of θ˙ and F˙ implies η = −∂θ ψ,

T = 2ρF ∂C ψ FT .

Now, for technical convenience multiply by J/θ. Hence, in view of the identity −

ρR ρR ρR ∂∇ R π ψ · (∇ R π) ˙ = −∇ R · ( ∂∇ R π ψ π) ˙ + [∇ R ( ∂∇ R π ψ)] · π˙ θ θ θ

and the definition δπ ψ = ∂π ψ − (θ/ρ R )∇ R · ((ρ R /θ)∂∇ R π ψ) we have ρR ρR [E − δπ ψ] · π˙ + ∇ R · [k R − ∂∇ R π ψ π] ˙ ≥ 0. θ θ This inequality holds if E = δπ ψ,

kR =

ρR ∂∇ π ψ π. ˙ θ R

Thus the model of dielectric is consistent with the dependence on the polarization gradient ∇R π if the thermodynamic restrictions η = −∂θ ψ,

T = 2ρF ∂C ψ FT ,

E = δπ ψ

hold.6 In the linear approximation (ρ  ρ R , ∇ R  ∇, C  2S) we have Here skwT = 0. Since ψ is taken to depend directly on π and hence via the invariant π 2 , not e.g. via FT π, then E is parallel to π and skwE ⊗ P = 0, which is consistent with skwT = 0.

6

12.5 Fluids in Electromagnetic Fields

η = −∂θ ψ,

699

T = ρ R ∂S ψ,

E = δπ ψ.

The literature shows further approaches which are based on variational principles rather than on the second-law inequality. The variational principles trace back to a paper by Toupin [420] that has deeply influenced the subsequent research.7 By means of the electric enthalpy H = W − E · D, W being the energy density, the principle is assumed to assert that −δ ∫ H dv + ∫(f · δu + E · δP)dv + ∫ t · δu da V∗

V

S

where V ∗ is a volume comprising V and S = ∂V while V ∗ \ V is an outer vacuum. Hence the variation of the electric enthalpy equals the work of the body force, the electric field, and the surface traction. In isothermal conditions, an appropriate enthalpy provides the equation of motion and8 E = ∂P W − ∇ · ∂∇P W, where W = W (S, P, ∇P). Another approach establishes the balance equations by assigning a kinetic energy density 21 ρ d π˙ 2 to the time-dependent polarization [304] thus providing a second-order differential equation for π. A further approach [404] involves both a variational principle and micro-force fields. Following the work of Gurtin [215], a micro-force tensor ξ, an internal micro-force τ and an external micro˙ is the power density expended across force γ are considered such that ξ · (n ⊗ P) ˙ ˙ surfaces, while τ · P and γ · P are the internal and external power densities (per unit volume).

12.5 Fluids in Electromagnetic Fields We start from the balance of energy in the form (2.75) with the fields in the co-moving frame, ρε˙ = E · J + ρ(P/ρ)˙· E + ρμ0 (M/ρ)˙· H + T · L − ∇ · q + ρr. and consider the second-law inequality 1 ρθη˙ + ∇ · q − ρr − q · ∇θ + θ∇ · k ≥ 0. θ Upon substitution of ∇ · q − ρr we obtain 7 8

See, e.g. [290, 312]. See Eq. (2.3) of [311].

700

12 Electromagnetism of Continuous Media

ρθη˙ − ρε˙ + E · J + ρ(P/ρ)˙· E + ρμ0 (M/ρ)˙· H + T · L + θ∇ · k ≥ 0. By means of the free energy ψ = ε − θη we can write the second-law inequality in the form ˙ + E · J + ρ(P/ρ)˙· E + ρμ0 (M/ρ)˙· H + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) θ

For technical convenience we consider the free energy (density) φ defined by μ0 1 φ = ψ − P · E − M · H. ρ ρ Upon substitution of ψ˙ we obtain the second-law inequality in the form ˙ + E · J − P · E˙ − μ0 M · H ˙ + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. − ρ(φ˙ + η θ) θ (12.18) For definiteness we consider heat-conducting, viscous fluids. We then let the constitutive equations have the form φ = φ(ρ, θ, E, H, ∇θ, D ), and the like for η, P, M, J, T, q, where D denotes the stretching tensor. For the present purpose there is no loss of generality in letting the extra-entropy flux k be zero. The function φ is differentiable while the other constitutive functions are continuous. Moreover we let ˇ T = − p(ρ, θ)1 + T(ρ, θ, E, H, ∇θ, D),

D). Tˇ = O(D

Upon substitution of φ˙ we have ˙ +∂ φ·D ˙] ˙ + ∂∇θ φ · ∇θ −ρ[∂ρ φ ρ˙ + (∂θ φ + η)θ˙ − ρ[∂E φ · E˙ + ∂H φ · H D ˙ − p∇ · v + Tˇ · L − 1 q · ∇θ ≥ 0. +E · J − P · E˙ − μ0 M · H θ By the continuity equation − p∇ · v =

p ρ. ˙ ρ

˙ D ˙ , θ, ˙ ρ˙ implies that the inequality holds only if The arbitrariness of ∇θ, ∂∇θ φ = 0, ∂D φ = 0, η = −∂θ φ,

p = ρ2 ∂ρ φ.

12.5 Fluids in Electromagnetic Fields

701

˙ take arbitrary At any point x ∈ Rt and any time t, Maxwell’s equations allow E˙ and H vector values.9 Hence the inequality holds only if P = −ρ∂E φ,

μ0 M = −ρ∂H φ.

Incidentally, since ρψ = ρφ + P · E + μ0 M · H, if P is independent of H and M is independent of E then it follows ∂E ψ =

1 E ∂E P, ρ

∂H ψ =

μ0 H ∂H M; ρ

ˆ θ, ∂ Ei ψ = (1/ρ)E j ∂ Ei P j , ∂ Hi ψ = (μ0 /ρ)H j ∂ Hi M j . As a consequence, by ψ(ρ, P, M) we have ∂H ψ = ∂M ψˆ ∂H M. ∂E ψ = ∂P ψˆ ∂E P, Hence, if det ∂E P = 0, det ∂H M = 0 it follows ˆ E = ρ∂P ψ,

ˆ H = ρ∂M ψ,

in accordance with the classical Legendre transforms. Now, L = D + W. Since W is arbitrary and occurs linearly (whereas D does not) then the entropy inequality implies skwTˇ = 0,

1 E · J + Tˇ · D − q · ∇θ ≥ 0. θ

Since skwT = skw(P ⊗ E + μ0 M ⊗ H) is required by the balance of angular momentum, the restriction T ∈ Sym is allowed provided P ⊗ E + μ0 M ⊗ H ∈ Sym. If φ depends on E and H through the scalars E2 , H2 , and E · H then P ⊗ E + μ0 M ⊗ H = −ρ[2∂E2 φ E ⊗ E + 2∂H2 φ H ⊗ H + ∂E·H φ (H ⊗ E + E ⊗ H)] ∈ Sym,

and the balance of angular momentum holds. If J and q are independent of D then the entropy inequality reduces to Tˇ · D ≥ 0 and hence 



Maxwell’s equations for the co-moving observer are ∇ × H = J + 0 E + P = J + 0 (E˙ − LE +    ˙ − LH + (∇ · v)H) − μ0 M. Hence appropriate values (∇ · v)E) + P and ∇ × E = − B = −μ0 (H of ∇ × H and ∇ × E , which do not occur in the independent variables, allow us to say that arbitrary ˙ and E˙ are consistent with Maxwell’s equations. values of H

9

702

12 Electromagnetism of Continuous Media

Tˇ = 2μ D + λ(∇ · v)1 where μ ≥ 0, 2μ + 3λ ≥ 0. The viscosity coefficients μ, λ are allowed to depend D| and |E|, |H| thus providing models of electro-rheological fluids. If D = 0 it on |D follows that the functions J and q are required to satisfy 1 E · J − q · ∇θ ≥ 0. θ

(12.19)

Inequality (12.19) holds if J = A1 E + A2 ∇θ,

q = A3 E + A4 ∇θ,

where A1 , ..., A4 are matrices possibly parameterized by ρ, θ, E, H, the 6 × 6 matrix A=

1 (A2 2

1 A1 (A2 − A3T /θ) 2 T − A3 /θ) −A4 /θ

being positive definite. More general models are allowed by letting J = A1 E + A2 ∇θ + aE × H + b∇θ × H, q = A3 E + A4 ∇θ + cE × H + d∇θ × H, where a, b, c, d are scalars subject to b + c/θ = 0. For isotropic materials, the analogous relations with J and ∇θ as independent variables are written in the form E = J/σ + α∇θ + RH × J + N H × ∇θ, q = βJ − κ∇θ + SH × J + LH × ∇θ. The entropy inequality holds if10 σ > 0, κ > 0, (θα − β)2 ≤ 4κθ/σ, N + S/θ = 0. Each term models a well-known effect. The term J /σ models Ohm’s law, Je = σE, σ being the electric conductivity. The term α∇θ represents the thermoelectric effect which is the conversion of temperature differences to electric voltage; it is referred to as Thompson effect. The term βJ is said to represent the Peltier effect while q = −κ∇θ is Fourier’s law of heat conduction. In addition, RB × J is the electric field perpendicular to J and H; it is called the Hall effect. Also N H × ∇θ is a further 10

These inequalities follow from the observation that, for any two vectors u, v and scalars a, b, c the requirement au2 + bu · v + cv2 ≥ 0 holds if au2 − |b||u||v| + cv2 ≥ 0 and hence if a ≥ 0, c ≥ 0, b2 ≤ 4ac.

12.6 Magnetoelasticity

703

contribution to E and is called the Nernst effect. Moreover, SH × J and LH × ∇θ are denoted as Ettingshausen and Leduc-Righi effects. Let 1 F := E · J − q · ∇θ. θ We can view F as given by a function F(E, ∇θ) parameterized by ρ, θ, H. By (12.19) it follows that F(E, ∇θ) ≥ 0, F(0, 0) = 0. Hence F must have a minimum at E = 0, ∇θ = 0. This implies that, if J and q are differentiable, J(0, 0) = ∂E F(0, 0) = 0,

q(0, 0) = −θ∂∇θ F(0, 0) = 0.

Both J and q vanish as E = 0, ∇θ = 0. With this restriction we can represent F by Taylor’s formula, relative to the variables E, ∇θ, and find higher-order terms (corrections) to the above electromagnetic effects. These expansions, up to fourthorder, are given in [58, 59].

12.6 Magnetoelasticity Magnetoelasticity models the interaction of magnetic fields with elastic solids. The scheme deals with the deformation and magnetization of elastic materials under external effects, namely applied loads and/or deformations and thermal and magnetic fields. Accordingly the free charge density q and the electric polarization P are neglected and hence D = 0 E, B = μ0 (H + M). A conduction current density J is allowed to occur. Again we regard the electromagnetic fields as those at the reference at rest with the pertinent point of the body. The balance equations (2.72), (2.73), (2.75) simplify to ˙ × E + μ0 J × H, ρ˙v = ∇ · T + ρb + μ0 (M · ∇)H − 0 μ0 ρm skwT = skwμ0 (M ⊗ H), ˙ + T · L − ∇ · q + ρr. ρε˙ = E · J + μ0 ρH · m Hence the entropy inequality ρη˙ + ∇ ·

q ρr − +∇ ·k ≥0 θ θ

and substitution of ∇ · q − ρr from the balance of energy result in

704

12 Electromagnetism of Continuous Media

1 ˙ + T · L − q · ∇θ + θ∇ · k ≥ 0. − ρε˙ + ρθη˙ + E · J + μ0 ρH · m θ

(12.20)

We find it natural to regard H, rather than M or B, as the variable of magnetic character within the set of independent variables. This suggests that we consider the free energy φ = ε − θη − μ0 H · m and hence we can write the Clausius–Duhem inequality in the form ˙ − μ0 M · H ˙ + E · J + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(φ˙ + η θ) θ We let  = (F, θ, ∇θ, E, H) be the set of independent variables.11 Accordingly we assume φ, η, M, J, T, q, k as functions of . Moreover we let φ be continuously differentiable. Objectivity requires that φ depend on invariants under Euclidean transformations. Now, we assume H is a vector in that H → QH, Q being the rotation tensor, Q ∈ SO(3). Since F → QF then F−1 H, J F−1 H, FT H, and J FT H, are invariant vectors under a Euclidean transformation.12 Of course also the scalars H · H and H · FFT H are invariant,13 (QH) · (QH) = H · QT QH = H · H, QH · QFFT QT QH = H · QT QFFT H = H · FFT H. Instead, H · CH is not invariant. To fix ideas we let φ depend on H via the vector H = FT H,

11 As with electroelasticity we might allow for a dependence on D so that dissipative mechanical effects are described. 12 Incidentally, J F−1 B is considered in [136] as an independent variable on the view that J F−1 B is the magnetic induction in the reference configuration, and is regarded as a Lagrangian field. 13 As well as B · B and B · FFT B; see, e.g. [152], p. 310.

12.6 Magnetoelasticity

705

and, as in electroelasticity, we let φ depend on E via E = FT E. Hence we start with the free energy φ as given by14 φ = φ(C, θ, ∇θ, E , H ). Upon computation of φ˙ and substitution in the Clausius–Duhem inequality we have ˙ − ρ∂ φ · E˙ − ρ∂ φ · H˙ ˙ − ρ∂∇θ φ · ∇θ −ρ(∂θ φ + η)θ˙ − ρ∂C φ · C E H 1 ˙ + E · J + T · L − q · ∇θ + θ∇ · k ≥ 0. −μ0 M · H θ Observe

˙ = FT LT H + FT H ˙ H˙ = F˙ T H + FT H

˙ = 2FT D F. Substitution in the Clausius–Duhem and the like for E . Moreover C inequality, the observation that no significant generality is lost by letting k = 0, and some rearrangements yield ˙ −ρ(∂θ φ + η)θ˙ + [T − 2ρF∂C φFT − ρ(E ⊗ F∂E φ + H ⊗ F∂H φ)] · L − ρ∂∇θ φ · ∇θ ˙ + E · J − 1 q · ∇θ ≥ 0 −ρ(F∂E φ) · E˙ − (ρF∂H φ + μ0 M) · H θ

˙ implies ˙ ∇θ The arbitrariness of θ, ∂∇θ φ = 0,

η = −∂θ φ.

˙ can be chosen arbitrarily; in In view of Maxwell’s equations, the values of E˙ and H ˙ do not affect the remaining terms of the inequality, addition, the values of E˙ and H ˙ H ˙ implies which depend on F, θ, ∇θ, E, H. Hence the arbitrariness of E, ∂E φ = 0,

μ0 M = −ρF∂H φ.

(12.21)

The inequality then reduces to [T − 2ρF∂C φFT − ρH ⊗ F∂H φ)] · D + [T − ρH ⊗ F∂H φ)] · W + E · J −

1 q · ∇θ ≥ 0. θ

The arbitrariness of D and W implies T = 2ρF∂C φFT + ρH ⊗ F∂H φ,

(12.22)

T − ρH ⊗ F∂H φ ∈ Sym,

(12.23)

In view of (12.21), the requirement (12.23) can be written 14

We observe that FT H = H R , FT E = E R .

706

12 Electromagnetism of Continuous Media

skw T = skw μ0 M ⊗ H, in accordance with (2.73). We can then write the reduced inequality in the form 1 E · J − q · ∇θ ≥ 0. θ Possible models of J and q are given by15 J = A1 E + A2 ∇θ,

q = A3 E + A4 ∇θ,

subject to the positive semi-definiteness of the matrix A=

1 (A2 2

A1 − A3T /θ)T

− A3T /θ) , −A4 /θ

1 (A2 2

1 det A1 det A4 − 41 [det(A2 − A3T /θ)]2 ≤ 0. θ The simplest nontrivial model satisfying the inequality is given by A1 = σ(J, θ)1,

A4 = −κ(J, θ)1,

while A3 = 0, A4 = 0, namely J = σ(J, θ)E,

q = −κ(J, θ)∇θ,

with σ ≥ 0, κ ≥ 0 while J = det F = (det C)1/2 . Accordingly, σ and κ are required to be positive valued thus showing once again that the electric conductivity and the heat conductivity have to be positive. As a comment on (12.22), we observe that F∂C φFT ∈ Sym and hence it follows that T is the sum of a mechanical part, 2ρF∂C φFT ∈ Sym, and a magnetic part −μ0 H ⊗ M, which is the stress induced by the magnetic intensity H in the magnetizable body. The dependence of φ on H , e.g. H = FT H may be viewed as a modelling of the interaction between magnetic field and deformation. Observe skw T = skw μ0 M ⊗ H arises from the balance of angular momentum. It is worth remarking that this condition holds unchanged if H is taken to be one of the other invariant vectors. We have H = F−1 H : 15

T − ρH ⊗ (F−T ∂H φ) ∈ Sym,

μ0 M = −ρF−T ∂H φ,

Rate-type models of q and J require the insertion of q and J among the independent variables.

12.6 Magnetoelasticity

H = FT H :

707

T − ρH ⊗ (F∂H φ) ∈ Sym,

H = J FT H :

T − ρJ H ⊗ (F∂H φ) ∈ Sym,

μ0 M = −ρF∂H φ, μ0 M = −ρJ F∂H φ.

In all of these cases the symmetry requirement T + μ0 H ⊗ M ∈ Sym holds. Things are consistent if φ depends on H through the scalar invariant α = H · H. We observe that the restriction to the pertinent terms in the Clausius–Duhem inequality result in ˙ ≥ 0. −ρ∂α φ α˙ + [T − 2ρF∂C φ FT ] · L − μ0 M · H Since

˙ α˙ = 2H · H

then we can write ˙ ≥ 0. [T − 2ρF∂C φ FT ] · D + T · W − [μ0 M + 2ρ∂α φ H] · H Hence it follows T ∈ Sym,

μ0 M = −2ρ∂α φ H.

Since μ0 M ⊗ H = −2ρ∂α φ H ⊗ H ∈ Sym it follows that also the dependence on α = H · H satisfies T + μ0 H ⊗ M ∈ Sym. The literature addresses attention also to the scalar invariant κ = H · FFT H. Now, H · FFT H = (FT H) · (FT H) and hence the previous analysis on the dependence on H = FT H applies. Yet, we may follow a different procedure and look at the pertinent terms in the Clausius–Duhem inequality, ˙ ≥ 0. −ρ∂κ φ κ˙ + T · L − μ0 M · H Let u = FFT H. Since ˙ + (H ⊗ u) · L + (u ⊗ H) · LT κ˙ = 2u · H we can write the inequality in the form ˙ ≥ 0, [T − 2ρF∂C φ FT − 2ρ∂κ φ symH ⊗ u] · D + [T − 2ρ∂κ φ skwH ⊗ u] · W − [μ0 M + 2ρ∂κ φ u] · H

whence T − 2ρ∂κ φ H ⊗ u ∈ Sym,

μ0 M = −2ρ∂κ φ u.

708

12 Electromagnetism of Continuous Media

Hence T + μ0 H ⊗ M ∈ Sym. Linear approximation A simple scheme for applications follows if the fields involved justify a linear approximation. By linearizing, relative to F − 1, we have C  1 + 2S,

S = sym∇u.

Also we let the initial values of H and E be zero and hence we linearize also with respect to H and E. Moreover we let M vanish if both H and S approach zero. Hence M ⊗ H is a nonlinear term. The temperature θ is taken to be constant, θ = θ0 , in the reference configuration. The constitutive relations (12.22) then simplify to T = ρ0 ∂S φ,

μ0 M = −ρ0 ∂H φ,

where ρ0 = ρ R . Consequently, in the linear approximation, ∂S μ0 M = −∂H T, or, in suffix notation, μ0 ∂ Si j Mk = −∂ Hk Ti j . Let ϑ = θ − θ0 . The free energy φ is considered in the (quadratic) form ρ0 φ = ρ0 φ0 − ρ0 η0 ϑ −

ρ0 β 2 ϑ − ξkl ϑSkl + 21 Skl klmn Smn + Skl γklm Hm − νk ϑHk − 2θ0

By the entropy relation and (12.21), (12.22) it follows η = η0 +

β 1 νk ϑ + κkl Skl + Hk , θ0 ρ0 ρ0

Tkl = −ξkl ϑ + klmn Smn + γklm Hm , μ0 Mk = −γi jk Si j + νk ϑ + χkl Hl , while q and J are taken in the form qk = −κkl ∂xl ϑ + ζkl El ,

Jk = σkl El + αkl ∂xl ϑ.

1 2

Hk χkl Hl .

12.6 Magnetoelasticity

709

Let ϑ = 0. Using the Mandel-Voigt notation we can write μ0 M = −γ˜ T S˜ + ξ H,

˜ S˜ + γ˜ H, T˜ = 

˜ = ˜ T ∈ R6×6 , γ˜ ∈ R6×3 , χ = χT ∈ R 3×3 . If, as is where T˜ , S˜ ∈ R6 , H, M ∈ R3 ,  often the case, we regard the stress rather than the strain as an independent variable we observe ˜ −1 T˜ + ξ H. ˜ −1 γ˜ H, ˜ −1 T˜ −  M = −γ˜ T  S˜ =  Hence it follows that

∂ H S˜ = (∂T˜ M)T .

(12.24)

In isotropic materials the odd-order tensors vanish and the even-order tensors are expressed in terms of Kronecker deltas so that ρ0 φ = ρ0 φ0 − ρ0 η0 ϑ −

ρ0 β 2 ϑ − ξ ϑtr S + μS · S + 21 λ(tr S)2 − 21 χH2 . 2θ0 η = η0 +

β ξ ϑ + tr S, θ0 ρ0

T = (−ξϑ + λtr S)1 + 2μS, μ0 M = χ H, and q = −κ ∇ϑ + ζ E,

J = σ E + α ∇ϑ.

The requirement E · J − q · ∇θ/θ ≥ 0 results in κ ≥ 0,

σ ≥ 0,

4κσ ≥ θ(α − ζ/θ)2 .

Magnetostriction is a property of ferromagnetic materials that causes them to change their shape when subjected to a magnetic field. The Villari effect consists in a change of magnetization as a consequence of an applied stress. Equation (12.24) means that the two effects are the inverse of each other and they are quantitatively equal. There are approaches16 where the sum of the stress parts is obtained upon an additive decomposition of the magnetic induction (or the electric field) in elastic and viscous internal variables while the deformation gradient is decomposed into the product of the corresponding parts, F = Fe Fv .

16

See, e.g. [385, 386].

710

12 Electromagnetism of Continuous Media

12.7 Micromagnetics and Magnetism in Rigid Bodies The magnetic behaviour of materials is modelled within micromagnetics by starting from the evolution of single spin under the action of a time-dependent external magnetic field. Following the classical picture of a charge, we start from the equation for (the expectation value of) the time-dependent spin s s˙ = −γs × B, where γ=

|g|μ B , 

g < 0 is the gyromagnetic ratio and μ B is the Bohr magneton; if e is the charge and m is the mass then μ B = e/2m. The magnetic moment17 m = γs of an electron then satisfies ˙ = −γm m × B. m Really this equation was introduced by Landau and Lifshitz [270] with B replaced by an effective field Heff which is the resultant of the external magnetic field H, the demagnetizing field and also quantum mechanical effects due to the interaction between the magnetic moments (and γ comprising the permeability of free space μ0 ). Therefore ˙ =0 m ·m m| would be constant in time. Moreover, if B is constant and hence |m ˙ = 0, B ·˙ m = B · m so that m precesses about the B axis with constant frequency. Consistent with Landau and Lifshitz [270], we replace B with H, regarded as the effective field. Further, experiments show that beyond certain critical values of the applied magnetic field the magnetization saturates, becomes uniform and aligns parallel to the magnetic field. To incorporate this experimental fact from phenomenological grounds one can add a damping term. Landau and Lifshitz added a term in the evolution equation in the form ˙ = −γm m × H), m × H − αL L m × (m m

αL L > 0.

˙ = 0 whereas, if H is constant, Again m · m ˙ = αL L [m m2 H2 − (m m · H)2 ] ≥ 0, m|cos˙ θ = H · m |H| |m 17

Often denoted by μ.

(12.25)

12.7 Micromagnetics and Magnetism in Rigid Bodies

711

whence the angle of precession, θ, decreases in time. By arguing on a Rayleigh dissipation functional, Gilbert [191, 192] modified the evolution equation to a form that can be written ˙, ˙ = −γG m × H + αG m × m (12.26) m γG < 0 and αG being called the Gilbert gyromagnetic ratio and the Gilbert damping ˙ = 0 (saturated dipoles) it follows constant. Since, again, m · m ˙ = −γG m × (m ˙. m × H) − αG m 2m m ×m Hence a little algebra shows that (12.26) can be written ˙ =− m

αG γG γG m × H). m×H − m × (m 1 + α2G m 2 1 + α2G m 2

(12.27)

This shows the formal equivalence of the Landau–Lifshitz equation and the Gilbert equation provided only we make the identifications γ=

γG , 1 + α2G m 2

αL L =

αG γG , 1 + α2G m 2

and γG < 0. That is why Eq. (12.27) is referred to as Landau–Lifshitz–Gilbert equation (LLG for short). It is worth observing that αG → ∞ implies αL L → ∞. Hence, ˙ → 0 whereas αL L → ∞ makes m ˙ undeas αG → ∞, in view of (12.27) we have m termined. While low temperatures, θ  θC , make the saturation assumption physically grounded, at relatively high temperatures, θ > θC , the saturation assumption is no longer valid. We then let m = m e, e being the unit vector of m , and observe ˙ =m ˙ e + m e˙ m ˙ consists of a vector parallel to m , m ˙ e, and a vector perpenso that, geometrically, m ˙ is no longer perpendicular to m then it is natural to add a dicular to m , m e˙ . Since m term parallel to m in the evolution equation. This step traces back to Bloch [51, 185] and results in the Landau–Lifshitz–Bloch (LLB) equation ˙ = −γm m × H) + γα (m m · H)m m, m × H − γα⊥m × (m m where again H stands for the effective field. For definiteness the parameters α , α⊥ are taken to depend on temperature in the form

712

12 Electromagnetism of Continuous Media

α =

2 θ λ, 3 θC

α⊥ =

λ(1 − θ/3θC ) if θ < θC , if θ ≥ θC , α (θ)

λ being a positive parameter.18 Two remarks are in order. As θ → 0 LLB reduces to ˙ = −γμ × H − γλm m × (m m × H), m and hence the Landau–Lifshitz equation is recovered with αL L = γλ. If, instead, θ > θC then α = α⊥ and hence the evolution equation becomes ˙ = −γm m × H + γα⊥ H. m

12.7.1 Rate Equations and Thermodynamic Restrictions Borrowing from the evolution equations of micromagnetics we now set up a model for thermomagnetic solids. We assume the body is undeformable and hence regard the mass density, ρ, and the number density of dipoles, n, as constant. Let M = n m. The LLB equation indicates that we write the evolution equation ˙ = α1 M × H + α2 M2 H + α3 (M · H)M, M

(12.28)

where α2 and α3 arise as γα⊥ /n and γ(α − α⊥ )/n. Since α1 , α2 , α3 are allowed to depend on temperature, Eq. (12.28) is a particular case of rate equations of the form ˙ = M (θ, H, M). M It is understood that the (Lagrangian) time derivative is relative to the observer at rest with the body, at the point under consideration. In undeformable media, for the observer at rest the balance of energy (2.76) simplifies to ˙ − ∇ · q + ρr. ρε˙ = E · J + μ0 H · M Hence the Clausius–Duhem inequality (2.85) can be written as ˙ + E · J + μ0 H · M ˙ − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) θ

18

λ is called thermal bath coupling constant.

12.7 Micromagnetics and Magnetism in Rigid Bodies

713

The fields E, B, H, M are required to satisfy Maxwell’s equations which, for rigid bodies, read ˙ ∇ × E = −B,

∇ · B = 0,

∇ × H = 0 E˙ + J,

D = 0 E,

B = μ0 (H + M).

∇ · E = q/0 .

Moreover

These relations are to be viewed as constraints in the derivation of thermodynamic restrictions. To allow for heat conduction and electric conduction we let ∇θ and E be among the independent variables. Moreover we let a dependence on ∇M occur so that, at a macroscopic level, we account for the exchange energy of the physical picture [192]. Hence to ensure the equipresence rule we take  = (θ, ∇θ, E, H, M, ∇M, ∇∇M) as the set of independent variables.19 So we let ψ = ψ(θ, ∇θ, E, H, M, ∇M, ∇∇M) and the like for J, q, k, and ˙ = M (θ, ∇θ, E, H, M, ∇M, ∇∇M). M Compute ψ˙ and substitute in the Clausius–Duhem inequality to obtain ˙ − ρ∂M ψ · M ˙ − ρ∂∇M ψ · (∇ M) ˙ −ρ(∂θ ψ + η)θ˙ − ρ∂∇θ ψ · ∇ θ˙ − ρ∂E ψ · E˙ − ρ∂H ψ · H ˙ +E·J+B·M ˙ − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ∂∇∇M ψ · (∇∇ M) θ

˙ can be chosen arbitrarily without affecting the other terms within The value of ∇∇ M ˙ too can be chosen arbitrarily. To show this, we the inequality. The values of E˙ and H observe that, given J by the constitutive equation, any value of E˙ is allowed subject to ∇ × H = 0 E˙ + J. Hence, for any value of E˙ the constraint is satisfied by an appropriate value of ∇ × H ˙ is allowed subject to which does not enter the inequality. Likewise, any value of H ˙ + M) ˙ ∇ × E = −μ0 (H

˙ which would allow the modelling of hysteretic Here we neglect from the start a dependence on H materials. Hysteresis is developed in Sect. 15.3.

19

714

12 Electromagnetism of Continuous Media

and ∇ × E does not enter the inequality. Meanwhile, as is customary, we can give θ˙ and ∇ θ˙ arbitrary values without affecting the remaining terms of the inequality. The ˙ E, ˙ ∇ θ, ˙ H, ˙ θ˙ implies arbitrariness of ∇∇ M, ∂∇∇M ψ = 0,

∂∇θ ψ = 0,

∂E ψ = 0,

∂H ψ = 0,

η = −∂θ ψ.

For formal convenience we write the remaining inequality in the form ρ ˙ − ρ ∂∇M ψ · (∇ M) ˙ + 1 E · J + μ0 H · M ˙ − 1 q · ∇θ + ∇ · k ≥ 0. − ∂M ψ · M θ θ θ θ θ2 In view of the identity 1 ˙ = −∇ · ( 1 ∂∇M ψ M) ˙ + (∇ · 1 ∂∇M ψ) · M ˙ − ∂∇M ψ · (∇ M) θ θ θ it follows 1 ˙ + 1 E · J − 1 q · ∇θ + ∇ · (k − ρ ∂∇M ψ M) ˙ ≥ 0, (μ0 H − ρ δM ψ) · M θ θ θ2 θ where

ψ δM ψ := ∂M ψ − θ ∇ · ∂∇M . θ

This suggests that we let k= and hence k=

ρ ˙ ∂∇M ψ M θ

ρ ∂∇M ψ M (θ, ∇θ, E, H, M, ∇M). θ

Consequently we are left with the reduced dissipation inequality ˙ + E · J − 1 q · ∇θ ≥ 0. (μ0 H − ρ δM ψ) · M θ ˙ the reduced inequality holds if Irrespective of the chosen function M for M, ˙ ≥ 0, (μ0 H − ρδM ψ) · M

(12.29)

1 E · J − q · ∇θ ≥ 0. θ We append some comments on these results. The operator δM is the variational derivative, with respect to M. If ∂∇M ψ = 0 then δM ψ = ∂M ψ. The occurrence of ψ/θ shows a feature that is related to the dependence on gradients, here ∇M. The

12.7 Micromagnetics and Magnetism in Rigid Bodies

715

ratio ψ/θ is called rescaled (or reduced) free energy. Rescaled potentials are found also in other frameworks such as in phase separation problems [9]. The result for k is consistent with the view that in phase-field theories the extra-entropy flux is linear in the time derivative of the phase variable [155]. Moreover, the flux is larger where the magnetization gradient is larger. The classical inequality E · J − q · ∇θ/θ ≥ 0 is satisfied, e.g. by Ohm’s law and Fourier’s law, J = σ(θ)E, σ > 0;

q = −κ(θ)∇θ, κ > 0.

We now determine possible explicit functions M consistent with (12.29). Let Heff := H − Inequality (12.29) becomes

ρ δM ψ. μ0

˙ ≥ 0. Heff · M

(12.30)

To establish the compatibility with (12.30) of specific evolution equations it is convenient to examine first the constitutive equation ˙ = u + βe × M, ˙ M

(12.31)

˙ and e = M/|M|. Equation (12.31) generalizes the where u is independent of M Gilbert equation (12.26). The following statement provides an equivalent explicit ˙ relation for M. Equation (12.31) is equivalent to ˙ = M

1 [u + β 2 (u · e)e + βe × u]. 1 + β2

(12.32)

To verify this, observe ˙ = e × u + β[(e · M)e ˙ − M], ˙ e×M

˙ = e · u. e·M

Substitution in (12.31) and some rearrangements yield (12.32).



For definiteness we now look at the evolution equation [40] ˙ ˙ = α1 (Heff · e)e − α2 e × (e × Heff ) + βe × M, M

(12.33)

which amounts to letting u = α1 (Heff · e)e − α2 e × (e × Heff ) in (12.31). We can then state the following property. The evolution equation (12.33) is compatible with (12.30) if and only if α1 ≥ 0, α2 ≥ 0. Observe that Eq. (12.33) is equivalent to

716

12 Electromagnetism of Continuous Media

˙ = α1 (Heff · e)e − M

α2 α2 β [(Heff · e)e − Heff ] + e × Heff . 2 1+β 1 + β2

(12.34)

Consequently, ˙ = α1 |Heff · e|2 + Heff · M Hence

˙ ≥0 Heff · M

α2 [|Heff |2 − |Heff · e|2 ]. 1 + β2

⇐⇒

α1 ≥ 0, α2 ≥ 0 

and the conclusion follows.

It is worth remarking that the scalars α1 , α2 , β are allowed to depend on θ, B, M provided only that the inequalities α1 ≥ 0, α2 ≥ 0 hold whereas no restriction is placed on β. ˙ Let M = |M| and hence Equation (12.33) allows for nonzero values of M · M. M = Me so that

˙ = Me ˙ + M e˙ . M

In view of (12.33) we find

M e˙ = −

M˙ = α1 Heff · e,

α2 α2 β [(Heff · e)e − Heff ] + e × Heff . 2 1+β 1 + β2

Since20 α1 > 0 then M increases or decreases according as Heff · e is positive or negative, M˙  0 ⇐⇒ Heff · e  0. Equilibrium magnetization is then characterized by H−

ψ ρ δM = 0. μ0 θ

If ψ is independent of ∇M then the equilibrium is given by H− If we let 20

We disregard the trivial case α1 = 0.

ρ ∂M ψ = 0. μ0

12.7 Micromagnetics and Magnetism in Rigid Bodies

ψ = (θ) +

1 χ(θ)M2 2ρ

then Heff = H − and we find

717

χ M μ0

χ M˙ = α1 (H · e − M). μ0

Hence M increases or decreases according as H · e is larger or smaller than χM/μ0 . The evolution equation (12.34) can be viewed as strictly analogous to the LLB equation (12.28) with α2 β/(1 + β 2 ) playing the role of the opposite of the gyromagnetic ratio. In addition, about (12.34), we have proved that α1 and α2 are non-negative as a consequence of the second law of thermodynamics. We now go back to the possible expression of Heff and hence of the free energy ψ. The physical picture suggests that there is an internal magnetic energy, quadratic in M, and an energy related to exchange interactions, and hence proportional to |∇M|2 . We then let 1 1 ψ = (θ) − M · L (θ)M − a(θ)|∇M|2 . 2ρ 2ρ Consequently Heff =

1 a(θ) ∇M)]. [B − L (θ)M − θ∇ · ( μ0 θ

A detailed form of the tensor function L (θ) is given in Sect. 15.3 within the description of paramagnetic and ferromagnetic behaviours. Micromagnetics and dissipation mechanisms A different picture describes ferromagnets as two interacting continua, one with a mechanical structure and the other with a magnetic structure. It is a central point of the model that the dynamics is made formal by postulating the power expended in a typical process. The evolution equation for the magnetic moment m is assumed in the form [119, 365] ˙ = γm × (f + h + ∇ · ). m whereby the interaction between the two continua is modelled by the force h and the couple stress . The body is assumed to be undeformable and the power expended is taken to be ˙ − h · m, ˙  · ∇m ˙ = i j ∂x j m˙ i . Hence the inequality where  · ∇ m

718

12 Electromagnetism of Continuous Media

˙ +  · ∇m ˙ ≥0 −ψ˙ − h · m is assumed to express the dissipation property. The free energy density ψ, per unit ˙ and mass, is allowed to depend on m and ∇m while h and  may depend also on m ˙ Upon computation of ψ˙ and substitution we have ∇ m. ˙ − (∂∇m − ) · ∇ m ˙ ≥ 0. −(∂m ψ + h) · m Let heq = −∂m ψ,

eq = ∂∇m ψ

and hvs = h − heq ,

vs =  − eq .

So we can write the inequality in the form ˙ + vs · ∇ m ˙ ≥ 0, −hvs · m hvs and vs playing the role of viscous terms. Let ψ = 21 β(m · e)2 + 21 α|∇m|2 ,

α > 0,

where e is the unit vector of the applied field (H). Hence heq = −β(m · e),

eq = α∇m

while the viscous terms hvs , vs are chosen in the form ˙ ν > 0; hvs = −ν m,

vs = 0.

Upon substitution in the evolution equation for m we obtain the standard Gilbert equation ˙ = m × (f − β(m · e)e + αm) − ν m × m. ˙ γ −1 m The vector f − β(m · e)e + αm may then be viewed as the effective applied field acting on m. A simple variant of this scheme avoids the use of the couple stress. Start with ˙ = m × (f + h) γ −1 m and state the dissipation inequality as ˙ + θ∇ · k ≥ 0, −ψ˙ − h · m

12.7 Micromagnetics and Magnetism in Rigid Bodies

719

where k is the extra-entropy flux and θ is the constant temperature. As before, assume ψ is a function of m and ∇m so that ˙ − ∂∇m ψ · ∇ m ˙ −h·m ˙ + θ∇ · k ≥ 0. −∂m ψ · m Since ˙ = ∇ · (∂∇m ψ m) ˙ − (∇ · ∂∇m ψ) · m ˙ ∂∇m ψ · ∇ m the inequality can be written ˙ + ∇ · (θk − ∂∇m ψ m) ˙ ≥ 0. (−∂m ψ + ∇ · ∂∇m ψ − h) · m ˙ and The inequality holds if θk = ∂∇m ψ m ˙ h = −∂m ψ + ∇ · ∂∇m ψ − ν m,

ν > 0.

Assuming ψ as in the previous scheme we obtain (again) the Gilbert equation ˙ = m × (f − β(m · e)e + αm) − ν m × m. ˙ γ −1 m

12.7.2 Evolution Equations of Magnetization Evolution equations are derived for the magnetization M by looking at M as the number density times the magnetic moment μ. The body under consideration is a rigid solid and M, H are fields at the frame locally at rest with the body. The simplest model originates from micromagnetics and is due to Landau and Lifshitz, ˙ = −γM × H M (12.35) where H is the effective magnetic field in the frame (locally) at rest. It is apparent that, ˙ = 0 and M ˙ · H = 0. Consequently two quantities are constant in by (12.35), M · M time, |M| and, if H is constant, the angle between M and H. A dissipation term was then introduced by Landau and Lifshitz thus making the equation in the form ˙ = −γM × H − αM × (M × H). M Still M = |M| is constant. If, futher, H is constant then ˙ = α[M 2 H 2 − (M · H)2 ] > 0 (H · M)˙ = H · M as M × H = 0; the angle between M and H decreases in time.

(12.36)

720

12 Electromagnetism of Continuous Media

To account for dissipation Gilbert, considered a different term so that the evolution equation becomes ˙ ˙ = −γG M × H + βM × M. (12.37) M ˙ = 0. Hence the cross product of (12.37) with M By (12.37) we have again M · M ˙ while M · M ˙ = 0, results in and substitution of M × M, ˙ =− M

βγG γG M× H − M × (M × H). 2 2 1+β M 1 + β2 M 2

Hence (12.37) is equivalent to (12.36) provided only that β=

α , γ

γG = γ[1 + (α/γ)2 M 2 ].

Lately generalizations of (12.35) or (12.36) have been considered. We mention the equation ˙ = −γM × H + t M where t denotes a torque with the role of forcing the system to approach to equilibrium. In this sense the Bloch equation specifies t so that ˙ = −γM × H − 1 [M − Meq (H)], M τ

(12.38)

where [M − Meq (H)]/τ represents the forcing term toward equilibrium. The torque t has been specified in the form t = λ1 H − λ2 H. A further equation has been considered in the form ˙ = −γM × H − αt M × (M × H) + αl (M · H)M. M

(12.39)

Both (12.38) and (12.39) are named after Bloch [40, 185]; αt and αl denote transverse ˙ and longitudinal contributions of M to M. Further modifications of (12.35) have been considered as the equations modelling the evolution of Bloch walls between magnetic domains. Physically, the torque t is taken to represent the contribution of spin-polarized electric currents. Hence t is modelled in the form t = M× p (12.40) p representing the polarization of the electric spin current. In, e.g. [412] the torque is expressed by t = −(u · ∇)M + βM × [(u · ∇)M], (12.41)

12.7 Micromagnetics and Magnetism in Rigid Bodies

721

u being a velocity related to the incident beam of spin polarized charges. It is natural to ask about the thermodynamic consistency of the (evolution) equations (12.35)–(12.41). For simplicity consider a rigid and non-conducting solid, and let the Helmholtz free energy ψ, the entropy η, the entropy production σ, the ˙ and the extra-entropy flux be functions of the set of variables time derivative M, θ, H, M, ∇M. Time differentiation of ψ(θ, H, M, ∇M) and substitution into the Clausius– Duhem inequality yields ˙ + (μ0 H − ρ∂M ψ) · M ˙ − ρ∂∇M ψ · ∇ M ˙ + θ∇ · k = θσ ≥ 0. −ρ(∂θ ψ + η)θ˙ − ρ∂H ψ · H

By the standard arguments we conclude that η = −∂θ ψ. Differently from models of ˙ and M ˙ are not independent, here we let H ˙ and M ˙ be independent. hysteresis where H ˙ The arbitrariness of H implies ∂H ψ = 0. Hence we obtain the inequality ˙ − ρ∂∇M ψ · ∇ M ˙ + θ∇ · k = θσ ≥ 0. (μ0 H − ρ∂M ψ) · M

(12.42)

Divide by θ and employ the identity ρ ˙ = −∇ · ( ρ ∂∇M ψ M) ˙ +M ˙ · ∇ · ρ ∂∇M ψ − ∂∇M ψ · ∇ M θ θ θ to obtain

1 ˙ + ∇ · (k − ρ ∂∇M ψ M) ˙ =σ (μ0 H − ρδM ψ) · M θ θ

where δM ψ = ∂M ψ − (θ/ρ)∇ · [(ρ/θ)∂∇M ψ]. Hence we let k=

ρ ˙ ∂∇M ψ M, θ

˙ is required to satisfy where M ˙ = θσ ≥ 0. (μ0 H − ρδM ψ) · M

(12.43)

Any relation ˙ = G(θ, H, M)(μ0 H − ρδM ψ), M

G(θ, H, M) ∈ Psym,

obeys the reduced inequality (12.43). If we address attention to Eqs. (12.35)–(12.41) we observe that they have the form ˙ = −γM × H + t M with various forms of t . This suggests that we take ∂∇M ψ = 0, define

722

12 Electromagnetism of Continuous Media

w := μ0 H − ρ∂M ψ, and assume ∂M ψ = αM + βH. It follows w = (μ0 − ρβ)H − ραM,

w · (−γM × H + τ ) = w · t ≥ 0.

Hence the reduced inequality (12.42) holds if ˙ = −γM × H + ξM × w + νH × w + t d , M with arbitrary γ, ξ, ν while τ d is required to satisfy w · t d = θσ ≥ 0. Since σ is required to be non-negative then we let A be a positive definite tensor and assume θσ = t d · Attd and regard τ d as a function of the chosen variables θ, H, M. Then we have 0 = w · t d − t d · Attd = (w − Attd ) · t d . This condition holds if w − Attd = 0, Hence

t d = A−1 w.

˙ = −γM × H + ξM × w + νH × w + A−1 w M

(12.44)

is thermodynamically consistent. Equation (12.44) shows possible evolution equations of M. The gyromagnetic term γM × H is consistent provided ∂M ψ embodies a term αM. The torques ξM × w, νH × w are allowed for arbitrary ξ and ν. The dissipative torque t d is thermodynamically consistent if t d = A−1 (μ0 H − ρ∂M ψ), where A is positive definite. The torques γM × H, ξM × w, and νH × w are nondissipative in that the corresponding entropy production is zero. Moreover, M × w and H × w are uncommon in the literature. We might view H − ρ∂M ψ/μ0 = w/μ0 as the effective field; both M × w and H × w have a decreasing influence inasmuch as w → 0. We might then think that ˙ approaches zero as far as w → 0 namely as H approaches the equilibrium value M ρ∂M ψ/μ0 . Since w = −ρα[M − (μ0 − ρβ)H/ρα] then Eq. (12.38) holds with

12.7 Micromagnetics and Magnetism in Rigid Bodies

Meq (H) =

723

μ0 − ρβ H. ρα

As with (12.39), we find that ˙ = (μ0 − ρβ)αt (M × H)2 + (μ0 − ρβ)αt (M · H)2 − ρααl M2 (M · H). (μ0 H − ραM) · M

Thermodynamic consistency holds if αt ≥ 0 and αl = 0, which is the case of the Landau–Lifshitz equation (12.36). The result for the dissipative torque t d allows us to establish an interesting connection with the literature. Denote by m the unit vector of M and consider the tensor ˆ = α1 m ⊗ m + α2 (1 − m ⊗ m), A

α1 , α2 > 0.

Hence, for any vector u, ˆ = α1 (m · u)m + α2 (u − (m · u)m) = α1 (m · u)m − α2 m × (m × u). Au Incidentally (m · u)m = u ,

−m × (m × u) = u⊥

ˆ are the parallel and perpendicular parts of u with respect to m. Now let A−1 = A and then t d = Aw = α1 (m · w)m + α2 (w − (m · w)m) = −α2 w + (α1 − α2 )(m · w)m. Letting γ, ν = 0 we can write Eq. (12.44) in the form21 ˙ = ξM × w + α1 (w · m)m − α2 m × (m × w). M

(12.45)

The coefficients α1 , α2 have the dimensions of the inverse of time. Their reciprocals, say τ , τ⊥ , are then named ferromagnetic longitudinal and transverse relaxation times. The larger are the values α1 , α2 the faster is the damping of M , M⊥ to their equilibrium values. ˙ = Mm ˙ + Mm ˙ and m ˙ · m = 0 then Eq. (12.45) splits into the pair of Since M evolution equations

˙ = −γm × w − (α2 /M)m × (m × w), m , M˙ = α1 w · m

where w = μ0 H − ρ∂M ψ. The first equation governs the evolution of the unit vector and, except for w instead of H, has the form of Eq. (12.36). The second equation tells us that the amplitude M may change if α1 = 0. 21

An analogous result is obtained in [155], Eq. (4.7), by looking for thermodynamic restrictions ˙ = t p . Equation (12.45) is similar to (12.39), with w instead of H. on an equation like M

724

12 Electromagnetism of Continuous Media

12.8 Ferrofluids Ferrofluids are colloidal liquids made of nanoscale ferromagnetic, or ferrimagnetic, particles, with typical dimensions of about 10 nm, suspended in a carrier fluid (usually an organic solvent or water). Large ferromagnetic particles can be ripped out of the homogeneous colloidal mixture, forming a separate clump of magnetic dust when exposed to strong magnetic fields. The magnetic attraction of nanoparticles is weak enough that the surfactant’s Van der Waals force is sufficient to prevent magnetic clumping or agglomeration. Ferrofluids usually do not retain magnetization in the absence of an externally applied field. Often the particles are made of magnetite, Fe3 O4 , coated with surfactant agents in order to prevent their aggregation. The particles have a symmetry axis of easy magnetization; denote by e the unit vector of the symmetry axis. In ionic ferrofluids the nanoparticles are electrically charged. Quite naturally we may model ferrofluids by borrowing from the model of nanofluids. Yet two approaches can be developed. One describes ferrofluids as mixtures of two constituents: the fluid and the magnetic particles. The other approach models ferrofluids as micropolar media. Here we let the fluid be non-polar and the magnetic particles be solid. Moreover the ferrofluid is viewed as a mixture, the particles being viewed as a micropolar medium. We follow the quasistatic approximation (∂t B = 0, ∇ × H = 0) and neglect the polarization P. We let q be the charge density and J the corresponding current density. We denote by the subscripts s and f the quantities pertaining to solid nanoparticles and base fluids. Let m s be the magnetic moment field of each particle. The occurrence ms · ∇)H, the body couple μ0m s × H, of a magnetic field H results in the force μ0 (m ` s + vs · (m ` s = ∂t m s + (vs · ∇)m ms · ∇)H], where m ms . and the energy supply μ0 [H · m Also, the electric charge shows up a force density qE and a power density E · J; E, H, J are the fields at the reference of the co-moving observer. Moreover we let Ns be the number density of nanoparticles; Ns is related to ρs by Ns = ρs /m s , m s being the mass of nanoparticles. We let M = Ns m s ,

m=

1 1 M= ms ρs ms

be the magnetizations, per unit volume and unit mass, respectively. Hence the effects of magnetic moments, per unit volume, to body force, body couple, and energy supply are ` μ0 (M · ∇)H, μ0 M × H, [μ0 (M · ∇)H] · vs + μ0 ρs H · m, ` = ∂t m + (vs · ∇)m. where m With this in mind the modelling of nanofluids is improved by letting the balance of mass, momentum, orientational momentum, and energy be given the form

12.8 Ferrofluids

725

ρ`s + ρs ∇ · vs = 0, ρ` f + ρs ∇ · v f = 0, ρs v` s = ∇ · Ts + ρs g + μ0 (M · ∇)H + ps , ρ f v` f = ∇ · T f + ρ f g + p f , ρs σ` s = μ0 M × H + ϒ s + cs , 0 = ϒ f + cf, ` + ϒ s · (ws − ω s ) + es − ps · vs − cs · ω s , ρs ε` s = Ts · D s − ∇ · qs + ρs rs + Js · E + μ0 ρs H · m

ρ f ε` f = T f · L f − ∇ · q f + ρ f r f + e f − p f · v f , subject to the constraints ps + p f = 0,

cs + c f = 0,

es + e f = 0.

Adding the two equations of orientational momentum we find ρs σ` s = μ0 M × H + ϒ s + ϒ f , which means that the rotation of nanoparticles is governed by the magnetic body couple density, μ0 M × H, and the skew part of the total stress via ϒ s + ϒ f . The system of balance equations is closed if the equation for the evolution of m s is given. As with nanofluids, since ω f is undefined then it is consistent to let ϒ f = 0, which would follow also from the entropy inequality in view of the arbitrariness of w f . This implies that cs = 0, c f = 0. We then take ϒ s in the form ϒ s = ζ(ws − ω s ), where ws = 21 ∇ × vs is the axial vector of the spin Ws . Hence a simple model is allowed with Ts , T f in the form D f ), Tvis D f ), Ts = − ps 1 + Tskw , T jkskw = − 21  jki ϒi ; T f = − p f 1 + Tvis f (D f = O(D (12.46) where ps = ps (ρs , θs ), p f = p f (ρ f , θ f ). The system of balance equations is closed if the equation for the evolution of μs , and hence of m, is given. A simple model arises by letting m s = ms e,

σ s = Iω s ,

726

12 Electromagnetism of Continuous Media

I being the inertia tensor of particles, per unit mass, and e a unit vector attached to the particles, while ms is constant. Hence we have e` s = ω s × es ,

` s = ms e` s , m

` s + ω × Iω s ) = μ0 M × H + ζ(ws − ωs ). ρs (Iω The thermodynamic consistency of this constitutive model is now examined by starting from the inequality22

1 1 [ρα θα η`α + ∇ · qα − ρα rα + θα ∇ · kα − qα · ∇θα ] ≥ 0, α θα θα which is required to hold for any process compatible with the balance equations. Now, the energy balance equations can be written ` + J · E + ϒ s · (ws − ωs )]δsα + eα − pα · vα − cα · ω α , ρα ε` α = Tα · D α − ∇ · qα + ρα rα + [μ0 ρs H · m

where δs f = 0, δss = 1. Substituting ∇ · qα − ρα rα in the second-law inequality we obtain

1 1 [ρα θα η`α − ρα ε`α + Tα · D α + eα − pα · vα + θα ∇ · kα − qα · ∇θα ] α θα θα 1 ` + J · E + ϒ s · (ws − ω s )] ≥ 0. + [μ0 ρs H · m θs Using the Helmholtz free energy ψα = εα − θα ηα we find

1 1 [−ρα (ψ` α + ηα θ`α ) + Tα · Lα + eα − pα · vα + θα ∇ · kα − qα · ∇θα ] α θα θα 1 ` + J · E + ϒ s · (ws − ω s )] ≥ 0. + [μ0 ρs H · m θs ` suggests that we let The occurrence of the power μ0 ρs H · m φs = ψs − μ0 H · m,

φf = ψf.

Hence we can write the inequality in the form

1 1 [−ρα (φ` α + ηα θ`α ) + Tα · Lα + eα − pα · vα + θα ∇ · kα − qα · ∇θα ] α θα θα 1 ` + J · E + ϒ s · (ws − ω s )] ≥ 0, + [μ0 M · H θs 22

Here α = f, s.

12.8 Ferrofluids

727

` = ∂t H + (vs · ∇)H. where H Assume that φα , ηα , kα , qα depend on ρα , θα , D α , ∇θα , E, and H while Ts , T f are given by (12.46) and ps = −p f depends also on v f − vs . Upon computation of φ` α we find −

1 ` + ρ ∂ φ · E] ` α + ρα ∂∇θ φα · ∇θ [ρα (∂θα φα + ηα )θ` α + ρα ∂D α φα · D α α E α ` + ... ≥ 0, α α θα

` , E, ` α , ∇θ ` and θ`α , where E` = the dots denoting terms which are independent of D α ∂t E + (vα · ∇)E. Hence it follows ∂Dα φα = 0,

∂∇θα φα = 0,

∂E φα = 0,

ηα = −∂θα φα .

The reduced inequality

is investigated

under the assumption that the temperatures θ f , θs are equal. Since α eα = 0 and α pα = 0 the inequality can be written as

2 α [(ρα ∂ρα φα

where q =

1 ` q · ∇θ − (ρs ∂H φs + μ0 M) · H θ +J · E + Tvis f · D f + ϒ s · (ws − ω s ) ≥ 0,

− pα )∇ · vα − pα · uα + θ∇ · kα ] −

α qα .

` implies The arbitrariness of H μ0 M = −ρs ∂H φs .

D f ) letting E = 0 and ∇θ = 0 we find Moreover, since Tvis f = O(D pα = ρ2α ∂ρα φα . The reduced inequality can then be written

α [−pα

1 · uα + θ∇ · kα ] − q · ∇θ + J · E + Tvis f · D f + ϒ s · (ws − ω s ) ≥ 0. θ

Setting aside cross-coupling terms, we conclude that a thermodynamically consistent model is obtained by letting kα = 0, α = f, s, and ps = ξ(v f − vs ) = −p f , ξ ≥ 0,

q = −κ∇θ, κ ≥ 0,

J = σE, σ ≥ 0,

T f = −ρ2f ∂ρ f ψ f 1 + 2μ f D f + λ f (tr D f )1, μ f ≥ 0, 2μ f + 3λ f ≥ 0, ϒ s = ζ(ws − ω s ), ζ ≥ 0, Ts = −ρ2s ∂ρs ψs 1 + Tskw s ,

1 (Tskw s )i j = 2  jik ζ(ws − ω s )k .

728

12 Electromagnetism of Continuous Media

Along with ηα = −∂θα φα ,

μ0 M = −ρs ∂H φs ,

these equations constitute a model for ferrofluids. In a dynamic problem the magnetic field H and the electric field E are regarded as given fields and the unknown solution is the set of fields ρs , ρ f , vs , v f , σ s , εs , ε f , possibly with εs , ε f replaced by the temperature θ as an unknown of the problem. Yet if H is given then we regard also m (or M) as a field to be determined. Now, μ0 M = −ρs ∂H φs is a relation providing M, at (x, t), in terms of H and other variables, at (x, t). More realistically, we might think that M and H are related by a rate-type equation. A model of this form follows by letting m = M/ρs be a further variable and keeping ψs as the pertinent potential so that ` −(ρs ∂m ψs − μ0 H) · m occurs in the second-law inequality. If the rate-type equation has the form ` = M (H, m) m then the thermodynamic requirement is (μ0 H − ρs ∂m ψs ) · M (H, m) ≥ 0. In a one-dimensional scheme we might assume 1 m` = − (m − h(H )), τ > 0. τ The thermodynamic requirement implies that (ρs ∂m φs − μ0 H )(m − h(H )) ≥ 0. whence ρs ∂m φs = μ0 H + β(m − h(H )), β ≥ 0. Otherwise, if we let φs depend on both H and m then we find ` ≤ 0. ` + (ρs ∂H ψs + μ0 M) · H ρs ∂m φs · m This is a basic starting point for the modelling of hysteretic materials.

12.8 Ferrofluids

729

Relation to other approaches It is worth comparing our scheme with other models of ferrofluids that appeared in the literature. The simplest model traces back to Neuringer and Rosensweig [342]. Ferrofluid is viewed as an incompressible magnetizable but nonconductive fluid, without any internal structures. A steady flow of a ferrofluid is then modelled by ρ(v · ∇)v = −∇ p + μv + μ0 (M · ∇)H, ∇ · v = 0,

∇ × v = 0,

∇ · B = 0.

In essence this model follows by letting vs = v f , adding the two equations of motion and neglecting ρs ∂t vs , (ρs + ρ f )g. The micropolar structure of ferrofluids is considered in [113] by letting σ s = I ω s , where I is the scalar moment-of-inertia density. The occurrence of a couple stress tensor  is also allowed. Hence, in the present notation, the balance of orientational momentum becomes ` s = ∇ ·  + μ0 M × H + ϒ s + ϒ f . ρs I ω Schliomis model [395] also accounts for the micropolar structure. However the effect of H on the angular momentum σ s is viewed as though the torque μ0 M × H, in a time τs , would produce an angular momentum τs μ0 M × H, in addition to I ω s . Moreover the effect on M is taken as a constant M0 times the unit vector of H plus a time τ H times ω s × M. Furthermore, the curl of 21 μ0 M × H is taken to provide an equivalent body force. To sum up, by neglecting inertia the governing equations are taken in the form −∇ p + μ0 (M · ∇)H + μv + σ s = I ω s + τs μ0 M × H,

1 ∇ × (σ s − I ω s ) = 0, 2τs

ˆ + τ H σ s × M, M = M0 H I

ˆ is the unit vector of H. where H A model developed by Jenkins23 [232] associates the micropolar character with the magnetization and is based on the laws of conservation obtained by Brown [70] for quasistatic processes in magnetic materials. The body is viewed as a single material with additional terms of magnetic character. Yet some original terms are considered. ˙ and The balance of angular momentum involves an additional magnetic term m × m a magnetic stress tensor . The energy density involves a formal kinetic energy 1 ˙ · m. ˙ βm 2 The balance of mass and linear momentum are given by the classical equations 23

See also [166].

730

12 Electromagnetism of Continuous Media

ρ˙ + ∇ · (ρv) = 0, ρ˙v = ∇ · T + μ0 (M · ∇)H + ρb. ˙ The angular momentum, per unit volume, is assumed to be ρ(r × v + βm × m), where β is a dimensional constant. Indeed, by analogy with micromagnetics (see Sect. 12.7), a balance law for the magnetization is assumed in the form d ∫ ρβ m ˙ dv = ∫ nda + ∫ ρ(μ0 H + ψ) dv dt Pt Pt ∂Pt for any region Pt of the body. The tensor  is viewed as a magnetic stress tensor while ψ is called the intrinsic magnetic field. Hence the balance of angular momentum is assumed in the form d ∫ ρ(r × v + βm × m)dv ˙ = ∫ [r × (Tn) + m × (n)]da dt Pt ∂Pt + ∫ ρ[μ0 m × H + μ0 r × (m · ∇)H + r × b + m × ψ]dv. Pt

The corresponding local form of the balance of angular momentum is ¨ = ϒ − (∇) × m + r × ∇ · T + m × ∇ ·  + μ0 M × H + r × ρb + μ0 r × (M · ∇)H ρ(r × v˙ + βm × m)

and hence, in view of the equation of motion, ¨ = ϒ − (∇) × m + m × ∇ ·  + μ0 M × H + M × ψ. ρβm × m The evolution equation ¨ = ∇ ·  + ρμ0 H ρβ m is allowed on condition that ϒ − (∇) × m + M × ψ = 0. Likewise the balance of energy is assumed to be expressed by d ∫ ρ(ε + 21 v2 + 21 β m ˙ 2 )dv = ∫ [vT + m ˙ − q) · nda dt Pt ∂Pt ˙ · H + r ]dv. + ∫ ρ[μ0 v · (m · ∇)H + v · b + μ0 m Pt

The local form is

12.8 Ferrofluids

731

˙ ·m ¨ = T · L + v · (∇ · T) + (∇) · m ˙ −∇ ·q+m ˙ · (∇ · ) ρε˙ + ρv · v˙ + ρβ m ˙ · H + ρr. +μ0 v · (M · ∇)H + ρv · b + ρμ0 m ¨ it follows that Upon substitution of ρ˙v and ρβ m ˙ · H + ρr. ˙ − ∇ · q + ρμ0 m ρε˙ = T · L + (∇) · m A simple model used for ferrofluids is based on the conditions ∇ · v = 0, ∇ × H = 0, ∇ · B = 0, M = χH while T = − p1 + 2μ D + skwT, where D is the stretching. Since (skwT) jk = 21 k ji ϒi then ∂xk (skwT) jk = 21 k ji ∂xk ϒi ,

∇ · (skwT) = − 21 ∇ × ϒ.

For steady flows of ferrofluids the equation of motion becomes −∇ p + μv + μ0 (M · ∇)H − ∇ × ( 21 ϒ) = 0 ¨ and  are neglected. This is consistent with the curlwhile ϒ = −μ0 H × M, if m ˆ α being a suitable term used in [298] where H is replaced with ρα2 (∇ × v) × M, ˆ constant parameter and M the unit vector of M.

12.8.1 Magneto-Rheological Fluids Magneto-rheological fluids respond to an applied magnetic field with a change in the rheological properties like, e.g. effective viscosity. Hence an electromagnetic control governs the behaviour of a mechanical system. While both electric and magnetic controls are of interest, in practice applications of the magnetic effects look more significant. The interaction between the resulting magnetic dipoles causes the particles to form columnar structures parallel to the applied magnetic field. These chain-like structures hinder the motion of the fluid thus increasing the viscous properties. We then examine the action of the magnetic field on the properties of ferrofluids. Assume the fluid flows between two infinite parallel fixed plates so that the flow is irrotational (∇ × v = 0). So the balance of orientational momentum can be written as ν > 0. ρs σ` s = μ0 M × H − νω s ,

732

12 Electromagnetism of Continuous Media

Since σ` s = Iω ` s + ω s × Iω s then the inner product with ω s yields ` s = μ0 M × H · ω s − νω 2s . ω s · Iω ` s = 21 (ω s · Iω s )`and then Now, ω s · Iω 1 (ω s 2

· Iω s )` = μ0 M × H · ω s − νω 2s .

This shows that the interaction of the fluid with the particles result in a decay of the kinetic energy 21 (ω s · Iω s ). Equilibrium of the particles, ω s = 0, occurs when M × H = 0 that is when the particle (axis) is aligned with the magnetic field H.

12.9 Magneto-, Electro-, and Mechanical-Optical Effects 12.9.1 Magneto-Optical Effects Magneto-optical effects are phenomena associated with electromagnetic waves propagating through a medium acted upon by a static magnetic field. In such materials, called gyrotropic or gyromagnetic, left- and right-rotating elliptical polarizations can propagate at different speeds. When light is transmitted through a layer of magnetooptic material, the plane of polarization can rotate and the speed of propagation depends on the polarization (Faraday effect). Discovered by Michael Faraday in 1845, the Faraday effect was the first experimental evidence that light and electromagnetism are related. The Faraday effect is caused by left and right circularly polarized waves propagating at slightly different speeds, a property known as circular birefringence. Since a linear polarization can be decomposed into the superposition of two equal-amplitude circularly polarized components of opposite handedness and different phase, the effect of a relative phase shift, induced by the Faraday effect, is to rotate the orientation of the polarization.24 Mathematically, the Faraday effect is modelled by a linear relation between the ˙ Yet the occurrence of P˙ makes the constituelectric field E and the time derivative P. tive equation non-objective. To overcome this difficulty we proceed in the following way. Consider the electric field E R and the polarization P R in the reference configuration, E R = EF and P R = J F−1 P. In addition to being referential fields, E R and P R are invariant under a change of frame. If Q is the time-dependent orthogonal tensor of the transformation we have

24

As Faraday wrote in his daily notebook, “Still, I have at last succeeded in illuminating a magnetic curve or line of force, and in magnetizing a ray of light”.

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

733

E∗R = (QE)(QF) = E(QT Q)F = EF = E R , P∗R = J (QF)−1 QP = J F−1 (QT Q)P = J F−1 P = P R . Since P R is invariant then P˙ R is objective (really, is invariant) and hence P˙ R can enter a constitutive equation. The same conclusion holds for25 FT P. First we observe that, by (2.84), if J, q, M, and k are zero then the entropy inequality can be written as ˙ + ρE · π˙ + T · L ≥ 0, −ρ(ψ˙ + η θ) where π = P/ρ. Now, π= Consequently,

1 1 FP R = FP R , ρJ ρR

π˙ =

1 ˙ (FP R + FP˙ R ). ρR

ρ R E · π˙ = E R · P˙ R + [E ⊗ (FP R )] · L.

Hence, upon multiplying by J , the entropy inequality can be written in the form ˙ + E R · P˙ R + [J T + E ⊗ (FP R )] · L ≥ 0. −ρ R (ψ˙ + η θ) We let ψ, η, E R , T depend on θ, P R , P˙ R , F. Indeed, since ψ, η, E R are Euclidean invariants, they are assumed to depend on F via the Cauchy-Green tensor C = FT F. ˙ = 2FT D F, computing ψ˙ and substituting we obtain the entropy inequality Since C in the form −ρ R (∂θ ψ + η)θ˙ + (E R − ρ R ∂P R ψ) · P˙ R − ρ R ∂P˙ R ψ · P¨ R + [J T − 2ρ R F∂C ψFT + E ⊗ (FP R )] · L ≥ 0.

The arbitrariness of P¨ R and θ˙ implies ∂P˙ R = 0,

η = −∂θ ψ.

Moreover, the arbitrariness of L implies T = 2ρF∂C ψFT − E ⊗ (FP R /J ). Since FP R /J = P then

T = 2ρF∂C ψFT − E ⊗ P.

Hence T is the sum of a symmetric elastic stress 2ρF∂C ψFT and an electromagnetic stress −E ⊗ P. Indeed, Under a change of reference with rotation tensor Q, F → QF and P → QP. As a consequence FT P → (QF)T QP = FT P.

25

734

12 Electromagnetism of Continuous Media

skwT = −skw(E ⊗ P) = skw(P ⊗ E), as expected. The remaining inequality (E R − ρ R ∂P R ψ) · P˙ R ≥ 0 holds if

E R = ρ R ∂P R ψ −  × P˙ R ,

 being an invariant vector. To derive a relation in terms of E and P we observe that ∂P ψ = ∂P R ψ J F−1 . So we have

E = ρ∂P ψ − ( × P˙ R )F−1 .

The vector  is called the gyration vector. In isotropic materials  is determined by a given magnetic induction B0 so that  = hB0 , h being possibly a scalar function of B0 . We now apply the model to the analysis of linear wave propagation. We let ρ be constant and F = 1. In addition, we let ρ∂P ψ = αP. Hence we have ˙ E = αP −  × P. The vector  so introduced is called the gyration vector. Our interest is in the case where the gyration vector is originated by a constant magnetic induction B0 . For isotropic materials we let  = hB0 , h being possibly a scalar function of B0 . Let Tel be the linear approximation of 2ρF∂C ψ and hence assume Tel = μ(∇u + (∇u)T ) + λ(∇ · u)1. Since T = Tel − E ⊗ P then ∇ · T = ∇ · Tel − E(∇ · P) − (P · ∇)E.

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

735

as is natural in formulations of constitutive equations, the value of E is meant as that at the frame at rest with the body. If we refer to the values at the laboratory frame then E → E + v × B. In view of the linear approximation, the terms E(∇ · P) and (P · ∇)E are neglected in the equation of motion (2.72). Moreover, the force term ρπ˙ × B is approximated by ∂t P × B0 . Hence the equation of motion, the constitutive equation for P, i.e. ˙ and Maxwell’s equations can be written in the form E = αP −  × P, ρ0 ∂t2 u = (μ + λ)∇(∇ · u) + μu + ∂t P × B0 , E + ∂t u × B0 − αP + hB0 × ∂t P = 0, ∇ × B/μ0 − 0 ∂t E − ∂t P = 0, ∇ × E + ∂t B = 0,

∇ · (0 E + P) = 0, ∇ · B = 0.

Magneto-optic effects are now described by having recourse to homogeneous plane-wave solutions of the system, u(x, t) = uˆ exp[i(kn · x − ωt)],

P = Pˆ exp[i(kn · x − ωt)]

ˆ Pˆ are complex amplitude vectors, k is the and the like for B − B0 and E. Here u, wave number, and n is the unit vector of the propagation direction. Upon substitution of u, P, B, E into the system of differential equations we find ˆ + μk 2 uˆ + iω Pˆ × B0 , ρ0 ω 2 uˆ = (μ + λ)k 2 (n · u)n

(12.47)

Eˆ − iω uˆ × B0 − αPˆ − iωhB0 × Pˆ = 0,

(12.48)

ˆ 0 + ω0 Eˆ + ω Pˆ = 0, kn × B/μ ˆ = 0, kn × Eˆ − ω B

ˆ = 0, kn · (0 Eˆ + P) kn · Bˆ = 0,

(12.49) (12.50)

where Bˆ is the amplitude vector of B − B0 . Suggestive solutions emerge by restricting attention to waves propagating in the direction of B0 . Let B0 be the component of B0 along n, namely B0 = B0 n. Inner and vector multiplication of (12.47) by n gives the two equations [ρ0 ω 2 − (2μ + λ)k 2 ]n · uˆ = 0,

(12.51)

(ρ0 ω 2 − μk 2 )n × uˆ − iω B0 Pˆ = 0.

(12.52)

736

12 Electromagnetism of Continuous Media

Let uˆ be collinear with n. Hence n · uˆ = 0; by (12.51) the phase velocity vph = ω/k is subject to |vph | = [(2μ + λ)/ρ0 ]1/2 . By (12.52) it follows Pˆ = 0 and hence by (12.48) and (12.50) Eˆ = 0 and Bˆ = 0; Eq. ˆ P, ˆ and Bˆ are zero. Hence the solution is the classical (12.49) holds identically if E, longitudinal sound wave (uˆ  n); this wave is unaffected by the applied field B0 . ˆ We now let uˆ be orthogonal to n. Substitution of Bˆ = n × Ek/ω in (12.49) gives Pˆ + (0 − k 2 /μ0 ω 2 )Eˆ = 0 whence ˆ Eˆ = γ P,

γ :=

ω/k β2 , β= , 0 (1 − β 2 ) c

√ c = 1/ μ0 0 being the light speed in free space. Substituting in (12.48) we find ˆ = 0. (α − γ)Pˆ + iω B0 [hn × Pˆ − n × u]

(12.53)

Inner multiply by n × Pˆ and use the properties n · Pˆ = 0, uˆ · Pˆ = 0 to obtain ωh B0 Pˆ · Pˆ = 0. Unless ωh B0 = 0 we have

Pˆ · Pˆ = 0.

(12.54)

Since Pˆ is complex-valued then we let Pˆ = Pˆ r + i Pˆ i and observe that Eq. (12.54) implies Pˆ r · Pˆ i = 0, |Pˆ r | = |Pˆ i |, ˆ Let θ = kn · x − ωt and e1 , e2 be Pˆ r , Pˆ i denoting the real and imaginary parts of P. the unit vectors of Pˆ r , Pˆ i , with (e1 , e2 , n) an orthogonal triad. The function P(x, t) can then be represented by P(x, t) = (Pˆ r + i Pˆ i ) exp(iθ) = P(e1 cos θ − e2 sin θ) + i P(e2 cos θ + e1 sin θ), where P = |Pˆ r | = |Pˆ i |. Things are different depending on whether we look at the time behaviour of the wave at a fixed point or, at a fixed time, we look at the wave along the direction of propagation n. As θ increases (n · x increases), if (e1 , e2 , n) is a right-handed triad then the vector P (both Pˆ r and Pˆ i ) describes a circle in the clockwise sense and we say that P is a right circularly polarized wave. Likewise, as θ decreases26 P is a left-circularly polarized wave. For definiteness, let θ increase. 26

Namely, at fixed x as t increases or at fixed t in the reverse direction of propagation, −n.

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

737

Conversely, if P is such that (e1 , e2 , n) is a left-handed triad then P describes a left circularly polarized wave. Observe ˆ n × Pˆ = ±i P, where the upper (lower) sign holds for left-handed (right-handed) triad (e1 , e2 , n). Since iω B0 ˆ P, n × uˆ = ρ0 ω 2 − μk 2 and n × Pˆ = ±i Pˆ then (12.53) becomes [α − γ +

2 B02 vph 2 ρ0 (vph

− μ/ρ0 )

∓ω B0 h]Pˆ = 0,

where vph = ω/k. Waves occur if the propagation condition α−γ+

2 B02 vph 2 ρ0 (vph

− μ/ρ0 )

∓ω B0 h = 0

(12.55)

holds. Equation (12.55) is the dispersion relation for transverse waves with a direction parallel to the applied magnetic field. The ∓ω B0 h term means that left and rightcircularly polarized waves propagate with different speeds; the upper sign refers to left-circularly polarized waves. This shows that the Faraday effect, that is the difference of the speeds depending on the polarization, is provided by the product B0 h. If B0 = 0 then the propagation condition (12.55) simplifies to α − γ = 0,

β2 =

1 . 1+χ

2 2 Irrespective of the polarization, the quantity B02 vph /ρ0 (vph − μ/ρ0 ) may account for the magnetoelastic dragging whereby the speed of a transverse sound wave traversing an elastic dielectric is lowered by the application of a magnetic field in the direction of propagation.

12.9.2 Frequency Dependence, Closed Processes, Memory Functionals As shown in the previous section, elementary microscopic pictures of electromagnetic materials result in frequency-dependent conductivities and susceptibilities. More generally, we might consider constitutive relations in the form

738

12 Electromagnetism of Continuous Media

Pω (t) = χ(ω)Eω (t),

Mω = χm (ω)Hω (t),

(12.56)

where χ and χm are the complex-valued electric and magnetic tensor susceptibility.27 Relations of the form (12.56) are applied to describe the response to electromagnetic wave propagation. Yet they are shown to follow from wave fields of the form Eω (x, t) = E0 exp[i(k · x − ωt)],

(12.57)

or the analogue for H, where E0 is a complex vector. This function describes an electric field propagating in the direction of k with phase velocity ω/|k|. This is done with the convention that the physical fields are obtained by taking the real parts ([230], Chap. 7). While it is well known that the complex exponential representation is convenient in calculations, here we show how definite conclusions can be obtained, without any convention, if a corresponding model is given in the time domain. For definiteness we look at P modelled by a memory functional of E in the form ∞

P(t) = G0 E(t) + G0 ∫ χ(u)E(t − u)du,

(12.58)

0

G0 , G0 being two second-order tensors while χ(u) → 0 as u → ∞. Let E(t) = E1 cos α + E2 sin α,

α = k · x − ωt.

Substituting in (12.58) we find ∞



0

0

P(t) = G0 (E1 cos α + E2 sin α) + G0 E1 ∫ χ(u) cos(α − ωu)du + G0 E2 ∫ χ(u) sin(α − ωu)du

Since



∫ χ(u) cos(α − ωu)du = χc (ω) cos α + χs (ω) sin α, 0



∫ χ(u) sin(α − ωu)du = χc (ω) sin α − χs (ω) cos α, 0

then P(t) = (G0 + G0 χc )(E1 cos α + E2 sin α) − G0 χs (E2 cos α − E1 sin α). Moreover the power P · E˙ becomes P · E˙ = −ω[(G0 + G0 χc )(E1 cos α + E2 sin α) − G0 χs (E2 cos α − E1 sin α)] · [E2 cos α − E1 sin α].

27

See, e.g. [274], Chap. 11.

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

739

We now examine the consequences of the second-law inequality for cyclic processes. We consider an undeformable dielectric at constant temperature; if E is cyclic on [0, d] then, by (12.2), it follows d

˙ dt ≤ 0. ∫ P(t) · E(t)

(12.59)

0

To proceed within the domain of complex-valued vectors we let E0 = E1 + iE2 , E1 , E2 being real-valued vectors, and observe that Eω = E1 cos α + E2 sin α + i(E2 cos α − E1 sin α). Further, letting χ(ω) = G0 + G0 χc (ω) + iG0 χs (ω) =: χ1 + iχ2 we have Pω (t) = χ(ω)Eω (t) and P(t) = Pω (t). Consequently

˙ P(t) · E(t) = Pω (t) · E˙ ω (t).

This allows us to examine the real, physical power P · E˙ within the domain of complex-valued vectors. For any pair of complex vectors u, v we have u · v = 21 (u + u∗ ) · (v + v∗ ) = 41 (u · v + u∗ · v∗ + u∗ · v + u · v∗ ). Observe

1 (u∗ 4

· v + u · v∗ ) = 21 (u · v∗ ).

If both u and v depend on time in the form (12.57) then n2π/ω

n2π/ω

0

0

∫ (u · v)(t)dt = 21  ∫ (u · v∗ )(t)dt,

n ∈ N.

Letting u = Pω and v = E˙ ω we find 0≥

n2π/ω

˙ ∫ (P · E)(t)dt = 0

1 2

n2π/ω

∫ (χEω · E˙ ω∗ )(t)dt = nπ{E2 · (χ1 − χ1T )E1 − (E1 · χ2 E1 + E2 · χ2 E2 )}. 0

740

12 Electromagnetism of Continuous Media

The arbitrariness of E1 , E2 implies χ1 = χ1T ,

χ2 ≥ 0.

Absorption is absent, or the phenomenon is non-dissipative, if χ2 ∈ Skw. Hence the material allows for non-absorption if and only if χ∗ = χ T .

(12.60)

Back to the constitutive equation we can write χEω = χ1 Eω + iχ2 Eω ; the skew-symmetry of χ2 implies that there is a vector g, i.e. the axial vector of χ2 , such that χ2 Eω = g × Eω . So we can write Pω = χ1 Eω + ig × Eω . The vector g is then referred to as the gyration vector and we observe that iEω might be viewed as the time derivative of Eω /ω. A strictly analogous procedure would determine Eω in terms of Pω . As a comment, a specific model in the time domain, like (12.58), allows us to check that, e.g. P = Pω is the physical field. Otherwise we might take P = Pω by convention. Yet we see that χ2 ∈ Skw holds if and only if G0 ∈ Skw. This however would contradict G0 χc (ω) ∈ Sym unless χc (ω) = 0. Relation to closed processes in the real domain We now examine the restrictions placed by (12.59) on (12.58) without any recourse to complex-valued vectors. Let d be the duration of the cyclic process E(t). Having in mind Fourier’s theorem we let E(t) = Ekc cos kωt + Eks sin kωt,

ω = 2π/d, k ∈ N ∪ {0},

where Ekc and Eks can be represented in the base {e j } as Ekc = ak j e j , Substituting in (12.58) we have

Eks = bk j e j .

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

741

P(t) = (G0 + G0 χc (kω))Ekc cos kωt + (G0 + G0 χs (kω)) sin kωt. Since

˙ E(t) = kω(Eks cos kωt − Ekc sin kωt)

then P · E˙ = kω{cos2 kωt[Eks · (G0 + G0 χc )Ekc − Eks · G0 χs Eks ] − sin2 kωt[Ekc · (G0 + G0 χc )Eks + Ekc · G0 χs Ekc ] + cos kωt

sin kωt[Eks

· (G0 + G0 χc )Eks

+ Eks · G0 χs Ekc − Ekc · (G0 + G0 χc )Ekc + Ekc · G0 χs Eks ]}.

For any k the function E(t) on [0, 2π/|ω|] is a closed process. Now, 2π/|ω|

∫ cos2 kωt dt = 0

π , k|ω|

and the like for sin2 kωt. Hence we find that 2π/|ω|

˙ ∫ P(t) · E(t)dt = k π sgn ω{Eks · [G0 + G0 χc − (G0 + G0 χc )T ]Ekc − Eks · G0 χs Eks − Eks · G0 χs Ekc }. 0

In view of the arbitrariness of the vectors Ekc , Eks , inequality (12.59) holds if and only if ωχs symG0 ≥ 0. (12.61) G0 + G0 χc ∈ Sym, The dielectric is non-dissipative if χs (kω) = 0, which may be true only for particular frequencies ω. The requirements (12.61) are necessary conditions; they follow by the secondlaw inequality in connection with closed processes. Hence we conclude that the constitutive equation (12.58) is physically admissible only if the conditions (12.61) hold. In general, for non-closed processes in undeformable dielectrics (12.58) is required to comply with ˙ + P · E˙ ≤ 0. ρ(φ˙ + η θ) Hence, if φ, η, P depend on θ, E, Et we have ρ(∂θ φ + η)θ˙ + (P + ρ∂E φ) · E˙ + dφ(θ, E, Et |E˙ t ) ≤ 0. It follows that η = −∂θ φ,

P = −ρ∂E φ,

dφ(θ, E, Et |E˙ t ) ≤ 0.

Let P be given by (12.58) and, in view of (12.61), let G0 , G0 ∈ Sym,

G (u) = G0 χ(u).

742

12 Electromagnetism of Continuous Media

Hence the requirement P = −ρ∂E φ implies that ∞

ρφ(t) = − 21 E(t) · G0 E(t) − E(t) · ∫ G (u)E(t − u)du + (θ(t), Et ). 0

˜ E˜ be The free energy φ is also required to have a minimum property. Let θ, temperature and electric field functions on R. Constant continuations of θ˜t0 , E˜ t0 are defined by θt (u) =

 θ(t ˜ 0 ), u ∈ [0, t0 ], θ˜t0 (u − t0 ), u ∈ [t0 , ∞),

Et (u) =

 E(t ˜ 0 ), u ∈ [0, t0 ], E˜ t0 (u − t0 ), u ∈ [t0 , ∞).

It follows from φ˙ = dφ(θ, E, Et |E˙ t ) ≤ 0 that ˜ 0 ), E(t ˜ 0 ), E˜ t0 ). φ(θ(t), E(t), Et ) ≤ φ(θ(t By the continuity of the functional, as t − t0 → ∞ φ(θ(t), E(t), Et ) → φ(θ(t0 ), E(t0 ), E0† , ˜ 0 ), u ∈ [0, ∞). Hence where E0† is the constant history E0† (u) = E(t ˜ 0 ), E(t ˜ 0 ), E˜ t0 ). φ(θ(t0 ), E(t0 ), E0† ) ≤ φ(θ(t Thus, among all histories Et with given present value E0 none yields a smaller value of the free energy than that corresponding to the constant history E0† . For definiteness we select the function  in the form  = 0 (θ(t)) +

1 2



∫ E(t − u) · G (u)E(t − u)du. 0

Consequently we find28 ∞

ρφ = 0 (θ) − 21 E · G0 E − 21 E · ( ∫ G (u)du)E 0

+ 21 E









· ( ∫ G (u)du)E − E · ∫ G (u)E(t − u)du + 0

0

= 0 (θ) − 21 E · G∞ E + where

1 2

1 2



∫ E(t − u) · G (u)E(t − u)du 0



∫ (E(t) − E(t − u)) · G (u)(E(t) − E(t − u))du, 0



G∞ = G0 + ∫ G (u)du. 0

28

The dependence on the present values θ(t), E(t) is denoted by θ, E.

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

743

Since φ has a minimum at constant histories then G (u) ≤ 0, ∀u ∈ [0, ∞). This in turn implies G0 ≥ G∞ . Moreover, for constant histories P = G∞ E ; hence we conclude that G∞ > 0. The functional ρφ is consistent with thermodynamics if the further requirement dφ(θ, E, Et |E˙ t ) ≤ 0 holds. Now, ∞

˙ − u) · G (u)(E(t) − E(t − u))du. dφ(θ, E, Et |E˙ t ) = − ∫ E(t 0

˙ − u) = −∂u E(t − u); hence we obtain Observe E(t ∞

dφ(θ, E, Et |E˙ t ) = ∫ ∂u (E(t) − E(t − u)) · G (u)(E(t) − E(t − u))du 0

= 21 [(E(t) − E(t − u)) · G (u)(E(t) − E(t − u))]∞ 0 ∞

− 21 ∫ [(E(t) − E(t − u)) · G (u)(E(t) − E(t − u))]du. 0

Since G (u) → 0 as u → ∞ then it follows ∞

0 ≥ dφ(θ, E, Et |E˙ t ) = − 21 ∫ [(E(t) − E(t − u)) · G (u)(E(t) − E(t − u))]du. 0

This condition holds if and only if G = G0 χ ≥ 0. Since G ≤ 0 then we let G0 < 0, χ(0) > 0. Hence it follows χ ≤ 0; the function χ(u) is a monotone decreasing function.

744

12 Electromagnetism of Continuous Media

12.9.3 Electro-Optical Effects Certain materials change their optical properties when subjected to an electric field. The electro-optical effect is the change of the refractive index n resulting from the application of a steady (dc) electric field. Physically this is caused by forces that distort the orientations or shapes of the molecules constituting the material. As a consequence the electric field controls the light. It happens that the material becomes birefringent, with different indices of refraction for light polarized parallel to or perpendicular to the applied field. In modelling the electro-optical effects the customary assumptions are referred to as Pockels effect (n changes in proportion to the applied electric field) and Kerr effect (n changes in proportion to the square of the applied electric field). This indicates that we have to look for the electric polarization P as a nonlinear function of the electric field. For ease of modelling we let the body be non-conducting (J = 0, q = 0) and non-magnetizable (M = 0). Moreover it is consistent to let the extra-entropy flux be zero. Relative to an inertial frame, locally at rest, the second-law inequality (12.18) simplifies to ˙ − P · E˙ + T · L ≥ 0. −ρ(φ˙ + η θ) We model the material as thermo-electro-elastic and hence describe it by functions of the deformation gradient F, the temperature θ, and the electric field E. Borrowing from Sect. 12.2 we let φ be a Euclidean-invariant function of C, θ, E . Hence we find that P = −ρF∂E φ, T = 2ρF ∂C φ FT − E ⊗ P, η = −∂θ φ, whence skw[T + E ⊗ P] = 0. Let E, θ) φ = φ0 (C, θ) + φe (E and, for definiteness, assume φe = −

0 1 (1) [  (θ) · E ⊗ E + 13 (2) (θ) · E ⊗ E ⊗ E + 41 (3) (θ) · E ⊗ E ⊗ E ⊗ E ], ρ 2

where χ(i) , i = 1, 2, 3, are second-, third-, and fourth-order susceptibility tensors (i) is fully symmetric. and, e.g. (1) (θ) · E ⊗ E = (1) H K E H E K . By definition, each  Hence it follows E + (2) (θ)E E ⊗ E + (3) (θ)E E ⊗ E ⊗ E ]. P = 0 F[(1) (θ)E In terms of E we find P = 0 [χ(1) (θ)E + χ(2) (θ)E ⊗ E + χ(3) (θ)E ⊗ E ⊗ E],

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

745

(2) (3) where, e.g. χi(2) j = Fi H F j K  H K . In many materials the third term χ E ⊗ E ⊗ E is negligible in comparison with χ(2) E ⊗ E. In such cases, occurring mainly in some crystals, we restrict attention to a Pockels medium characterized by

P = 0 [χ(1) + χ(2) E]E. If, instead, the material is a gas, a liquid, or a centro-symmetric crystal then we can restrict to a Kerr medium (χ(2) = 0) and hence P = 0 [χ(1) + χ(3) E ⊗ E]E. To understand the birefringence properties associated with the Pockels and Kerr media, we first look at light in anisotropic non-magnetic media. We assume D = E, B = μ0 H and let the dielectric permittivity tensor  be real-valued and positive definite. Maxwell’s equations29 ∇ × E = −∂t B,

∇ × B = μ0 ∂t D,

∇ · D = 0,

∇ ·B=0

indicate the absence of free charges and electric current. Let E be given by a plane wave of angular frequency ω and wave vector k, E(x, t) = Eω exp[i(k · x − ωt)]. Since μ0 ∂t2 D = −∇ × (∇ × E) and −∇ × (∇ × E) = (k · E)k − k2 E then it follows k2 Eω − (k · Eω )k = μ0 ω 2 (Eω ). In addition, ∇ · D = 0 implies

(12.62)

k · D = 0.

whence D is perpendicular to the wave vector k. Owing to the possibly anisotropic permittivity , the orthogonality condition need not hold for the electric field E. If k · Eω = 0 then plane waves occur if k satisfies the compatibility condition det[k2 1 − k ⊗ k − μ0 ω 2 ] = 0.

29

(12.63)

Here we let F be time independent and hence we may take the fields E, D, H, B satisfy Maxwell’s equations in the laboratory frame.

746

12 Electromagnetism of Continuous Media

Equation (12.63) is called Fresnel’s equation and determines the unknown function k(ω). If  is isotropic,  = 1, then k · Eω = 0 and (12.62) reduces to (−k2 + μ0 ω 2 )Eω = 0 thus showing that plane waves occur at any direction with phase speed 1 ω2 , = k2 μ0  possibly dependent on the frequency ω if  depends on ω. In free space ω2 1 = = c2 . k2 μ0 0 Hence in matter v2 =

0 ω2 = c2 . k2 

Since v 2 /c2 = 1/n 2 , n being the refractive index, we find  = 0 n 2 . For formal convenience represent  and k in the principal basis of . If 1 , 2 , 3 are the eigenvalues of  then we can write  = 0 diag[n 21 , n 22 , n 23 ]. Hence det[−k2 1 + μ0 ω 2  + k ⊗ k] = 0 becomes ⎡

⎤ −k22 − k32 + ω 2 n 21 /c2 k1 k2 k1 k3 ⎦ = 0. k1 k2 −k12 − k32 + ω 2 n 22 /c2 k2 k3 det ⎣ k1 k3 k2 k3 −k12 − k22 + ω 2 n 23 /c2 Upon computing the determinant and rearranging the terms we can write the equation in the form ω4 ω2  k 2 + k 2 k2 + k2 k2 + k2   k2 k2 k2  − 2 1 2 2 + 1 2 3 + 2 2 3 + 2 1 2 + 2 2 2 + 2 3 2 k2 = 0. 4 c c n3 n2 n1 n2n3 n1n3 n1n2 (12.64) The simplest example of anisotropy is that of uniaxial materials.30 For definiteness let n 1 = n 2 = n 3 . In such a case we say that the third direction is the optical axis. 30

Uniaxial materials comprise the largest family of birefringent specimens, including calcite, quartz, and ordered synthetic or biological structures.

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

747

As is customary in the literature we use the notation n o , for n 1 and n 2 , and n e for n 3 . The determinantal equation (12.64) can then be written in the factored form  k2 n 2o



ω 2  k12 + k22 k32 ω2  = 0. + − c2 n 2e n 2o c2

Both conditions k2 ω2 − 2 = 0, 2 no c

k12 + k22 k2 ω2 + 32 − 2 = 0 2 ne no c

define surfaces in the space of wave vectors k that are allowed for a given ω. The first equation defines a sphere; this means that the refractive index is n o regardless of the direction of k (ordinary rays). The second equation defines a spheroid symmetric about the optical axis and the wave speed depends on the direction of k (extraordinary rays). Along the optical axis the two rays propagate with the same speed, ω2 c2 ω2 = = . k2 n 2o k32 If k is perpendicular to the optical axis (k3 = 0) then the extraordinary ray propagates with speed ω2 c2 ω2 = 2 = 2. 2 2 k ne k1 + k2 Hence the extraordinary ray is the fast wave if n e < n o (negative birefringence) and the slow wave if n e > n o . To model Pockels and Kerr effects it is worth distinguishing the case when the light wave is superposed to an external electric field E0 (DC effect) from that when no external field is applied (AC effect). Let E = E0 + E˜ exp[i(k · x − ωt)]. ˜  |E0 | we restrict attention to linear terms in E˜ and write If |E| P  0 (χ(1) + 2χ(2) E0 + 3χ(3) E0 ⊗ E0 )E˜ exp[i(k · x − ωt)], which shows that

χeq = χ(1) + 2χ(2) E0 + 3χ(3) E0 ⊗ E0

is the equivalent susceptibility tensor and hence  = 0 (1 + χeq ).

748

12 Electromagnetism of Continuous Media

12.9.4 Mechanical-Optical Effects The optical properties of a medium can be changed by mechanical agencies such as deformation and velocity gradients. A simple way of accounting for mechanicaloptical effects is to allow the permittivity, and the susceptibility, depend on the strain and the spin. Moreover dependences on E˙ are in order; the thermodynamic analysis shows how this can be realized. Again for ease of modelling we let the body be non-conducting (J = 0, q = 0) and non-magnetizable (M = 0), and let k = 0. Relative to an inertial frame, locally at rest, the second-law inequality (12.18) simplifies to ˙ − P · E˙ + T · L ≥ 0. −ρ(φ˙ + η θ) To account for mechanical-optical interactions we consider the reference electric field E = FT E and observe E˙ = F−T E˙ − LT E. Hence the inequality can be written in the form ˙ − (F−1 P) · E˙ + (E ⊗ P) · L + T · L ≥ 0. −ρ(φ˙ + η θ) Having in mind some models occurring in the literature we let φ, η, P, T depend ˙ = 2FT DF and on C, θ, E , E2 , E˙ . Since C E · E˙ = E · [F−T E˙ − E · LT F−T E ] = E · F−T E˙ − E · DE we can write the entropy inequality in the form −ρ(∂θ φ + η)θ˙ − (ρ∂E φ + F−1 P + ρ∂E2 φ F−1 E) · E˙ − ρ∂E˙ φ · E¨ +(T − 2ρF∂C φFT + E ⊗ P + ρ∂E2 φ E ⊗ E) · L ≥ 0. ˙ E¨ , and L implies The arbitrariness of θ, η = −∂θ φ,

∂E˙ φ = 0,

T = 2ρF∂C φFT − E ⊗ P − ρ∂E2 φ E ⊗ E,

(12.65)

(ρ∂E φ + F−1 P + ρ∂E2 φ F−1 E) · E˙ ≤ 0.

(12.66)

By (12.65) it follows skwT = skw(P ⊗ E), that is the classical condition of balance of angular momentum. The reduced inequality (12.66) holds if

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

749

F−1 P = −ρ∂E φ − ρ∂E2 φF−1 E + (C, θ)E˙ . If  is positive definite then this equation accounts for dissipation induced by E˙ . If  ∈ Skw or  = 0 then the model is regarded as non-dissipative. Anyway, since E˙ = FT LT E + FT E˙ then we obtain ˙ P = −ρF∂E φ − ρ∂E2 φE + FFT LT E + FFT E. If  ∈ Skw then FFT ∈ Skw too. In addition, we have a term in E˙ and one in WE, as we may find in the literature. Let φ = φ0 (C, θ) −

0 E · χ0 (θ)E E + a1 (θ)E E · E + a2 (θ)(tr C)E2 ], [E 2ρ

with tr C = F · F. Hence we have ˙ P = 0 (Fχ0 FT )E + a1 (FFT )E + a2 (tr C)E + FFT (D − W)E + FFT E. (12.67) Hence the interaction of the electric field with deformation results in the susceptibility tensor χ = Fχ0 FT + a1 FFT + a2 (tr C)1, while FFT (D − W)E + FFT E˙ is a model for the interaction of the electric field with the velocity gradient field. Indeed, observe that FFT (D − W)E + FFT E˙ = FFT DE + FFT (E˙ − WE). Hence the dependence on the time derivative E˙ is through the corotational (objective) derivative E˙ − WE while the dependence on the velocity gradient is through DE. If  ∈ Skw then, in terms of the axial vector λ of , we have FFT (E˙ − WE) = F[λ × FT (E˙ − WE)]. Equation (12.67) is a nonlinear version of the constitutive relation associated with the permittivity modelled in [274], Sect. 102. The linearized expression follows by observing F = 1 + ∇X u = 1 + ∇u(1 + ∇uF) whence FFT = 1 + 2ε + o(∇u). Consequently, to within linear terms we can write P = 0 [χ0 + (a1 + 3a2 )1 + 2a2 (tr ε)1] + DE + (E˙ − WE).

750

12 Electromagnetism of Continuous Media

If  ∈ Skw then, in terms of the axial vector λ, we have DE + (E˙ − WE) = λ × DE + λ × (E˙ − WE). It is worth remarking that E˙ − WE is the time derivative relative to a corotating observer.31 Hence, relative to a corotating observer, if  and W are constants then time-harmonic waves E = E0 exp(−iωt) are solutions and (E˙ − WE) = −iωλ × E; moreover λ plays the role of gyration vector. It is worth observing that sometimes [274] the permittivity tensor is assumed to be  = 0 + νD + iλW. ◦

A contribution νD to the permittivity occurs if  = ν1. In such a case  E= ν E˙ = −iωνE. A factored dependence on W seems to be in contradiction with thermodynamics. As with the dependence on D, in suspensions and in colloidal solutions of anisotropically shaped particles, the stretching D is regarded to orienting the particles suspended in the fluid. Hence the suspension is optically anisotropic as a consequence of the velocity gradient and this behaviour is called the Maxwell effect. It is then appropriate to view the suspension as a structured continuum where the orientation of particles is affected by the velocity gradient.

12.9.5 Magnetocaloric and Electro-Caloric Effects The magnetocaloric effect was discovered in 1881, when Warburg observed it in iron [441], and was explained independently by Debye [121] and Giauque [190]. They also suggested its practical use: the adiabatic demagnetisation used to reach temperatures lower than that of liquid helium, which had been the lowest achievable experimental temperature. One of the most notable examples of the magnetocaloric effect is in the chemical element gadolinium and some of its alloys. Gadolinium’s temperature increases when it enters certain magnetic fields. When it leaves the magnetic field, the temperature drops. This effect is considerably stronger for the gadolinium alloy (Gd5 Si2 Ge2 ) which has allowed scientists to approach to within one milliKelvin, one thousandth of a degree of absolute zero. The magnetocaloric effect is then applied as an alternative technology for refrigeration. In essence, in a magnetic refrigeration process a decrease in the applied magnetic field allows the magnetic domains of a magnetocaloric material to become disoriented by the agitating action of the thermal energy present in the material. If the material is isolated (adiabatic process) the temperature drops as the domains absorb the thermal energy to perform their reorientation. Namely, an observer rotating with angular velocity w equal to the axial vector of W, wi = − 21 i jk W jk .

31

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

751

Lately, a similar behaviour, called electro-caloric effect, was observed. In electrocaloric materials under adiabatic conditions, temperature increases when we apply an external electric field. When it leaves the electric field, the temperature drops. The electro-caloric effect is often considered to be the physical inverse of the pyroelectric effect. It should not be confused with the thermoelectric effect (specifically, the Peltier effect), in which a temperature difference occurs when a current is driven through an electric junction with two dissimilar conductors. We now provide a thermodynamic setting for the magnetocaloric and the electrocaloric effects. For the sake of simplicity we look at a rigid body at rest in an inertial frame. The Clausius–Duhem inequality reads ˙ + E · J − P˙ · E − μ0 M ˙ · H − 1 q · ∇θ ≥ 0. −ρ(ψ˙ + η θ) θ ˜ ˜ Assume ψ = ψ(E, H, θ) and, ignoring cross-coupling terms, P = P(E, θ), M = ˜ M(H, θ). Letting ρφ = ρψ − P · E − μ0 M · H we can write the Clausius–Duhem inequality in the form ˙ − E · J + 1 q · ∇θ ≤ 0. ρ(∂θ φ + η)θ˙ + (ρ∂E φ + P)E˙ + (ρ∂H φ + μ0 M)H θ ˙ can be chosen arbitrarily thanks to the appropriate values The values of E˙ and H of ∇ × H and ∇ × E which warrant the validity of Maxwell’s equations and do not ˙ E, ˙ H ˙ implies that affect the constitutive equations. The arbitrariness of θ, η = −∂θ φ, and

P = −ρ∂E φ,

μ0 M = −ρ∂H φ

(12.68)

1 E · J − q · ∇θ ≥ 0. θ

Look at the balance of energy, ˙ − ∇ · q + ρr. ρε˙ = E · J + E · P˙ + μ0 H · M

(12.69)

ρε = ρφ + ρθη + E · P + μ0 H · M

(12.70)

Since

then, in view of (12.68), by (12.69) it follows ρθη˙ = E · J − ∇ · q + ρr.

752

12 Electromagnetism of Continuous Media

Restrict attention to adiabatic processes, so that q = 0, r = 0. Inasmuch as E · J is negligible the process can be regarded also as isentropic, η˙ = 0. Consequently

˙ =0 ∂θ η θ˙ + ∂E η · E˙ + ∂H η · H

whence, by means of (12.68), 1 ˙ (∂θ P · E˙ + μ0 ∂θ M · H). θ˙ = − ρ∂θ η

(12.71)

It is worth observing that, by (12.70) and (12.68), ρ∂θ ε = ρθ∂θ η + E · ∂θ P + μ0 H · ∂θ M and hence ρθ∂θ η equals the specific heat, per unit volume, ρ∂θ ε, apart from E · ∂θ P + μ0 H · ∂θ M. Assume P and E, as well as M and H, are aligned so that we may restrict attention to the components P, E, M, H in the common direction. A generalized Curie– Weiss law may be applied to paramagnets and simple ferroics (ferromagnets and ferroelectrics).32 The electric and magnetic susceptibilities χe , χm are decreasing functions of the temperature θ, P = 0 χe (θ)E,

M = χm (θ)H

χe ≤ 0, χm ≤ 0.

As θ is greater than the Curie temperature θC the Curie–Weiss law is assumed to hold, Ce Cm χe = , χm = . θ − θC θ − θC Consequently ∂θ P E˙ = −0

Ce (E 2 )˙, 2(θ − θC )2

∂θ M H˙ = −

Cm (H 2 )˙. 2(θ − θC )2

If the strengths of the externally applied fields E 2 , H 2 decrease then ∂θ P E˙ > 0,

∂θ M H˙ > 0.

Moreover since ρθ∂θ η = ρ∂θ ε − E · ∂θ P − μ0 H · ∂θ M, and ρ∂θ ε > 0, for slowly varying fields ρθ∂θ η is positive yet smaller than the specific heat. Hence it follows from (12.71) that 32

See [442], Eq. (6.62).

12.9 Magneto-, Electro-, and Mechanical-Optical Effects

753

θ˙ < 0 (> 0) as (E 2 )˙ < 0 (> 0), (H 2 )˙ < 0 (> 0). Magnetic refrigeration To determine a connection between the changes of magnetic field and temperature we observe that by (12.68) it follows μ0 ∂θ M = ρ∂H η. For simplicity, disregard deformation and let the magnetic field H have a constant direction. Denote by H the component in the fixed direction and let η be given by a function η(θ, H ). Then η˙ = ∂θ η θ˙ + ∂ H η H˙ . In adiabatic conditions (q = 0, r = 0) we have η˙ = 0 and hence ∂θ η θ˙ + ∂ H η H˙ = 0. This shows the magnetocaloric effect, μ0 1 ∂ H η H˙ = − ∂θ M H˙ , θ˙ = − ∂θ η ρ∂θ η and hence a variation of H induces a variation of temperature. In a process from Hinitial to Hfinal , by a change of variable (t → H (t)) we have  θfinal − θinitial = −

Hfinal Hinitial

1 ∂H η d H = − ρ∂θ η



Hfinal Hinitial

μ0 ∂θ M d H. ρ∂θ η

The integrand is a function of θ, H ; often 1/ρ ∂θ η is denoted as θ/C, C being the specific heat. Since ∂θ η > 0, two types of processes may occur. If ∂θ M = ρ∂ H η/μ0 < 0 then H˙ > 0 implies θ˙ > 0; increasing the field H results in an increase of temperature (direct magnetocaloric effect). If ∂θ M = ρ∂ H η/μ0 > 0 then H˙ > 0 implies θ˙ < 0; increasing the field H results in a decrease of temperature (inverse magnetocaloric effect). For most magnetic materials ∂θ M < 0 and hence most often the direct effect occurs. A process of refrigeration is associated with the direct effect. It is worth commenting on the physical picture of the process. (1) Adiabatic magnetization. A magnetocaloric sample (∂θ M < 0) is in an isolated environment. By applying the field H the magnetic dipoles tend to align, the magnetic entropy decreases. Since the whole entropy is constant (η˙ = 0) then the lattice entropy (and heat content) increases and the temperature increases. (2) Isomagnetic transfer. While the field H is held constant, the added heat is removed. (3) Adiabatic demagnetization. The external magnetic field is removed, the magnetic spin system becomes disordered and captures energy from the lattice, which decreases the temperature.

754

12 Electromagnetism of Continuous Media

(4) Isomagnetic transfer. The magnetic field is held constant (H = 0) and the material is placed in thermal contact with the environment. Since the material is cooler then energy migrates into the magnetic material. The entropy of a magnetic material can be expressed in the form η(θ, H ) = ηm (θ, H ) + ηr (θ) + ηe (θ), where ηm is the magnetic entropy, ηr is lattice entropy, and ηe is the electronic entropy (see, e.g. [416]). When a field H is applied under adiabatic conditions the total entropy remains unchanged. Since the magnetic entropy ηm decreases (ordered spin system) then lattice and electronic entropies increase to compensate. This in turn causes a temperature increase. When the external field is removed the magnetic spin system returns to the disordered arrangement by capturing energy from the lattice, which decreases the thermal entropy and the temperature decreases.

12.10 Plasmas A plasma is a ionized gas with two electrically charged components, free electrons and ions. These particles generate electromagnetic fields through their elementary charges and currents. A macroscopic description of the plasma can be pursued in which emphasis is on its fluid nature. Depending on circumstances, this fluid description may be a one-fluid or a two-fluid approach. The two-fluid description is more appropriate if plasma temperatures and energy densities are of interest; the electron and ion temperatures are often quite different due to the weak energy exchange rates between ions and electrons. Furthermore, the plasma might be weakly ionized and in this case the neutral particles (atoms) should be modelled as a further fluid.33 In the two-fluid description the plasma is then modelled as a non-reacting binary fluid mixture; the ions and the electrons are treated as conducting fluids that are coupled through the interaction force and by Maxwell’s equations. Instead, the one-fluid description is appropriate for dense ionized gases; electrons and ions move in such a way that no separation of charge occurs. Hence the motion can be described as a single conducting fluid with the usual variables of mass density, velocity, and pressure. Two-fluid plasma theory The subscripts e and i denote quantities pertaining to electrons and ions, respectively, while α stands for both e and i. Denote by n e and n i the number density of electrons and ions (per unit volume). Let m e and −e be the mass (at rest) and the electric charge of the electron. In addition, we assume that all ions are equal, with mass (at rest) m i and electric charge ζe, where ζ is the number of electric charges per ion. Then the 33

See [329] for a general model of plasmas as several chemically-reacting constituents.

12.10 Plasmas

755

mass and charge densities are given by ρe = n e m e , qe = −n e e,

ρi = n i m i , qi = n i ζe.

Each constituent of the plasma obeys the laws of conservation of mass and electric charge, ∂t qα + ∇ · (qα vα ) = 0, (12.72) ∂t ρα + ∇ · (ρα vα ) = 0, both equations being accounted for by ∂t n α + ∇ · (n α vα ) = 0.

(12.73)

Also, each constituent is subject to the balance of linear momentum, ρα v` α = qα (E + vα × B) + ∇ · Tα + ρα b + mα ,

(12.74)

or, equivalently, ∂t (ρα vα ) + ∇ · (ρα vα ⊗ vα ) = qα (E + vα × B) + ∇ · Tα + ρα b + mα , (12.75) where b, as is customary, is identified with the acceleration gravity. Here me and mi are the interaction force densities, due to collisions, on electrons and ions. They are required to satisfy (12.76) me + mi = 0. Kinetic derivations of the balance equations indicate that [261] mα =

β

ρα (vα − vβ )ναβ .

The requirement (12.76) would imply that ρe νei = ρi νie . The fields E and B are related to the charges and the motions by ∇ ·E=

α

qα ,

∇ × B = ∂t E +

α

qα vα .

The balance of energy is written in the form34 ρα (εα + 21 vα2 )` = ∇ · (vα Tα − qα ) + qα vα · E + ρα (rα + b · vα ) + lα , where qα is the heat flux, rα is the heat supply and lα is the energy growth. The growths are required to satisfy (12.77) le + li = 0.

34

Here the growths are denoted by lα rather than eα .

756

12 Electromagnetism of Continuous Media

Equivalently we can write ∂t [ρα (εα + 21 vα2 )] + ∇ · [ρα (εα + 21 vα2 )vα ] = ∇ · (vα Tα − qα ) + qα vα · E + ρα (rα + b · vα ) + lα .

(12.78)

By virtue of the equation of motion the balance of energy reduces to ρα ε`α = Tα · Lα − ∇ · qα + ρα rα + lα − mα · vα .

(12.79)

12.10.1 One-Fluid Plasma Theory and Magnetohydrodynamics A simpler model of plasma is obtained by using appropriate one-fluid variables. Consistent with the theory of mixtures we consider the mass density ρ = ρe + ρi and the barycentric velocity v = (ρe ve + ρi vi )/ρ. Moreover we consider the charge density and the current density q = qi + qe = e(ζn i − n e ),

J = qe ve + qi vi .

By adding Eqs. (12.72) over α = e, i we find ∂t ρ + ∇ · (ρv) = 0,

∂t q + ∇ · J = 0.

Moreover, adding (12.75) over α yields ∂t (ρv) + ∇ · (ρv ⊗ v) = qE + J × B + ∇ · T + ρb, or, ρ˙v = qE + J × B + ∇ · T + ρb,

(12.80)

where, as is general for mixtures, T=

α

(Tα − ρα uα ⊗ uα ),

uα = vα − v.

The term J × B is referred to as total Lorentz force. Since J involves the velocities of ions and electrons, the evolution of J is expected to be related to the equations of motion (12.75). For the sake of simplicity we look for the evolution equation under the assumption that the constituents are inviscid fluids so that Tα = − pα 1. We then regard p = pe + pi as the pressure in the mixture (in the plasma) and write T = − p1 −

α ρα uα

⊗ uα .

12.10 Plasmas

757

Multiply (12.74) by qα /ρα to have qα v` α =

qα2 qα qα (E + vα × B) − ∇ pα + qα b + mα . ρα ρα ρα

By virtue of the identity qα v` α = ∂t (qα vα ) + ∇ · (qα vα ⊗ vα ) we compute the summation over α to obtain

qα2



qα (E + vα × B) − α ∇ pα + α mα . α ρα ρα ρα (12.81) By replacing vα with v + uα we find ∂t J + ∇ ·

α qα vα

α qα vα

⊗ vα =

⊗ vα = qv ⊗ v + J ⊗ v + v ⊗ J +

α qα uα

⊗ uα .

Hence, in view of the identity ∇ · (J ⊗ v + v ⊗ J) = J∇ · v + (v · ∇)J + v∇ · J + (J · ∇)v, we can write (12.81) in the form

qα2 (E + vα × B) α ρα



qα − α ∇ pα − ∇ · α qα uα ⊗ uα + α mα . (12.82) ρα ρα

J˙ + L J + (∇ · v)J + (∇ · J)v + ∇ · (qv ⊗ v) =

Adding (12.78) over α and using the standard definitions of T, q, and ρr we obtain the balance of energy for the mixture (plasma) in the form ρv · v˙ + ρε˙ = ∇ · (Tv) − ∇ · q + ρr + ρb · v + J · E. In view of the equation of motion (12.80) it follows that ρε˙ = T · D − ∇ · q + ρr + J · E − v · (qE + J × B).

(12.83)

A suggestive form of (12.83) arises by using h = J − qv. Substitution of J in (12.83) results in ρε˙ = T · D − ∇ · q + ρr + h · (E + v × B).

(12.84)

758

12 Electromagnetism of Continuous Media

Since v is the barycentric velocity,35 then we can view qv as the convection current and h as the current relative to the barycentric framework. Quite consistently, the effective electric field E + v × B is the field in the barycentric framework. It is of interest that (12.84) is consistent with the form obtained by means of the Poynting flux vector [98]. It is worth mentioning that Eqs. (12.80) and (12.84) look consistent with the corresponding equations in [88], once the current h is identified with j of [88]. Moreover, Eq. (9) of [88] amounts to letting ρε = 23 p and observing that ρε˙ = (ρε)˙+ ρε∇ · v so that (12.84) can be written as ( 23 p)˙+ 25 p∇ · v = Td · D − ∇ · q + ρr + h · (E + v × B),

Td = T + p1.

If Td , q, r, E, and B vanish (adiabatic condition) then we have ˙ = 23 ρ5/3 ( p/ρ5/3 )˙. 0 = ( 23 p)˙+ 25 p∇ · v = 23 ( p˙ − 53 p ρ/ρ) This motivates why often the energy equation is taken in the form (adiabatic condition) p = constant, ργ γ being the ratio of specific heats c p /cv ; γ = 5/3 for monoatomic gases. Other times the equation is taken in the form v · ∇η = 0, to state the constancy of entropy, in the adiabatic condition. As is often reasonable, we now assume the plasma is electrically neutral, that is q = qe + qi = 0,

n e = ζn i .

Hence the equation of motion simplifies to ∂t (ρv) + ∇ · (ρv ⊗ v) = J × B + ∇ · T + ρb, where J = qe (ve − vi ) = n e e(vi − ve ). Further, the neutrality assumption implies that ∇ · J = 0. 35

Or, roughly, the velocity of ions.

12.10 Plasmas

759

Let

e me

γe =

be the charge-to-mass ratio of electrons. Hence it follows

qα2 me = |qe |γe (1 + ζ ), α ρα mi

qα2 me vα = |qe |γe (ve + ζ vi ), α ρα mi

qα me − α ∇ pα = γe (∇ pe − ζ ∇ pi ), ρα mi

qα me mα = −γe (1 + ζ )me . α ρα mi

Upon substitution in (12.82) we obtain the equation for J in the form me me J˙ + L J + (∇ · v)J = |qe |γe (1 + ζ )E + |qe |γe (ve + ζ vi ) × B mi mi

me me +γe (∇ pe − ζ ∇ pi ) − ∇ · α qα uα ⊗ uα − γe (1 + ζ )me . (12.85) mi mi This equation may be viewed as the generalized Ohm’s law for a plasma since it relates the current J to the electric field E; the left-hand side has the same form of (9.69) for the mass flux hα . A simpler evolution equation for J follows as a consequence of further approximations. Since m i ≥ 1836 m e then it is reasonable to neglect ζm e /m i relative to unity. The pressures pe , pi are usually nearly equal and hence pe − ζ(m e /m i ) pi  pe . We then let p = pi + pe  2 pe . Equation (12.85) becomes me J˙ + L J + (∇ · v)J = |qe |γe E + |qe |γe (ve + ζ vi ) × B + ∇ · S − γe me , mi where S = 21 γe p1 −



qα uα ⊗ uα .

α

Since ρ  ρi then vi +

ζm e ζm e 1 1 ve  (ρi vi + n i m i ve ) = (ρi vi + ρe ve ) = v. mi ρ mi ρ

Moreover, ve +

ζm e ζm e ζm e ζm e vi = vi + ve − (vi + ve ) + ve + vi  v − (vi − ve ) mi mi mi mi

and |qe |(vi − ve ) = qi vi + qe ve = J.

760

12 Electromagnetism of Continuous Media

Hence we have |qe |γe (ve + ζ

me vi ) × B = |qe |γe v × B − γe J × B. mi

Further we let me be proportional to the velocity difference vi − ve , me = β(vi − ve ) =

β J, |qe |

β > 0.

Thus the generalized Ohm’s law (12.85) simplifies to γe β J˙ + LJ + (∇ · v)J + J = |qe |γe (E + v × B) − γe J × B + ∇ · S. |qe | It is of interest to look at Ohm’s law in the frame of ions, which amounts to neglecting terms

in vi and v, relative to ve , and the derivatives L and ∇ · v. We also regard ∇ · ( α qα uα ⊗ uα ) negligible compared to γe ∇ pe . Hence J = −|qe |ve and the equation for J reduces to β J = |qe |(E + v × B) − J × B + ∇ pe . J˙ + |qe | In stationary conditions, J˙ = 0, we have J = σ(E + v × B) −

|qe | |qe | J× B + ∇ pe , β β

(12.86)

where σ = |qe |2 /β is the conductivity. In Drude’s model of electric conduction, σ = γe |qe |τ , τ being the mean free time between collisions (of electrons). It is worth remarking that Eq. (12.86) provides two classical physical effects, the Seebeck effect and the Hall effect. First, let pe be a function of θ. Hence ∇ pe = pe ∇θ so that |qe | J × B + σS∇θ, J = σ(E + v × B) − β where S is the Seebeck coefficient (also called thermopower); here S = pe /|qe |. The physical view is that the charged particles (here electrons) in the hotter region move faster than those in the colder region. Hence particles in the hotter region diffuse further and produce a higher density in the colder region. Hence ve ∝ −∇θ, J ∝ ∇θ. Because of the connection between temperature and electric current, the Seebeck effect, discovered by T.J. Seebeck in 1821, is called thermoelectric effect after H.C. Ørsted. The Hall effect, here expressed by −(|qe |/β)J × B, is the production of a

12.10 Plasmas

761

voltage difference (the Hall voltage) across an electrical conductor, transverse to an electric current J in the conductor and to an applied magnetic field B (not parallel to J). MHD equations In magnetohydrodynamics

(MHD for short) two further approximations are adopted. Both nonlinear terms α ρα uα ⊗ uα and α qα uα ⊗ uα are neglected. The whole set of equations of MHD is then given by ∂t ρ + ∇ · (ρv) = 0, ρ˙v = J × B − ∇ p, J˙ + L J + (∇ · v)J + (γe β/|qe |)J = |qe |γe (E + v × B) − γe J × B + 21 γe ∇ p, ∇ × B = μ0 J + μ0 0 ∂t E ∇ × E = −∂t B, (12.87) On the whole there are 13 equations in the 13 unknowns ρ, v, J, E, and B; it is understood that the pressure p is expressed in terms of the mass density ρ. Sometimes (see, e.g. [344]) the charge continuity equation ∂t qe + ∇ · J = 0 is added to the set of equations and then q is the 14-th unknown, though the electrical neutrality is assumed. If electrical neutrality is not assumed then the equation of motion comprises the force q E and Ohm’s law is given by (12.82). In the literature on MHD the term ∇ · (J ⊗ v + v ⊗ J) is not considered and hence the left-hand-side of (12.87) is merely ∂t J + J/τ , where τ is an appropriate parameter, here τ = |qe |/γe β. In this sense the equation ∂t J +

1 J = |qe |γe (E + v × B) − γe J × B + 21 γe ∇ p τ

can be viewed as a rate-type equation for J, similar to the Maxwell–Cattaneo equation, though with the partial time derivative. Perhaps that is why, on physical grounds, it is said that, for very low frequencies, one can ignore the ∂t J term in the generalized Ohm’s law, whereas for low temperatures the ∇ p term can be ignored. In addition, it is assumed that the J × B term is negligible compared to the v × B term. Under these assumptions, Ohm’s law becomes J = σ(E + v × B). Hence the standard MHD is characterized by the set of equations ρ∂t v = J × B − ∇ p, ∂t ρ + ∇ · (ρv) = 0, J = σ(E + v × B), ∇ × B = μ0 J. ∇ × E = −∂t B,

(12.88)

The set (12.88) comprises 13 equations in the 13 unknowns ρ, v, J, E, and B while the pressure p is supposed to be a function of the density ρ. A comparison with kinetic derivations shows a satisfactory agreement. For instance, by (8.20) of [344], equality of the coefficients of E in the equation for ∂t J requires that

762

12 Electromagnetism of Continuous Media

|qe |γe =

ρe2 , memi

which is true as far as ρ  n i m i = n e m i . A further approximation leads to ideal MHD. Finite values of J with infinite (very high) values of the conductivity σ are allowed if the effective electric field vanishes, E + v × B = 0. This in turn provides E in terms of B, E = −v × B. The system of ideal MHD equations then reads ρ∂t v = J × B − ∇ p, ∂t ρ + ∇ · (ρv) = 0, ∇ × B = μ0 J. ∇ × (v × B) = ∂t B,

(12.89)

As to Eqs. (12.89) observe that the identity (∇ × B) × B = −∇ 21 B2 + (B · ∇)B allows the equation of motion to be written in the form ρ∂t v = −∇( p + B2 /2μ0 ) +

1 (B · ∇)B. μ0

(12.90)

If the field lines of B are straight and parallel, since ∇ · B = 0 then (B · ∇)B = 0. Thus the right-hand side reduces to −∇( p + B2 /2μ0 ). That is why B2 /2μ0 is called the magnetic pressure (see [261], Sect. 3.8). It is natural to expect that the standard Ohm’s law is obtained from (12.82) via suitable approximations. As emphasized above, this is the case in stationary conditions, J˙ = 0, with a negligible motion of the underlying body, v = 0, negligible pressure gradients and negligible quadratic terms qα uα ⊗ uα . We then let qe qe2 E + me = 0. ρe ρe If me = λn e n i (vi − ve )  −λn e n i J/qe then we have J = σE,

σ=

n e e2 . ni λ

12.10 Plasmas

763

The term generalized Ohm’s law is also used for the equation E = −v × B +

1 1 me 1 J+ (J × B − ∇ pe ) + J ⊗ J)], [∂t J + ∇ · (v ⊗ J + J ⊗ v − σ ne e ne e n e e2

often referred to as Hall-MHD model [110, 397]. The comparison with (12.82) indicates the approximations  q2 q2 n e e2 α  e = , ρ ρe me α α

 q2 q2 n e e2 n e e2 n e e2 e α vα  e vα = v+ ue  v− J. ρ ρe me me me me α α

We know that J is defined to be α qα vα  qe ve . Hence it is quite ambiguous to use the same symbol, J, to denote both qe ve and qe (ve − vi ) unless we neglect the underlying (ion) motion. Rather, J/σ + (1/n e e)J × B could be replaced by 1 1 h+ h × B, σ ne e where h = qe (ve − vi ). The balance of energy is sometimes written in the form [201] ∂t e + ∇ · [(e + p + B2 /2μ0 )v] = J2 /σ, where e = ρ(ε + 21 v2 ). The flux pv as a contribution to the variation of the energy ρ(ε + 21 v2 ) is standard, see (2.47). The occurrence of (B2 /2μ0 )v is less motivated. It emerges from (12.90) which merely motivates B2 /2μ0 as an equivalent pressure to the effect of motion. The right-hand side term J2 /σ is just given by the power term E · J along with Ohm’s law E = J/σ. Thermodynamic consistency Conceptually there are two scenarios for the statement of the second law according as we regard the plasma as a mixture or as a single (MHD) fluid. Within the mixture scenario we follow the assumption (9.41) and hence state that

α [ρα η`α

+∇ ·

rα qα + ∇ · kα − ρα ] ≥ 0 θα θα

must hold for any set of constitutive functions compatible with the balance equations. Substitution of ∇ · qα − ρα rα from (12.79) results in

α {ρα η`α

+

1 1 (−ρα + Tα · Lα + lα − mα · vα ) − 2 qα · ∇θα + ∇ · kα } ≥ 0. θα θα

Using the free energy ψα = εα − θα ηα we can write the inequality in the form

764

12 Electromagnetism of Continuous Media

1 1 [−ρα (ψ` α + ηα θ`α ) + Tα · Lα + lα − mα · vα ] − 2 qα · ∇θα + ∇ · kα } ≥ 0. θα θα

α{

A simple thermodynamically-consistent model is obtained by letting each constituent be a Navier–Stokes–Fourier fluid. Hence we let kα = 0 and ψα = ψα (θα , ρα ) and set pα = ρ2α ∂ρα ψα ,

ηα − ∂θα ψα ,

Tα = − pα 1 + 2μα Dα + λα (∇ · vα )1,

qα = −κα ∇θα ,

where μα ≥ 0, 2μα + 3λα ≥ 0, and κα ≥ 0 while  1 (lα − mα · vα ) ≥ 0. θα α As shown in Chap. 5, we can take mα and lα in the form mα =

β

Mαβ (vβ − vα ),

lα =

β Nαβ (θβ

− θα ) + v ·

β Mαβ (vβ

− vα ),

where M and N are symmetric matrices. Hence we have

1 Nαβ uα (lα − mα · vα ) = α,β (θβ − θα )2 + α,β Mαβ (vβ − vα ) · . α θα θα θβ θα The first term is non-negative provided Nαβ ≥ 0, ∀α = β. As with binary mixtures, it is ρe ue + ρi ui = 0 and hence we find that

α,β Mαβ (vβ

− vα ) ·

uα ρe 1 ρe 2 = Mei (1 + )( + )u . θα ρi θe ρi θi e

Accordingly Mei = Mie ≥ 0 guarantees the thermodynamic requirement. The whole set of differential equations governing the evolution of the plasma, subject to given fields E, B, and b, rα is then expressed by n` α + n α ∇ · vα = 0, ρα v` α = qα (E + vα × B) − ∇ · Tα + ρα b + ρα ε`α = Tα · Dα − ∇ · qα + ρα rα +

β

β

Mαβ (vβ − vα ),

Nαβ (θβ − θα ) − uα ·

β

Mαβ (uβ − uα ),

12.10 Plasmas

765

with α = e, i. So we have a closed system of 10 equations in the 10 unknowns n e , n i , ve , vi , θe , θi . If E and B are unknown then we complete the system by inserting the Maxwell equations. The MHD equations have different forms due to the various approximations made. The evolution of J is assumed to be given by an approximation of (12.85) where the right-hand side is replaced by a scalar times E , E = E + v × B. The evolution equation for q, instead, is regarded as a constitutive equation. We then have to take an objective time rate of q; for definiteness we adopt the Jaumann time rate. The whole system of equations is then taken in the form ρ˙ + ρ∇ · v = 0,

ρ˙v = J × B − ∇ p + ρb,

τ J (ρ, θ)(J˙ + LJ + (∇ · v)J) + J = σ(ρ, θ)E ,

τq (ρ, θ)(q˙ − Wq) + q = −κ(ρ, θ)∇θ,

ρε˙ = T · D − ∇ · q + ρr + J · E , ∇ × E = −∂t B,

∇ × B = μ0 J + μ0 0 ∂t E,

where τ j , τq > 0. The fields E and B are subject only to Maxwell’s equations; no constitutive properties are considered for them. The standard assumption on the entropy inequality and substitution of −∇ · q + ρr from the balance of energy yield ˙ − p∇ · v + Td · D + J · E − 1 q · ∇θ ≥ 0, −ρ(ψ˙ + η θ) θ where Td = T + p(ρ, θ)1. We let ψ, η, T be functions of ρ, θ, J, q, D, ∇θ while J˙ and q˙ are given by the corresponding rate equations. Moreover ψ is dif˙ the entropy ferentiable. Hence, upon evaluation of ψ˙ and substitution of J˙ and q, inequality becomes p 2 ˙ − ρ∂∇θ ψ · (∇θ)˙ − 3 ∂J ψ · J)ρ˙ − ρ(∂θ ψ + η)θ˙ − ρ∂D ψ · D ρ 1 +(Td + ρ∂J ψ ⊗ J − 13 ρ∂J ψ · J 1) · D − q · ∇θ θ 1 σ  1 κ  −ρ∂J ψ · (−WJ − J + E ) + J · E − ρ∂q ψ · (Wq − q − ∇θ) ≥ 0. τJ τJ τq τq

(−ρ∂ρ ψ +

We let Td + ρ∂J ψ ⊗ J − 13 ρ∂J ψ · J 1 = 0(|D|). At any point x and time t the fields ˙ D, ˙ and (∇θ)˙can take arbitrary values, independent of those of ρ, θ, D, g, J, q. ρ, ˙ θ, Hence the inequality holds only if

766

12 Electromagnetism of Continuous Media

∂D ψ = 0, ∂∇θ ψ = 0,

p = ρ2 ∂ρ ψ + 23 ρ∂J ψ · J, η = −∂θ ψ.

The skew-symmetry of W implies that ∂J ψ ⊗ J − ∂q ψ ⊗ q ∈ Sym. The remaining inequality holds only if σρ ∂J ψ = J, τJ

κρ 1 ∂q ψ = q, τq θ

ρ ρ ∂J ψ · J + ∂q ψ · q + (Td + ρ∂J ψ ⊗ J) · D ≥ 0. τJ τq The obvious integrations yield ψ = (ρ, θ) +

τq 2 τJ 2 J + q . 2ρσ 2ρκθ

Hence it follows that ∂J ψ ⊗ J, ∂q ψ ⊗ q ∈ Sym, separately, and that σ, κ > 0. The requirement (Td + ρ∂J ψ ⊗ J − 13 ρ∂J ψ · J 1) · D ≥ 0 holds if Td comprises a viscous stress, Td = −ρ(∂J ψ ⊗ J − 13 ∂J ψ · J 1) + 2μ(ρ, θ)D + λ(ρ, θ)∇ · v 1,

μ ≥ 0, 2μ + 3λ ≥ 0.

If, instead, as is often assumed in the thermodynamics of irreversible processes, τ J˙ + J = σE then it follows that p = ρ2 ∂ρ ψ,

Td = 2μ(ρ, θ)D + λ(ρ, θ)∇ · v 1

while the remaining results hold unchanged, even though J˙ is replaced by the corotational derivative J˙ − WJ. Linear plasma and MHD waves In this section we follow linear approximations in the sense that we let the pertinent quantities, like n e , ve , E, be the sum of equilibrium values and disturbances, e.g. ˜ t), n e = n 0 + n(x,

12.10 Plasmas

767

and examine the pertinent equations with only linear terms in the disturbances. We first examine oscillations and waves in an unmagnetized plasma, namely in a plasma where E and B are produced by the presence and the motion of electrons and ions. We begin with the so-called plasma oscillations that is the motion of electrons under the action of the self-generated electric field. We let n 0 be the density at equilibrium, that is in the configuration n i = n e = n 0 , ve = 0, E = 0. The electric field E satisfies Poisson’s equation ∇ ·E=

q . 0

Hence the continuity equation, the equation of motion and Poisson’s equation, in the linear approximation, result in ∂t n˜ + n 0 ∇ · ve = 0,

∂t ve = −γe E,

∇ ·E=−

ne ˜ . 0

The time derivative of the first equation and the divergence of the second one allows us to write n 0 e2 n. ˜ ∂t2 n˜ = 0 m Hence the density n˜ oscillates with the (angular) frequency ω p given by ω 2p =

n 0 e2 . 0 m

The frequency ω p > 0 is called the plasma frequency. We now look for electromagnetic waves by regarding E and B as produced within the plasma. The pertinent equations are Maxwell’s equations, ∇ · B = 0, ∇ · E = −

ne ˜ , ∇ × E = −∂t B, ∇ × B = μ0 (J + 0 ∂t E), 0

and the approximations J = −n 0 eve , ∂t n˜ + n 0 ∇ · ve = 0, m e ∂t ve = −e(E + ve × B) in the unknowns E, B, n, ˜ ve (and J). The approximation of the equation of motion is, in a sense, consistent with (12.87). If we neglect LJ + (∇ · v)J + (γe β/|qe |)J and γe ∇ pe in (12.87) then we are left with J˙ = |qe |γe (E + ve × B). Upon substitution of J = −n 0 eve we get the (linearized) equation of motion; ve × B is in fact nonlinear and then has to be neglected too. By applying ∇ × to m e ∂t ve we

768

12 Electromagnetism of Continuous Media

find that m e ∂t ∇ × ve = −e∇ × E = e∂t B. To within an inessential function of the position x we can write m e ∇ × ve = eB. Moreover by applying ∇ × to ∇ × B and ∂t to ∇ × E we then obtain B = μ0 n 0

e2 B + μ0 0 ∂t2 B. me

Time harmonic wave solutions exp[i(k · x − ωt)] occur subject to the dispersion relation k 2 = μ0 0 (ω 2 − ω 2p ), where k = |k|. Since μ0 0 = 1/c2 , c being the speed of light in free space, we can write ω 2 = k 2 c2 + ω 2p . At high frequencies, ω  ω p , it is ω  ck so that waves propagate at the speed of light (in free space). Instead, the wave vector k becomes imaginary36 when |ω| < ω p ; in this case the wave form is exp(−ki n · x) exp(−iωt) where ki = |ki | and n is any unit vector. The wave does not propagate and the disturbance damps as exp(−ki n · x). We say that the waves have a cut-off at ω = ω p . The refractive index n = c/(ω/k) takes the form n=



1 − (ω 2p /ω 2 ).

The relation k 2 = μ0 ω 2 0 (1 − ω 2p /ω 2 ) may be regarded as a dispersion relation k 2 = μ0 ω 2 with the dielectric constant  = 0 (1 − ω 2p /ω 2 ).

36

Waves with complex-valued wave vectors k are called inhomogeneous waves [85].

12.10 Plasmas

769

By allowing for a pressure gradient we find plasma oscillations in a form that is referred to as Langmuir waves. The whole set of (linearized) equations is ∇ · B = 0, ∇ · E = −

ne ˜ , ∇ × E = −∂t B, ∇ × B = μ0 (J + 0 ∂t E), 0

and the approximations J = −n 0 eve , ∂t n˜ + n 0 ∇ · ve = 0, n 0 m e ∂t ve = −n 0 eE − ∇ pe . Again the equation of motion, with the insertion of the pressure term −∇ pe is consistent with (12.87) in the approximation ∂t J = |qe |γe E + γe ∇ pe as it follows by replacing J and dividing by γe . The pressure pe is regarded as a function of ρe = n e m e , i.e. pe (x, t) = pˆ e (ρe (x, t)). Let pe (x, t) = p0 + p(x, ˜ t) so that ˜ n 0 m e ∂t ve = −n 0 eE − ∇ p. ˜ 0 . Moreover let Compute the divergence and replace ∇ · ve = −∂t n/n ˜ p˜ = cs2 ρ˜e = cs2 m e n, where cs2 = dpe /dρe and cs is the sound speed in the electron gas. Hence we find ˜ −∂t2 n˜ = ω 2p n˜ − cs2 n. Time harmonic wave solutions exp[i(k · x − ωt)] for n˜ occur with ω 2 = ω 2p + k 2 cs2 . When the action of the electron pressure is taken into account, we say that “warm” electrons are considered. In the Bohm–Gross model, the dependence of pe on n e is taken in the form p˜ = 3nk ˜ B θe , where k B is Boltzmann’s constant and θe is the uniform electron temperature. Hence it follows that cs2 = 3k B θe /m e . Upon substitution we can write ω 2 = ω 2p + (3k B θe /m e )k 2 , usually referred to as the Bohm–Gross dispersion relation for Langmuir waves. Owing to the pressure term, longitudinal (Langmuir waves) occur as we see at once

770

12 Electromagnetism of Continuous Media

by the equation of motion, i ω n 0 m e ve = n 0 e E + i p˜ k. More involved wave features occur in magnetized plasmas where the magnetic induction is the sum of a given constant field B0 and a space-time dependent B which is produced in the plasma by the motion of electrons and ions. To fix ideas let e3 be the unit vector of B0 . Since 0 = ∇ · (B0 + B ) = ∇ · B then k · B = 0,

B = B1 e1 + B2 e2 .

The continuity and the motion equations along with Faraday’s and Ampere’s equations are written in the form ∂t n˜ α + n α0 ∇ · vα = 0, ∇ × E = ∂t B ,

∂t vα = sgn qα γα (E + vα × B0 ),

∇ × B = μ0

α qα vα

+ μ0 0 ∂t E.

Time differentiation of the equation of motion allows us to write ∂t2 vα = sgn qα γα (∂t E + ∂t vα × B0 ) = sgn qα γα ∂t E + γα2 E × B0 − γα2 B02 vα⊥ , where vα⊥ = vα − (B0 · vα )B0 /B02 is the component of vα perpendicular to B0 . Let ωcα = γα B0 ; ωcα is called the cyclotron frequency (of the constituent α). Consequently, at a timeharmonic wave, vα ∝ exp(−iωt), 2 − ω 2 )vα⊥ = −iω sgn qα γα E⊥ + γα2 E × B0 , (ωcα

−ω 2 vα3 = −iω sgn qα γα E 3 .

The components vα1 , vα2 , and vα3 can then be given by γe i E 1 + (ωce /ω)E 2 , ω 1 − (ωce /ω)2

ve2 =

γe (ωce /ω)E 1 − i E 2 , ω 1 − (ωce /ω)2

ve3 =

γi −i E 1 + (ωci /ω)E 2 , ω 1 − (ωci /ω)2

vi2 =

γe (ωci /ω)E 1 + i E 2 , ω 1 − (ωci /ω)2

vi3 = −

ve1 = −

iγe E3 ω

and vi1 = −

iγi E3 ω

Now, with a view to Ampere’s law ∇ × B = μ0 (J + 0 ∂t E), we observe that J + 0 ∂t E =

 α

ρα0 γα vα − iω0 E = −iω0 E,

12.10 Plasmas

771



⎤ 1 i2 0  = ⎣ −i2 1 0 ⎦ 0 0 3

where

and 1 = 1 +

ω 2pe 2 − ω2 ωce

+

ω 2pi 2 −ω ωci

, 2 = 2

ω 2pi ω 2pi ω 2pe ωce ω 2pe ωci − ,  = 1 − − . 3 2 − ω2 2 − ω2 ω ωce ω ωci ω2 ω2

The plasma can then be viewed as a dielectric with dielectric tensor 0 . Since  ω pi me  ω pe mi in many applications, at high frequencies, the entries of  can be simplified to 1 = 1 +

ω 2pe 2 −ω ωce

, 2

2 =

ωce ω 2pe , 2 − ω2 ω ωce

3 = 1 −

ω 2pe ω2

.

We now look for wave solutions in an unbounded magnetized plasma. Faraday’s and Ampere’s equations are written in the form ∇ × E = −∂t B ,

∇ × B = μ0 0 ∂t E,

where the dielectric tensor accounts for the motions of electrons and ions. Hence ∇ × (∇ × E) = −

1 2 ∂ E, c2 t

where c2 = 1/μ0 0 , c being the light speed in free space. A time-harmonic plane wave is considered in the form E(x, t) = Eˆ exp[i(k · x − ωt)]. Upon substitution we see that E(x, t) is a solution if ˆ ˆ − k2 Eˆ = − ω E. (k · E)k c2 2

Maxwell’s equations ∇ · E = 0,

∇ · B = 0,

∇ × E = −∂t B

imply that E and B are perpendicular to k and to each other,

772

12 Electromagnetism of Continuous Media

k · B = 0,

k · E = 0,

E · B = 0.

We let k be parallel to B0 = B0 e3 so that the waves propagate parallel to the magnetic field (along B0 ). Hence E 3 = 0, B3 = 0. The vector Eˆ is then a solution to

ω2 c

2  − k 1 Eˆ = 0, 2

with Eˆ 3 = 0. Nontrivial values of Eˆ are allowed by the condition det

2  − k 1 =0 2

ω2 c

restricted to the 2 × 2 submatrix. Though  is complex valued it follows that k is real and k 2 = k2 takes the values 2 k± =

ω2 (1 ± 2 ). c2

Let k R = k+ =

ω 2pe 1/2 ω 1− , c ω(ω − ωce )

k L = k− =

ω 2pe 1/2 ω 1− . c ω(ω + ωce )

The corresponding eigenvectors Eˆ R,L are given by Eˆ = Eˆ 2 (ie1 + e2 ), Eˆ = Eˆ 2 (−ie1 + e2 ),

E R = Eˆ 2 exp[i(k R · x − ωt)](ie1 + e2 ), EL = Eˆ 2 exp[i(kL · x − ωt)](−ie1 + e2 ),

Since i = exp(iπ/2) we can write E R = Eˆ 2 {exp[i(k R · x − (ωt − π/2))]e1 + exp[i(k R · x − ωt)]e2 ) and the like for EL with ωt + π/2. Take the real part. At a fixed place x, as time increases we see that E R is a vector, of constant amplitude, rotating in the clockwise direction. Hence E R is a right circular polarized wave. Similarly, we find that EL is a left circular polarized wave. As a comment, the circular polarization of an electromagnetic wave is an electric field, with a constant magnitude, whose direction rotates with time at a steady rate in a plane perpendicular to the direction of the wave (in the plane k · x = constant). In electrodynamics the strength and direction of an electric field is defined by its electric field vector. In the case of a circularly polarized wave, the tip of the electric field vector, at a given point in space, describes a circle as time progresses. At any instant

12.10 Plasmas

773

of time, the electric field vector of the wave describes a helix around the direction of propagation. One well-known wave solution in a magnetized plasma is the Alfvén wave. In 1942 Alfvén recognized that a wave could propagate in an incompressible perfectly conducting fluid immersed in a strong magnetic field. In this model the magnetic field and the mass density provide the restoring force and the inertia. Here we show that this type of wave solution is allowed in a more general setting by allowing for the compressibility of the fluid and the restoring force of the pressure. We apply the MHD equations (12.89), so that ∂t ρ + ∇ · (ρv) = 0,

1 (∇ × B) × B − ∇ p, μ0

ρ∂t v =

∇ × (v × B) = ∂t B.

The pressure is a function of the density, possibly via the adiabatic law p = K ργ , or rather a function p(ρ, η), with the entropy η as a constant. We regard the plasma at equilibrium with mass density ρ0 and magnetic induction B0 . Then we imagine departures of B, ρ, v from the equilibrium values in the form B = B0 + B (x, t), ρ = ρ0 + (x, t), v = v(x, t). Upon substitution and linearization in the perturbations B , , v we have 1 B0 × (∇ × B ) = 0, μ0 ∂t B − ∇ × (v × B0 , ) = 0,

∂t  + ρ0 ∇ · v = 0, ρ0 ∂t v + s 2 ∇ +

where s 2 = ∂ρ p. The gradient of the first equation and the (partial) time derivative of the second one allows us to write ∂t2 v − s 2 ∇(∇ · v) +

1 B0 × {∇ × [∇ × (v × B0 )]} = 0. ρ0 μ0

Look for v as a time-harmonic plane-wave solution, v(x, t) = vˆ exp i(k · x − ωt), with polarization vˆ and direction of propagation (wave vector) k. Upon substitution we find that vˆ is required to satisfy −ω 2 vˆ + s 2 (k · vˆ )k −

1 B0 × {k × [k × (ˆv × B0 )]} = 0. ρ0 μ0

Since B0 × {k × [k × (ˆv × B0 )]} = −(k · vˆ )B20 k + B0 · k[(k · vˆ )B0 − (B0 · k)ˆv + (B0 · vˆ )k],

774

12 Electromagnetism of Continuous Media

letting vA = √

B0 ρ0 μ0

we can write [−ω 2 + (v A · k)2 ]ˆv + [(s 2 + v 2A )(k · vˆ ) − (v A · k)(v A · vˆ )]k − (v A · k)(k · vˆ )v A = 0. (12.91) If k is perpendicular to v A then (12.91) simplifies to −ω 2 vˆ + (s 2 + v 2A )(k · vˆ )k = 0. Hence the wave is longitudinal, vˆ  k, and propagates with the phase speed u = ω/k, k = |k|, given by u 2 = s 2 + v 2A . If instead k is parallel to v A then v 2A (k · vˆ )k − (v A · k)(v A · vˆ )k − (v A · k)(k · vˆ )v A = −k2 (v A · vˆ )v A and (12.91) becomes (k 2 v 2A − ω 2 )ˆv + (

s2 − 1)k 2 (v A · vˆ )v A = 0. v 2A

Hence there are two solutions according as vˆ is parallel to v A or is not. If vˆ  v A , and then vˆ  k, it follows that −ω 2 + s 2 k 2 = 0 and hence u2 = s2. This is a longitudinal wave, in fact a sound wave, unaffected by the magnetic induction B0 . If, instead vˆ is not parallel to v A then k 2 v 2A − ω 2 is required to vanish and this implies u 2 = v 2A . Moreover it is required that v A · vˆ = 0 and this in turn implies that k · vˆ = 0. As a consequence this is a transverse wave propagating with the Alfvén speed v A . This is a true Alfvén wave and is a purely MHD phenomenon where only the magnetic field provides the restoring force.

12.11 Chiral Media and Optical Activity

775

12.11 Chiral Media and Optical Activity Optical activity means that a linearly polarized plane of light will rotate as it passes through the medium, interacts with the state of an electromagnetic waves and couples selectively with either left or right circularly polarized component. Natural optical activity denotes that this behaviour occurs in materials without the application of appropriate external fields. Chiral materials are materials displaying natural optical activity. Our purpose here is to investigate some models of chiral materials. The vector fields E, B, D, H are assumed to be related by linear constitutive equations. In the simplest case D = E, B = μH;  is the permittivity and μ is the permeability tensor. If  = 1 we say that the material is electrically isotropic and the like for μ. Crystals are described by symmetric tensors. If two eigenvalues are equal, e.g. μ1 = μ2 = μ3 , then the crystal is said to be uniaxial and the principal axis with the unequal eigenvalue is called the optic axis. If the eigenvalues are unequal then the medium is said to be biaxial. If cross-coupling terms occur between the electric and magnetic fields then we might have D = E + ξH, B = ζE + μH. The material is said to be biisotropic or bianisotropic according as both , μ and ξ, ζ are isotropic or anisotropic. Chiral media are modelled in a number of ways.37 Isotropic chirality was modelled in the form D = [E + β∇ × E], B = μ[H + β∇ × H] by Drude, Born and Fedorov. Instead Condon modelled chirality in the form D = E − β∂t H,

B = μH + β∂t E.

We might formally consider the analogue by letting D, H be given by E and B. Boys and Post constitutive equations are given by D = E + iβB,

H = iβE + B/μ.

Boys-Post equations look like the Condon model in the frequency domain if the time dependence is in the form ∝ exp(−iωt). Here we let E and H be independent variables and assume the body is undeformable. For definiteness we investigate the thermodynamic consistency of the constitutive equations D = [E + β∇ × E], 37

See, e.g. [275], Chap. 3.

B = μ[H + γ∇ × H],

(12.92)

776

12 Electromagnetism of Continuous Media

D = E + α∂t H,

B = μH + λ∂t E,

(12.93)

the scalars , β, μ, γ, α, λ being possibly dependent on the temperature θ. Since the body is undeformable then we let φ = ψ + E · P + μ0 H · M and the entropy inequality (12.2) simplifies to ˙ + θ∇ · k ≥ 0 − ρφ˙ − ρη θ˙ − P · E˙ − μ0 M · H

(12.94)

while E, B, D, H are subject to Maxwell’s equations ˙ ∇ × E = −B,

˙ ∇ × H = D.

˙ To explore the consistency of equations (12.92) we let θ, E, H, ∇ × E, ∇ × H, E, ˙ be independent variables. Yet, if φ depends on E˙ and H ˙ then φ˙ comprises H ¨ + ∂H˙ φ · H. ¨ Now, Maxwell’s equations allow E ¨ and H ¨ to be given arbitrary ∂E˙ φ · E ˙ ˙ values in that ∇ × E and ∇ × H do not enter the constitutive equations. The arbi¨ and H ¨ implies that ∂E˙ φ = 0, ∂H˙ φ = 0. Hence we start with a function trariness of E φ(θ, E, H, ∇ × E, ∇ × H) for the free energy φ. Computing φ˙ we have ˙ − ρ∂∇ × E φ · ∇ × E˙ − ρ∂∇ × H φ · ∇ × H ˙ + θ∇ · k ≥ 0. −ρ(∂θ φ + η)θ˙ − (ρ∂E φ + P) · E˙ − (ρ∂H φ + μ0 M) · H

The arbitrariness of θ˙ implies that η = −∂θ φ. Now, divide the inequality by θ, 1 1 ˙ − ρ ∂∇ × E φ · ∇ × E˙ − ρ ∂∇ × H φ · ∇ × H ˙ + ∇ · k ≥ 0. − (ρ∂E φ + P) · E˙ − (ρ∂H φ + μ0 M) · H θ θ θ θ

To determine the remaining degree of arbitrariness we notice that, for any two vector fields w, u, we have the identity w · ∇ × u = ∇ · (u × w) + u · ∇ × w. Hence

(12.95)

ρ ρ ρ ∂∇ × E φ · ∇ × E˙ = ∇ · (E˙ × ∂∇ × E φ) + E˙ · ∇ × ∂∇ × E φ θ θ θ

and the like for H. Consequently we obtain the inequality 1 ρ 1 ρ ˙ − (ρ∂E φ + P + θ∇ × ∂∇ × E φ) · E˙ − (ρ∂H φ + μ0 M + θ∇ × ∂∇ × H φ) · H θ θ θ θ ρ ρ˙ × ∂∇ × H φ) ≥ 0. +∇ · (k − E˙ × ∂∇ × E φ − H θ θ

12.11 Chiral Media and Optical Activity

777

˙ It then follows that The inequality is linear in E˙ and H. ρ P = −ρ∂E φ − θ∇ × ∂∇ × E φ, θ and k=

ρ μ0 M = −ρ∂H φ − θ∇ × ∂∇ × H φ, θ

ρ˙ ρ˙ E × ∂∇ × E φ + H × ∂∇ × H φ. θ θ

We then look for the function φ which determines equations (12.92). Let φ = (θ) + 21 c1 E2 + 21 c2 H2 + c3 E · ∇ × E + c4 H · ∇ × H and assume θ is constant in space. Hence P = −ρc1 E − 2ρc3 ∇ × E,

μ0 M = −ρc2 H − 2ρc4 ∇ × H.

Consequently, since D = 0 E + P and B = μ0 (H + M) then D = (0 − ρc1 )E − 2ρc3 ∇ × E,

B = (μ0 − ρc2 )H − 2ρc4 ∇ × H.

Upon the identification of the coefficients, Eqs. (12.92) follow. In this regard we observe that the condition γ = β, or 2ρc4 2ρc3 = , 0 − ρc1 μ0 − ρc2 does not follow as a thermodynamic restriction. The Condon model involves a dependence of D and B (and hence of P and M) on ˙ H. ˙ Let φ, η, P, M, k be functions of θ, E, H, E, ˙ H. ˙ The entropy the derivatives E, inequality (12.94) becomes ˙ − ρ∂ ˙ φ · E ¨ − ρ∂ ˙ φ · H ¨ + θ∇ · k ≥ 0. −ρ(∂θ φ + η)θ˙ − (ρ∂E φ + P) · E˙ − (ρ∂H φ + μ0 M) · H E H

¨ and H ¨ By Maxwell’s equations and the definition of M and P it follows that E ˙ ˙ can take arbitrary values independent of ∇ E and ∇ H (occurring in ∇ · k). Hence it follows ∂H˙ φ = 0. ∂E˙ φ = 0, Moreover it follows η = −∂θ φ. Thus ∇ · k = 0 and the entropy inequality reduces to ˙ ≤ 0. (ρ∂E φ + P) · E˙ + (ρ∂H φ + μ0 M) · H The constitutive equations ˙ P = −ρ∂E φ − λE˙ − H,

˙ + E, ˙ μ0 M = −ρ∂H φ − ξ H

778

12 Electromagnetism of Continuous Media

where λ, ξ ≥ 0, are then consistent with thermodynamics. The Condon model is obtained, in the particular case λ, ξ = 0, by letting ∂E φ ∝ E and ∂H φ ∝ H.

12.11.1 Wave Solutions in Chiral Media Consider the function f(x, t) = (ξi1 + iζi2 ) exp{i[(k1 + ik2 ) · x − ωt)]}, where k1 and k2 are real-valued vectors. Then  f = ξ exp(−k2 · x) cos(k1 · x − ωt)i1 − ζ exp(−k2 · x) sin(k1 · x − ωt)i2 . At any point x, exp(−k2 · x) is fixed. Then  f traces out an ellipse in the plane perpendicular to k1 ; the semi-major and semi-minor axes of the ellipse have lengths a exp(−k2 · x) and b exp(−k2 · x). As k1 · x increases the ellipse is traced out in the clockwise direction if ξζ > 0 and in the counterclockwise direction if ξζ < 0. We now investigate the optical activity properties associated with some models of chiral media. Drude–Born–Fedorov model To begin with we consider the model (12.92). Hence the fields E, B, D, H are subject to Eqs. (12.92) and Maxwell’s equations ∇ · D = 0,

∇ · B = 0,

˙ ∇ × E = −B,

˙ ∇ × H = D.

Let E0 , H0 be complex-valued vectors in the form E0 = E 1 e1 + i E 2 e2 ,

H0 = H1 e1 + i H2 e2 ,

where E 1 , E 2 , H1 , H2 ∈ R. We look for solutions of the form E = E0 exp[i(kn · x − ωt)],

H = H0 exp[i(kn · x − ωt)].

(12.96)

These solutions represent plane waves propagating in the direction n with wave speed ω/k or slowness s = k/ω. For definiteness we let n = e3 . We will find that the model allows for circularly polarized waves. First we notice that, in view of (12.92), ∇ · D = 0 and ∇ · B = 0 imply ∇ · E = 0, ∇ · H = 0 and hence ∇ · E0 = 0, n · E0 = 0,

∇ · H0 = 0, n · H0 = 0;

(12.97)

12.11 Chiral Media and Optical Activity

779

both E and H are orthogonal to the direction of propagation n. Now, ∇ × E = −B˙ ˙ result in and ∇ × H = D kn × E0 = ωμ[H0 + iγkn × H0 ],

kn × H0 = −ω[E0 + iβkn × E0 ]. (12.98)

Since n · E0 = 0 and n · H0 = 0 then by applying the cross product n × to the first equation in (12.98) we find kE0 = −ωμ[n × H0 − iγkH0 ]. Upon substitution of E0 in the second equation of (12.98) we obtain an × H0 + ibH0 = 0, where a = k 2 /ω 2 − μ − μβγk 2 ,

b = μk(γ + β).

It follows that a H1 − bH2 = 0,

bH1 − a H2 = 0.

Nontrivial solutions occur if a 2 − b2 = 0, whence k2 = μ[1 + βγk 2 ± k(β + γ)]. ω2 If s = s+ = {μ[1 + βγk 2 + k(β + γ)]}1/2 then a = b and H1 = H2 ; the wave is left-circularly polarized. If s = s− = {μ[1 + βγk 2 − k(β + γ)]}1/2 then a = −b and H1 = −H2 ; the wave is right circularly polarized. Without the application of any appropriate external field, the Drude–Born–Fedorov model allows for the existence of two circularly polarized waves with different speeds of propagation. Condon model The fields D, E, B, H are subject to Maxwell’s equations and the constitutive equations ˙ ˙ D = E − g H, B = μH + g E. ˙ We look for solutions in the form (12.96). Hence Maxwell’s equations ∇ × H = D and ∇ × E = B˙ result in kn × H0 = −ωE0 − igω 2 H0 ,

kn × E0 = μωH0 − igω 2 E0 .

Since Eqs. (12.97) still hold then by applying the cross product n × to the first relation we have −kH0 = −ωn × E0 − igω 2 n × H0 .

780

12 Electromagnetism of Continuous Media

Replacing n × E0 and next E0 we obtain aH0 + ibn × H0 = 0, where a = −k 2 /ω 2 + μ − g 2 ω 2 ,

b = gk.

Hence it follows a H1 + bH2 = 0,

bH1 + a H2 = 0.

Nontrivial solutions occur if a 2 − b2 = 0, whence a = ±b,

−k 2 /ω 2 + μ − g 2 ω 2 = ±gk.

If a = b then H1 = H2 and the solution is a left (clockwise) circularly polarized wave; if a = −b then H1 = −H2 and the solution is a right (counterclockwise) circularly polarized wave. Hence also the Condon model allows for the existence of two circularly polarized waves with different speeds of propagation.

12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites) Ferrites are examples of magnetic materials with sufficiently low losses in the required range of frequencies. They are then usually described by a complex-valued anisotropic permeability in the frequency domain. Here we show how the complexvaluedness arises from an appropriate model in the time domain. Modelling of ferrites usually starts with the observation that the magnetic properties arise by the behaviour of spinning electrons, with magnetic dipole moment m and angular momentum s, with m = −γss,

γ=

e m

−γ being called the gyromagnetic ratio; e and m are the (positive) charge and the mass of the electron. If a magnetic intensity H occurs (in matter) then a torque μ0m × H is applied to the spinning electron. Hence s and m are subject to s˙ = μ0m × H,

˙ = γμ0m × H. m

(12.99)

m(t)| =constant. Further, if H is conThe solution m (t) to Eq. (12.99) is a function |m stant then also m · H is constant; the angle, θ say, between m and H is constant in that ˙ = 0. The vector m rotates around H with a constant angle. Morem · H)˙ = H · m (m over, by a direct check we see that the transverse part of m describes a circumference

12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites)

781

with constant angular frequency, ωL = γμ0 H, known in the literature as Larmor frequency. Letting N be the (constant) number of electrons per unit volume we have M = m. Hence Nm ˙ = −γμ0 M × H = γμ0 H × M. M Let H be the sum of a constant field H0 and a time-dependent field H . Let M = M0 + M , where M0 is the magnetization induced by H0 . If H = H0 then M = M0 and 0 = γμ0 H0 × M0 . Hence M0 is parallel to H0 ; let e3 be the common unit vector of H0 and M0 . If H = H0 + H then we have ˙ = γμ0 (H0 + H ) × (M0 + M ) = γμ0 [H0 × M + H × M0 + H × M ]. (12.100) M If ρ∂M ψ = αM, since M2 = M 2 + 2M0M · e3 + M02 then we have M2 + 2M0M · e3 + M02 ) + ψ0 , ρψ = 21 α(M ψ0 being independent of M . In light of (12.29) we have ˙ ≥ 0. (−αM + μ0 H) · M ˙ from (12.100) results in Substitution of M [−α(M0 e3 + M ) + μ0 (H0 e3 + H )] · γμ0 (H0 e3 × M − M0 e3 × M + H × M ) = γμ0 (αM0 e3 × H · M + μ0 M0 e3 × M + μ0 H0 e3 · H × M )) ≡ 0 for any value α. Wave solutions are determined by means of the linearized version of (12.100), i.e. by neglecting H × M. Here we review models compatible with the reduced inequality ˙ ≥ 0. (−αM + μ0 H) · M Take the rate equation ˙ = γμ0 H × M − λM × (M × H) + ν[(−αM + μ0 H) · M]M, M ˙ = γμ0 H × M follows by letting λ, ν = 0. We where λ, ν ≥ 0, γ ∈ R; equation M can split the effects of M by letting M = Mm, m being the unit vector of M. Hence

782

12 Electromagnetism of Continuous Media

˙ = Mm ˙ + M m, ˙ M

˙ = 0. m·m

Accordingly we can write ˙ + Mm ˙ = γμ0 MH × m − λM 2 m × (m × H) + ν M 2 [(−αMm + μ0 H) · m]m. Mm

˙ in the form ˙ allows us to split the effects of M The orthogonality between m and m M˙ = −αν M 3 + μ0 ν M 2 H · m,

˙ = γμ0 MH × m + λM 2 m × (H × m). m

Particular solutions are now considered. • Constant solutions: M(t) = M0 , m(t) = m0 . They happen if μ0 H0 · m0 = αM0 ,

0 = γμ0 M0 H0 × m0 + λM02 m0 × (H0 × m0 )

whence μ0 H0 = αM0 ,

0 = H0 × m0 .

Constant solutions occur provided m0 is collinear to H0 and M0 = μ0 H0 /α. • Solutions with constant magnitude, M(t) = M0 . They occur if μ0 H0 · m(t) = αM0 ,

˙ m(t) = γμ0 M0 H0 × m(t) + λM02 m(t) × (H0 × m(t)).

Hence m rotates around H0 at a constant angle, say β, such that cos β = αM0 /μ0 H0 . Consequently these solutions occur if M0 ≤ μ0 H0 /α. • Solutions with constant direction, m(t) = m0 . They occur if ˙ M(t) = −αν M 3 (t) + μ0 ν M 2 (t)H0 · m0 ,

whence

0 = γμ0 M(t)H0 × m0 + λM 2 (t)m0 × (H0 × m0 ),

˙ M(t) = −αν M 3 (t) + μ0 ν M 2 (t)H0 ,

0 = H0 × m0 .

These solutions then occur provided m0 is parallel to H0 . As a remark observe that if we let H = H0 e3 + H , M = M0 e3 + M the stationary solution H = H0 e3 , M = M0 e3 holds only if αM0 = μ0 H0 and then, consistently, α is required to equal μ0 H0 /M0 . Wave solutions Wave solutions in a ferrite medium are determined by modelling the body via linearized equations. Hence D and E are supposed to be related by D = E while M is governed by the linearized version of (12.100), namely ˙ = γμ0 (H0 e3 × M − M0 e3 × H ). M

(12.101)

12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites)

783

In addition electromagnetic fields satisfy Maxwell’s equations ˙ ∇ × H = E,

˙ ∇ × E = −B,

∇ · B = 0,

∇ · E = 0.

It follows from (12.101) that ˙ · e3 = (M M · e3 )˙. 0=M We then let M · e3 = 0. Since ∇ · B = 0 then H + M ) = μ0 ∇ · (H H + M ). 0 = μ0 ∇ · (H0 + M0 ) + μ0 ∇ · (H H + M ) = 0, the existence of wave solutions Consistent with the requirement ∇ · (H is now investigated in the complex-valued harmonic dependence H = H 0 exp[i(kz − ωt)],

M = M 0 exp[i(kz − ωt)],

(12.102)

where H · e3 = 0, M · e3 = 0, z being the Cartesian coordinate along e3 . Hence H 0 = H1 e1 + H2 e2 , H1 , H2 ∈ C, and the like for M . Hence ∇ · H = 0, ∇ · M = 0. In components, Eq. (12.101) becomes −iωM1 = γμ0 (−H0 M2 + M0 H2 ),

−iωM2 = γμ0 (H0 M1 − M0 H1 ).

Upon algebraic manipulations we find the magnetic susceptibility χm , ⎡

M = χm H ,

⎤ χ1 −iχ2 0 χm = ⎣ iχ2 χ1 0 ⎦ 0 0 0

where χ1 =

ωC ω M , ωC2 − ω 2

χ2 =

ωω M , ωC2 − ω 2

ωC = γμ0 H0 ,

ω M = γμ0 M0 .

Hence it follows that H, B = μ0 H0 + μH

μ = μ0 (1 + χm ),

that is the constitutive equation for the magnetic induction B subject to the assumptions (12.102) and (12.101). To establish the existence of solutions in the form (12.102) we observe that H and E are now subject to ∇ × H = −iωE,

∇ × E = iωμH.

784

12 Electromagnetism of Continuous Media

Since ∇ · H = ∇ · (H0 + H ) = 0 then the curl of ∇ × H leads to H=0 H + ω 2 μH H whence (−k 2 + ω 2 μ1 )H1 − iω 2 μ2 H2 = 0,

(−k 2 + ω 2 μ1 )H2 + iω 2 μ2 H1 = 0, (12.103) where μ1 = μ0 (1 + χ1 ), μ2 = μ0 χ2 . Nontrivial solutions occur if (k 2 − ω 2 μ1 )2 − ω 4 2 μ22 = 0. Hence there are two wavenumber solutions  k± = ω (μ1 ± μ2 ). Correspondingly, Eq. (12.103) implies H2 = ±iH1 ; accordingly k+ and k− are the wavenumber solutions of two positive and negative circularly polarized waves. Incidentally, since  √ k± = ω μ0 1 + ω M /(ωC ∓ω) then the negatively polarized wave (k− ) propagate with speed ω/k− for any value of ω. The positively polarized wave (k+ ) propagate with speed ω/k+ with a stopband. Since ωC + ω M − ω ωM = 1+ ωC − ω ωC − ω it follows that propagation is forbidden if ω ∈ [ωC , ωC + ω M ]. A linearly polarized plane wave may be viewed as the superposition of two counter-rotating circularly polarized waves and vice versa. Formally this is expressed by the identity w1 e1 exp[i(k · x − ωt)] = 21 (w1 e1 + iw1 e2 ) exp[i(k · x − ωt)] + 21 (w1 e1 − iw1 e2 ) exp[i(k · x − ωt)].

In a ferrite medium the two circularly polarized waves propagate at different speeds. If L is the length of the sample, say z ∈ [0, L], then the outgoing wave is 1 (w1 e1 + iw1 e2 ) exp[i(k+ z − ωt)] + 21 (w1 e1 − iw1 e2 ) exp[i(k− z 2 = 21 (w1 e1 + iw1 e2 ) exp[i((k+ − k− )z − ωt)] + w1 e1 exp[i(k− z

− ωt)] − ωt)]

12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites)

785

at z = L. The result is a linearly polarized wave, in the direction of the incident one, plus a rotated wave, the angle being (k+ − k− )L. Thermodynamic consistency Let the body be undeformable and then look at the thermodynamic inequality ˙ + E · P˙ + μ0 H · M ˙ ≥ 0. −ρ(ψ˙ + η θ) We let ψ, η, H and E be functions of θ, P, M while M is subject to the rate equation (12.100). By ˙ ≥ 0. −ρ(∂θ ψ + η)θ˙ + (E − ρ∂P ψ) · P˙ + (μ0 H − ρ∂M ψ) · M The linearity and the arbitrariness of θ˙ and P˙ imply η = −∂θ ψ,

E = ρ∂P ψ.

and the inequality reduces to ˙ ≥ 0. (μ0 H − ρ∂M ψ) · M

(12.104)

Assume ρ∂M ψ = αM. Consistent with (12.100) we let M = M0 e3 + M . Hence it follows ˙ = [μ0 (H0 e3 + H ) − α(M0 e3 + M )] · γμ0 [H0 e3 × M + H × M0 e3 + H × M ] (μ0 H − ρ∂M ψ) · M

It follows that

˙ =0 (μ0 H − ρ∂M ψ) · M

holds identically for any values of α, H0 , M0 . Accordingly, the model (12.100) of ferrite can be viewed as non-dissipative. Since also α might be a function of H0 then the model of ferrite is significantly nonlinear. Incidentally the free energy can be written in the form ρψ = (θ, P) + 21 α(M02 + 2M0M · e3 + M 2 ).

12.12.1 Magnetization Model with Damping Effects An improvement of the evolution law is due to Landau and Lifshitz [270] who established the equation

786

12 Electromagnetism of Continuous Media

˙ = γμ0 Heff × M − λM × (M × Heff ), M

(12.105)

where γ, λ ≥ 0; γ is a constant while λ is allowed to depend on |M|. The vector Heff denotes the effective magnetic field and is defined as Heff = H − ∂M ϕ(M) + (A · ∇∇)M, ϕ being a scalar function of M. Physically, the field (A · ∇∇)M is related to the exchange energy 21 Ai j ∂xi M · ∂x j M and ϕ(M) is the anisotropy energy; e.g. ϕ = β(M · e)2 , e being called the direction of easy magnetization. The term λM × (M × Heff ) produces a damping effect in the following sense. Observe that |M| is constant in time. This is so because ˙ = −λM · [M × (M × Heff )] = −λM · [(M · Heff )M − M2 Heff ] = 0 M·M and hence M2 is constant. Instead, the inner product with Heff results in 2 ˙ = −λHeff · [(M · Heff )M − M2 Heff ] = λM2 [Heff Heff · M −(

M · Heff )2 ] ≥ 0. |M|

Let H be constant. Then ˙ = |Heff | |M| cos˙ α 0 ≤ Heff · M α being the angle between H and M. Hence α decreases in time and this is the meaning of the damping. Having in mind Eq. (12.105) we now investigate possible evolution equations for the magnetization M. Consider an undeformable (non-conducting) solid. The entropy inequality can be written in the form ˙ + E · P˙ + μ0 H · M ˙ + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) ˙ be given by constitutive functions of the set of variables We let ψ, η, k and M ˙ ∇∇M). (θ, P, H, M, ∇M, M, The evolution equation for M is consistent with the objectivity requirement in that the ˙ is relative to the observer at rest with the body. solid is undeformable and the rate M Equation (12.105) is expected to be among those consistent with thermodynamics. Evaluation of ψ˙ and substitution in the entropy inequality results in ˙ − (ρ∂M ψ − μ0 H) · M ˙ − ρ∂∇M ψ · ∇ M ˙ −ρ(∂θ ψ + η)θ˙ − (ρ∂P ψ − E) · P˙ − ρ∂H ψ · H ¨ − ρ∂∇∇M ψ · ∇∇ M ˙ + θ∇ · k ≥ 0. −ρ∂ ˙ ψ · M M

Maxwell’s equations

12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites)

˙ ∇ × H = 0 E˙ + P,

˙ + M), ˙ ∇ × E = −μ0 (H ∇ · (H + M) = 0,

787

∇ · (0 E + P) = 0

˙ and P˙ can be chosen allows us to say that at any point x and time t the values H arbitrarily; the consistency with Maxwell’s equations is guaranteed by appropriate ˙ H, ˙ and P˙ implies values of ∇ × E and ∇ × H. The arbitrariness of θ, η = −∂θ ψ,

∂H ψ = 0,

E = ρ∂P ψ.

˙ is up to now an undetermined constitutive function we merely assume that Since M ∂M˙ ψ = 0,

∂∇∇M ψ = 0.

Next we divide throughout by θ and observe ρ ˙ = −∇ · ( ρ ∂∇M ψ M) ˙ + [∇ · ( ρ ∂∇M ψ)]M. ˙ − ∂∇M ψ · ∇ M θ θ θ Hence the inequality becomes 1 ρ ˙ + ∇ · (k − ρ ∂∇M ψ M) ˙ ≥ 0. [μ0 H − ρ∂M ψ + θ∇ · ( ∂∇M ψ)] · M θ θ θ We then let k=

ρ ˙ ∂∇M ψ M. θ

Consequently the entropy inequality reduces to ρ ˙ ≥ 0. [μ0 H − ρ∂M ψ + θ∇ · ( ∂∇M ψ)] · M θ

(12.106)

˙ on the independent variables satisfying inequality We look for dependences of M (12.106). For any two vectors v, w,the inequality v · w ≥ 0 holds if w = βM × v − λM × (M × v),

β ∈ R, λ ∈ R+ .

Indeed v · w = λ[M2 v2 − (M · v)2 ] ˙ v = μ0 H − ρ∂M ψ + and then λ ≥ 0 makes v · w ≥ 0 to hold. Hence letting w = M, θ∇ · (ρ∂∇M ψ/θ) we conclude that inequality (12.106) holds and meanwhile the evolution equation (12.105) holds too with

788

12 Electromagnetism of Continuous Media

Heff = H −

1 ρ [ρ∂M ψ − θ∇ · ( ∂∇M ψ)]. μ0 θ

For definiteness, let ψ = ψ0 (θ) + 21 [c1 |M|2 + c2 (M · e)2 + c3 ∇M · A∇M] where c1 , c2 and A ∈ Sym+ may depend on θ while e is a possibly-privileged unit vector. For simplicity let ρ, θ be uniform. Upon substitution of ψ the field Heff is found to have the form Heff = H −

ρ [c1 M + c2 |M · e|e + c3 (A · ∇∇)M]. μ0

Electromagnetic waves ˙ = 0. Denote by H = H0 e3 , Parallel fields M, H produce a stationary solution, M M = M0 e3 the stationary solution and let H = H0 e3 + H ,

M = M0 e3 + M .

˙ = γμ0 H × M − λM × (M × H) reads The linear approximation to M ˙ = γμ0 (H0 × M + H × M0 ) − λ(M0 H0M − M0 · M H0 + M0 · H M0 − M 2H ). M 0 (12.107) ˙ · e3 and find that We compute M ˙ · e3 = −λ(M0 H0M · e3 − M0 H0M · e3 + M 2H · e3 − M 2H · e3 ) = 0. M 0 0 Hence we let M · e3 = 0. Moreover, since ∇ · B = 0 then H + M ) = 0. ∇ · (H So we let also H · e3 = 0. We now investigate the existence of time-harmonic solutions, H = H 0 exp(−iωt),

M = M 0 exp(−iωt),

where H 0 = H1 e1 + H2 e2 and the like for M . Upon substitution in (12.107) we find the system of equations 

γμ0 H0 −iω + λM0 H0 −γμ0 H0 −iω + λM0 H0



M1 M2

We then compute M in terms of H to obtain



 =

 λM02 H1 + γμ0 M0 H2 . −γμ0 M0 H1 + λM02 H2

12.12 Anisotropic Non-Dissipative Magnetic Materials (Ferrites)



where

M1 M2



 =

χ1 χ2 −χ2 χ1



 H1 , H2

χ1 =

(−iω + λM0 H0 )λM02 + γ 2 μ20 H0 M0 , (γμ0 H0 )2 + (λM0 H0 )2 − ω 2 − 2iωλM0 H0

χ2 =

(−iω + λM0 H0 )γμ0 H0 − γμ0 H0 λM02 . (γμ0 H0 )2 + (λM0 H0 )2 − ω 2 − 2iωλM0 H0 

and hence B = μ H,

μ = μ0

 1 + χ1 χ2 . −χ2 1 + χ1

The field H satisfies the differential equation H + ω 2 μH H = 0. H

789

Part IV

Hysteresis and Phase Transitions

Chapter 13

Plasticity

Plasticity describes the non-reversible deformation of a material in response to applied forces. The physical mechanisms that cause plastic, that is, non-reversible, deformation can vary widely. For many metals, tensile loading applied to a sample will cause it to behave in an elastic manner. However, once the load exceeds a threshold, the extension increases more rapidly than in the elastic region. Moreover, when the load is removed, some degree of extension will remain. Perfect plasticity is a property of materials to undergo irreversible deformation without any increase in stresses or loads. Models of plastic materials involve a combination of elastic and (perfect) plastic behaviours. Consistent with the involved nature of plasticity this chapter exhibits well-known concepts (yield criteria) and classical models (gradient theories, Kröner decomposition). Next a decomposition-free approach to the modelling of plastic materials is developed through the essential role of the entropy production as a constitutive function. This allows a general scheme where the hyperelastic regime, the hypoelastic regime, and the hysteretic regime occur depending on the free energy and the entropy production. Definite examples are given of hysteretic models. Polymeric foams are also modelled as hysteretic materials.

13.1 Qualitative Aspects of the Stress–Strain Curve Also to introduce a useful terminology, consider a tension test on a cylindrical specimen. Let L 0 and A0 be the length and the cross-sectional area of the specimen; upon deformation they are changed to L and A. Let P denote the axial force and define s=

P , A0

σ=

P ; A

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_13

793

794

13 Plasticity

s and σ can be viewed as the Piola stress and the Cauchy stress. Let λ = L/L 0 and e = λ − 1. By increasing P (and hence s and σ) the s − e curve is essentially linear as is in elastic bodies. Beyond the elastic limit, called yield strength, the specimen shows a monotone increasing dependence of σ and s on e, with a decisively lower slope; this phenomenon is called strain hardening. Moreover, beyond the yield strength a reduction of P results in an elastic unloading curve. Unloading to zero stress produces a permanent plastic strain e p . At any point of this curve, upon increasing the stress (reloading) the stress–strain curve follows the elastic behaviour up to the point from which the unloading was initiated. By a further loading, the specimen follows the hardening curve. If, instead, P becomes a compression along the unloading curve then reverse yielding begins at the stress σr (< 0). If σ f is the value of stress from which the reversed deformation was initiated in the hardening curve, then it happens that |σr | < σ f ; the stress at which plastic flow recommences upon load reversal is smaller than that at which the reversed loading was initiated (Bauschinger effect). The closed interval [σr , σ f ] is called the elastic range. The rather complicate character of plasticity gives reasons of the various approaches set up to describe the corresponding phenomena. Many approaches are based on the decomposition of the displacement gradient H = ∇R u as H = He + H p , where He is the elastic distortion and H p is the plastic distortion. Hence, the elastic and plastic strains are considered in the form1 E e = sym He ,

E p = sym H p .

In the simplest approach, it is assumed that tr H p = tr E p = 0, which holds for isochoric small deformations. In the isothermal approximation, the Clausius–Duhem inequality, in the Lagrangian description, can be taken in the simple form ρ R ψ˙ − T R · F˙ ≤ 0. Since T R = J TF−T , in the approximation of small deformations T R  J T ∈ Sym. Hence2 e p T R · F˙  T R · E˙ = T R · E˙ + T R · E˙ . Consequently, the Clausius–Duhem inequality can be written as 1 To avoid ambiguities with the different strain tensors we observe that E = (C − 1)/2 is the Green– St. Venant strain, E = symH is the strain, and ε = sym∇u is the infinitesimal strain. 2 Henceforth the power T · E˙ is meant to be J T · ε ˙. R

13.1 Qualitative Aspects of the Stress–Strain Curve

795

ρ R ψ˙ − T R · E˙ − T R · E˙ ≤ 0. e

p

Let T0 := dev T R . The assumption3 tr E p = 0 implies that T R · E p = T0 · E˙ . p

If now it is assumed that energy dissipation is confined to the plastic behaviour then the Clausius–Duhem inequality splits into e ρ R ψ˙ − T R · E˙ = 0,

p T0 · E˙ ≥ 0.

(13.1)

A simple way to satisfy the inequality in (13.1) might be to take T0 = κ

p E˙ p , |E˙ |

κ ≥ 0,

as E˙ = 0. Yet the stress–strain curve indicates a more involved behaviour and this is realized by means of a scalar internal variable S, called the hardening variable, such that p T0 = Tˆ 0 (E˙ , S). p

The scheme is completed by a constitutive equation for S; the evolution of S is assumed to be given by a rate equation p S˙ = h(E˙ , S),

(13.2)

the function h providing the hardening rate.

13.1.1 Yield Criteria Ductile materials can be stretched without breaking and drawn into thin wires; aluminum, copper, tin, mild steel, platinum, and lead are examples of ductile materials. Yield points are the points below which the body deforms elastically and beyond which the body deforms plastically with increase in stress applied. If the body is subject to a uniaxial tensile load it will start yielding, i.e. deforming plastically, when the stress reaches the yield stress, say σ y . When the state of stress is triaxial a single stress does not characterize the yielding. In general, we can assume that yield occurs at a point when some combination of the stress components reaches some critical value, say 3

The plastic behaviour is usually ascribed to the microscopic structure and the occurrence of dislocations. The flow of dislocations is regarded to be isochoric and hence H p is assumed to be deviatoric, tr H p = 0.

796

13 Plasticity

F(T) = k, F(T) denoting a function of the components of T while k is a parameter to be determined experimentally. Alternatively we may express the dependence on T by means of the principal stresses T1 , T2 , T3 and the principal directions n1 , n2 , n3 . If the material is isotropic the function F is independent of the directions {ni } and hence the yield criterion simplifies to F(T1 , T2 , T3 ) = k. Since the principal invariants I1 , I2 , I3 of stress are independent of the material orientation then we might write the condition in the form4 F(I1 , I2 , I3 ) = k

or

F(I1 , J2 , J3 ) = k,

where J2 , J3 are the principal invariants of the deviator T0 . Often it is assumed that the yield stress is independent of I1 = tr T (hydrostatic stress) and hence the condition becomes F(J2 , J3 ) = k. For definiteness observe that for ductile materials there are two main yield criteria, von Mises criterion and Tresca criterion. Let Tm = 13 tr T = 13 (T1 + T2 + T3 ). The scalar Tm is named the hydrostatic stress; it is a common assumption that Tm does not influence the yielding. Hence, the yield criterion is characterized by a condition on the deviator, T0 = T − Tm 1.

von Mises Criterion The von Mises criterion is in fact the result of subsequent contributions of Lévy, von Mises, Prandtl, and Reuss. Indeed, it seems that the criterion was anticipated by Maxwell in 1856 [216]. We show that the criterion is in fact based on considerations about the strain energy. Let {Ti }, {E i } be the principal stresses and strains. Based on linear elasticity we consider the energy, per unit volume, in the form UT = 21 Ti E i .

4

Though the function changes depending on the pertinent variables we keep using the same symbol F.

13.1 Qualitative Aspects of the Stress–Strain Curve

797

In view of the relations (4.10), we find UT =

1 [T 2 + T22 + T32 − 2ν(T1 T2 + T1 T3 + T2 T3 )]. 2E 1

Now consider what is called thehydrostatic energy, that is, the energy associated with the isotropic stress Tm = 13 i Ti . Replacing T1 , T2 , T3 in UT with Tm , we find UH =

1 − 2ν 2 [T1 + T22 + T32 + 2(T1 T2 + T1 T3 + T2 T3 )]. 6E

Consequently, the distortion energy U D = UT − U H can be written in the form UD =

1+ν 2 1+ν [T1 + T22 + T32 − T1 T2 − T1 T3 − T2 T3 ] = [(T1 − T2 )2 + (T2 − T3 )2 + (T2 − T3 )2 ]. 3E 6E

In terms of U D the (equivalent) von Mises stress is defined as TMises := [T12 + T22 + T32 − T1 T2 − T1 T3 − T2 T3 ]1/2 and the von Mises criterion is assumed in the form TMises = Y, Y being the yield stress. Equivalently, we have (T1 − T2 )2 + (T2 − T3 )2 + (T1 − T3 )2 = 2Y 2 .

(13.3)

To substantiate the von Mises criterion (13.3) we observe that an alternative statement is |J2 | = k 2 . Since |J2 | = 16 [(T1 − T2 )2 + (T2 − T3 )2 + (T1 − T3 )2 ] then the criterion becomes (T1 − T2 )2 + (T2 − T3 )2 + (T1 − T3 )2 = 6k 2 . In a uniaxial tension test, we have Tmax = Y,

2 2Tmax = 6k 2 ,

and hence the von Mises criterion (13.3) follows.

√ k = Y/ 3

798

13 Plasticity

Tresca Criterion The Tresca criterion states that a material will yield if the maximum shear stress reaches a critical value. This is made formal by assuming max{ 21 |T1 − T2 |, 21 |T2 − T3 |, 21 |T1 − T3 |} = k. In a simple tension test, we have k = Y/2. Hence we find the criterion max{|T1 − T2 |, |T2 − T3 |, |T1 − T3 |} = Y.

(13.4)

It is of interest to contrast the two relations (13.3) and (13.4) in the plane stress condition T3 = 0. Equation (13.4) simply becomes max |T1 − T2 | = Y . In the same plane geometry Eq. (13.3) becomes (T1 − T2 )2 + T22 + T12 = 2Y 2 . Change the stress representation by taking a rotation of π/4 counterclockwise. We have √ √ Tˆ2 = (1/ 2)(T1 − T2 ). Tˆ1 = (1/ 2)(T1 + T2 ), Upon substitution we find

Tˆ1 Tˆ2 + = Y 2, 2/3 2

which with major half-axis equal √ is the equation of the ellipsis, centred at the origin, √ to 2 Y , at θ = π/4, and minor half-axis equal to 2/3 Y .

13.2 Rate-Independent Scheme of Plastic Flow Rate-independent models for plasticity are widely applied, at least at low temperatures. Observe that a change in time scale is a transformation t → t ∗ of the form t ∗ = p αt, α > 0; a function f is rate independent if f (t ∗ ) = f (t). Let E˙ = 0. The tensor p E˙ N := p |E˙ | p

is rate independent in that d p p E (αt) = αE˙ (t). dt Apparently N p is invariant. As for the function h in (13.2) we observe that

13.2 Rate-Independent Scheme of Plastic Flow

α S˙ = h(αE˙ , S), p

799 p S˙ = α−1 h(αE˙ , S).

Hence the flow rule, for T0 , and the hardening equation, for S, are rate independent if p S˙ = h(N p , S) |E˙ |. T0 = T˜ 0 (N p , S), p p By the assumption T0 · E˙ ≥ 0, dividing by |E˙ |, we have

Y (N p , S) := N p · T˜ 0 (N p , S) ≥ 0; the scalar Y is called the flow resistance. A further assumption is made on the function T˜ 0 , namely, p E˙ T0 = p = Np, |T0 | |E˙ |

referred to as the codirectionality of T0 and E˙ . Consequently p

|T0 | = T0 · N p = Y (N p , S) and hence T0 = Y (N p , S)N p . A further restriction is taken on the functions Y (N p , S) and h(N p , S), namely, that they be independent of the flow direction N p . Hence, we have the Mises flow equations p S˙ = h(S)|E˙ |. T0 = Y (S)N p , These equations are operative once the initial condition for S is chosen. We let Y (S) = S and state the initial-value problem in the form p S˙ = h(S)|E˙ |,

S(0) = 0.

(13.5)

Let e p denote the accumulated plastic strain e p defined by p e˙ p = |E˙ |,

e p (0) = 0,

the dependence on the position X being omitted. Hence t

p e p (t) = ∫ |E˙ |(τ )dτ ≥ 0. 0

We now show that the hardening variable S can be related by a 1–1 correspondence with e p . Consider the initial-value problem

800

13 Plasticity

dS = h(S), de p

S(0) = 0.

ˆ p ) we observe To determine the function S(t) = S(e d Sˆ p p S˙ = e˙ = h(S)|E˙ |. de p ˆ p ) is a solution of (13.5). This in turn allows us to let the flow Hence the function S(e resistance Y be a function of e p , ˆ p )). Y = Y (e p ) = Y ( S(e p While |T0 | = Y if E˙ = 0, an additional assumption is considered to characterp ize no-flow states, that is, |T0 | ≤ Y and E˙ = 0 for |T0 | < Y . The model is then expressed in terms of the accumulated plastic strain e p by means of the Mises–Hill equations p T0 = Y (e p )N p for E˙ = 0;

|T0 | ≤ Y (e p ),

p e˙ p = |E˙ |, e p (0) = 0,

p E˙ = 0 for |T0 | < Y (e p ).

13.2.1 Consistency of Mises–Hill Model with a Defect Energy p p Instead of viewing the flow E˙ as a purely dissipative term, namely, T0 · E˙ ≥ 0, we now generalize the scheme by letting the free energy ψ be the sum,

ψ = ψe + ψ p , of an elastic free energy ψ e and a plastic free energy ψ p , this being associated with the defect energy in the solid. For simplicity, it is assumed that e ρ R ψ˙ e = T · E˙

and hence the Clausius–Duhem inequality reduces to p ρ R ψ˙ p − T0 · E˙ ≤ 0.

Let ψ p be a function of e p . It is assumed that g(e p ) := ρ R

dψ p de p

(13.6)

13.3 A Temperature-Dependent Model

801

is subject to g(e p ) > 0 for e p > 0.

g(0) = 0, Since e˙ p = |E˙ | then p

ρR

d p p p ψ (e ) = g(e p )|E˙ | ≥ 0. dt

p p Moreover, since |E˙ | = E˙ · N p then inequality (13.6) becomes p (gN p − T0 ) · E˙ ≤ 0.

Let p Tdis 0 := T0 − gN .

The inequality becomes

˙p Tdis 0 ·E ≥ 0

dis and Tdis 0 can be viewed as the dissipative part of the deviatoric stress T0 ; T0 can then be taken in the form p Tdis 0 = Y0 N ,

Y0 being any positive-valued function of e p . For the sake of simplicity we let Y0 be constant. Consequently, we have T0 = [Y0 + g(e p )]N p . The Mises–Hill equation T0 = Y (e p )N p is recovered by simply letting Y (e p ) = Y0 + g(e p ).

13.3 A Temperature-Dependent Model The Mises–Hill model may be generalized by letting the stress tensor and the free energy depend also on the temperature. In addition to provide a more precise model, the temperature dependence allows us to make a better distinction between dissipative and non-dissipative powers.5 Still within the approximation of small deformations, the stress power is written as T R · E˙ . We then omit the suffix R for T and write the Clausius–Duhem inequality in the form ˙ + T · E˙ − 1 q R · ∇R θ ≥ 0. −ρ R (ψ˙ + η θ) θ

5

This section parallels Chap. 81 of [216].

802

13 Plasticity

We let E = E e + E p and use the accumulated plastic strain e p to describe the plastic p p e p behaviour; since e˙ p = |E˙ | then E˙ = e˙ p N p . Hence T · E˙ = T · E˙ + T · E˙ and T · E˙ = τ e˙ p , p

τ := T0 · N p ,

so that τ , called the resolved shear, is the deviatoric stress resolved on the direction of plastic flow N p . To determine the set of constitutive equations, we start with the assumption that E e , θ) + ψ p (e p , θ) E e , e p , θ) = ψ e (E ψ = ψ(E and let T, η, q R depend on E e , e p , θ, ∇R θ. Instead, to allow for dissipativity, we let τ depend on e p , θ, e˙ p . The Clausius–Duhem inequality can be written as 1 e −ρ R (∂θ ψ + η)θ˙ − ρ R (∂E e ψ e − T) · E˙ − ρ R (∂e p ψ p − τ )e˙ p − q R · ∇R θ ≥ 0. θ e The arbitrariness of θ˙ and E˙ implies that

η = −∂θ ψ,

T = ∂E e ψ e ,

1 ρ R (∂e p ψ p − τ )e˙ p + q R · ∇R θ ≤ 0. θ If τ is independent of ∇R θ then it follows (τ − ∂e p ψ p )e˙ p ≥ 0. Let τdis := τ − ∂e p ψ p ,

τ = τdis + ∂e p ψ p .

Hence τdis e˙ p ≥ 0. Since e˙ p ≥ 0 then the inequality holds if and only if τdis ≥ 0. The positive valuedness of τdis is realized by a positive-valued function Ydis (e p , e˙ p , θ) or Ydis (e p , θ). Finally, letting e˙ p = 0 we obtain the heat-conduction inequality q R · ∇R θ ≤ 0. To complete the scheme we may require the codirectionality constraint,

13.4 Gradient Theories of Plasticity

803

T0 = |T0 |N p . Hence τ = |T0 |,

T0 = τ N p = (Ydis + ∂e p ψ p )N p .

It is expected that the plastic stress power T0 · E˙ be non-negative. Now, the free energy ψ p is taken to represent the defect energy and it is likely that the defect energy increases as the number of defects, described by e p , increases. Hence, we let p

∂e p ψ p ≥ 0, whence ∂e p ψ p e˙ p ≥ 0. As a consequence p T0 · E˙ = τ e˙ p = (Ydis + ∂e p ψ p )e˙ p ≥ 0.

It is worth checking the consequences of the model so established on the balance of energy. By ρ R ε˙ = T · E˙ − ∇ R · q R + ρ R r we have

˙ + θη) ˙ = T · E˙ − ∇ R · q R + ρ R r. ρ R (ψ˙ + θη

Owing to the thermodynamic restrictions, we find ρ R θη˙ = ρ R r − ∇ R · q R + (τ − ρ R ∂e p ψ p )e˙ p = ρ R r − ∇ R · q R + Ydis e˙ p . Hence ρ R θη˙ − (ρ R r − ∇ R · q R ) = Ydis e˙ p ≥ 0; the plastic flow produces a growth of entropy.

13.4 Gradient Theories of Plasticity Gradient theories refer to models involving constitutive dependencies on the gradient of the plastic strain. This scheme is motivated by the need of justifying flow rules related to the size dependence. Attempts at theories with size dependence were originated by the flow rule [2] T0 = (Y (e p ) − β R e p )N p , which implies that the resolved shear τ = T0 · N p takes the form τ = Y (e p ) − β R e p ,

804

13 Plasticity

 R being the Laplacian relative to the reference coordinates. A thermodynamic scheme allowing for the dependence of the resolved shear on  R e p may be given by taking E e , e p , ∇R e p , e˙ p as the set of independent variables. Again we let e Te · E˙ + τ p · e˙ p

be the power density, where τ p = T0 · N p . In addition we let E e ) + ψ p (e p , ∇R e p ) ψ R = ψ e (E E e ). Neglecting heat conduction, the be the reference free energy6 and Te = Te (E second-law inequality is taken in the form e −ψ˙ R + T · E˙ + τ p e˙ p + θ∇R · k R ≥ 0,

where k R is the reference entropy flux. Hence we have e (−∂E e ψ e + Te ) · E˙ + (−∂e p ψ p + τ p )e˙ p − ∂∇R e p ψ p · ∇R e˙ p + θ∇R · k R ≥ 0.

Divide by θ and observe 1 1 1 ∂∇R e p ψ p · ∇R e˙ p = ∇R · ( ∂∇R e p ψ p e˙ p ) − [∇R · ( ∂∇R e p ψ p )]e˙ p . θ θ θ The inequality can then be written as 1 1 1 1 e (−∂E e ψ e + Te ) · E˙ + [−∂e p ψ p + θ∇R · ( ∂∇ e p ψ p ) + τ p ]e˙ p + ∇R · [k R − ( ∂∇ e p ψ p e˙ p )] ≥ 0. θ θ θ R θ R

Hence it follows that Te = ∂E e ψ e , where

τ p = τenp + τdis ,

1 τenp = ∂e p ψ p − θ∇R · ( ∂∇R e p ψ p ), θ

kR =

1 ∂∇ e p ψ p e˙ p , θ R

τdis e˙ p ≥ 0.

The flow rule is then given by 1 τ = τdis (e p , ∇R e p , e˙ p ) + ∂e p ψ p − θ∇R · ( ∂∇R e p ψ p ). θ See [216], Sect. 89; ψ e and ψ p are said to represent the elastic strain energy and the defect energy, respectively.

6

13.5 Model of Plasticity via the Kröner Decomposition

805

Aifantis relation for τ is recovered by letting ψ p = 21 β|∇e p |2 , with β > 0 constant, and

τdis = Y (e p ) > 0.

If the elastic stress is taken in the classical linear form Te = 2μEe + λ(tr Ee )1 then the corresponding free energy is given by ψ e = μ|Ee |2 + λ(tr Ee )2 . It is worth remarking that in [216], Sect. 89, the Aifantis flow rule is framed within a thermodynamic framework by introducing a microscopic hyperstress vector ξ p so that the power comprises the additional term ξ p · ∇ e˙ p . The assumption ξ dis = 0 of [216], Sect. 89.4, makes ξ to coincide with ∂∇R e p ψ p and hence, if ∇R θ = 0, the result (89.44) of [216] coincides with the present expression of τ .

13.5 Model of Plasticity via the Kröner Decomposition The additive decomposition H = He + H p can be viewed as a linearization of the displacement gradient to apply in the case of small deformations. A modelling of large-deformation plasticity is based on the multiplicative decomposition F = Fe F p ;

(13.7)

Fe and F p are said to represent the elastic distortion and the plastic distortion. The relation (13.7) is referred to as the Kröner decomposition [262]. The standard assumption det F > 0 is guaranteed by letting det Fe > 0,

det F p > 0;

hence both Fe and F p are invertible. While F = ∇R χ is the gradient of the motion χ, neither Fe nor F p are the gradient of corresponding functions χe , χ p . We can think that for vectors Y in a neighbourhood of X, with X, Y ∈ R, the images x, y satisfy

806

13 Plasticity

y − x = Fe F p (Y − X) + o(|Y − X|). Hence, upon neglecting o(|Y − X|), F p maps the vector Y − X into yˆ − xˆ and hence Fe maps yˆ − xˆ into y − x. We can picture the transformation by viewing yˆ and xˆ as point in a structural space while y and x are points in χ(R) ⊂ E . Since7 F˙ = F˙ e F p + Fe F˙ p , F−1 = F p −1 Fe−1 , ˙ −1 is decomposed in the form then L = FF L = (F˙ e F p + Fe F˙ p )(F p −1 Fe−1 ) = F˙ e Fe−1 + Fe (F˙ p F p −1 )Fe−1 . So two velocity gradients can be defined, Le = F˙ e Fe−1 ,

L p = F˙ p F p −1 ,

and hence the decomposition of L is expressed as L = Le + Fe L p Fe−1 . We can then define the elastic stretching De and the elastic spin We , De = 21 (Le + Le T ),

We = 21 (Le − Le T ).

Likewise, the plastic stretching D p and the plastic spin W p are defined in the form D p = 21 (L p + L p T ),

W p = 21 (L p − L p T ).

Hence D = De + sym(Fe L p Fe−1 ). As with the approximation of small deformations, the plastic flow is assumed to be isochoric and hence L p and D p are assumed to be deviatoric, tr L p = tr D p = 0. We then let J = det Fe , J˙ = J tr De . det F p = 1, The balance of linear and angular momentum is taken to be the standard ones, ρ˙v = ρb + ∇ · T,

T = TT .

The balance of energy shows new questions induced by the multiplicative decomposition (13.7). Upon substitution, the power T · D becomes 7

For a more compact notation, we let Fe−1 := (Fe )−1 and Fe−T := (Fe )−T .

13.5 Model of Plasticity via the Kröner Decomposition

807

T · D = T · De + T · (Fe L p Fe−1 ). Now, by Ce := Fe T Fe , it follows

F˙ e := Le Fe ,

Ee := 21 (Ce − 1),

˙ e = 2 Fe T De Fe , C

(13.8)

E˙ e = Fe T De Fe .

Consequently, T · De = T · (Fe−T E˙ e Fe−1 ) = (Fe−1 TFe−T ) · E˙ e . The definition Te := J Fe−1 TFe−T allows Te to be viewed as the elastic counterpart of the standard second Piola stress; Te is then called the second Piola elastic stress. Since T ∈ Sym then also Te ∈ Sym. Based on the power (13.8), we observe T · (Fe L p Fe−1 ) = (Fe T TFe−T ) · L p . Moreover tr (Fe T TFe−T ) = tr T, and hence, since tr L p = 0, (FeT TFe−T ) · L p = (FeT T0 Fe−T ) · L p ; it is worth remarking that neither FeT TFe−T nor FeT T0 Fe−T are symmetric. Upon the requirement that T · (Fe L p Fe−1 ) = J −1 T p · L p ,

tr L p = 0,

we find that the plastic stress T p is given by T p = J Fe T T0 Fe−T . Now, J Fe T T0 Fe−T = J (Fe T TFe−T )0 = J (Fe T Fe Fe−1 TFe−T )0 = (Ce Te )0 . Letting Me := Ce Te = J Fe T TFe−T we obtain

808

13 Plasticity

(Me )0 · L p = T p · L p , whence T p = (Me )0 . Consequently, the power (13.8) is expressed in the form T · D = J −1 Te · E˙ e + J −1 (Me )0 · L p . The stress tensor Me is called the Mandel stress [295]. We neglect heat conduction and hence consider the entropy inequality ˙ + J −1 Te · E˙ e + J −1 T p · L p ≥ 0, −ρ(ψ˙ + η θ) whence

˙ + Te · E˙ e + T p · L p ≥ 0. −ρ R (ψ˙ + η θ)

The form of the power, Te · E˙ e + T p · L p , suggests that Te might be derived from a potential whereas T p might produce dissipative processes. A temperature-dependent model is obtained by letting ˆ Ee ), ψ = ψ(θ,

Te = Tˆ e (θ, Ee ),

T p = Tˆ p (θ, L p , e p ).

The accumulated plastic strain e p is now defined by e˙ p = |De |, to within an initial value. Evaluation of ψ˙ and substitution into the entropy inequality result in ρ R (∂θ ψˆ + η)θ˙ − (ρ R ∂Ee ψˆ − Te ) · E˙ e + T p · L p ≥ 0. Since E˙ e and L p are independent of each other and both of them are independent of θ˙ then the inequality holds if and only if ˆ η = −∂θ ψ,

ˆ Te = ρ R ∂Ee ψ,

and T p · L p ≥ 0. It is worth observing that Ee and e p are objective. Furthermore, L p is objective in that it is a map from the reference configuration to the structural space and then it is invariant under Euclidean transformations. For definiteness, if ψ is considered in the form

13.6 A Decomposition-Free Thermodynamic Scheme

809

ρ R ψ = μ|Ee |2 + 21 λ(tr Ee )2 , which is adopted in the modelling of small elastic strains, it follows Te = 2μEe + λ(tr Ee )1. Since Ee = 21 (Ce − 1) and Me = Ce Te then Me = μ(Ce2 − Ce ) + 21 (tr Ce − 3)Ce . In this approximation, it follows that Me is symmetric, Me = Me T , and so is T p . Hence Tp · Lp = Tp · Dp. The plastic stress T p (and the deviatoric Mandel stress (Me )0 ) is taken to depend on D p via N p = D p /|D p | as D p = 0. The thermodynamic restriction T p · D p ≥ 0 then yields Y (N p , e p ) := N p · T p (N p , e p ) ≥ 0. If Y (N p , e p ) is independent of N p then T p = Y (e p )N p ,

D p = 0.

Along with t

e p (t) = ∫ |D p (τ )|dτ , 0

the evolution of D p determines T p = (Me )0 .

13.6 A Decomposition-Free Thermodynamic Scheme A wide class of viscoelastic and plastic materials is modelled by following an approach which is characterized by a thermodynamic potential and the entropy production; both the Helmholtz free energy ψ and its Legendre transform φ = ψ − E · T R R /ρ R are applied as pertinent potentials. No decomposition, additive or multiplicative, is made in elastic and plastic distortions. No use is made of the linear approximation of E. We apply the second-law inequality in the standard form (Sect. 2.6), ρη˙ + ∇ ·

q ρr − = σ ≥ 0, θ θ

810

13 Plasticity

where the entropy production σ is taken to be a non-negative-valued constitutive function; for the present purposes, it is enough to identify q/θ with the entropy flux. We then have ˙ +T·D− −ρ(ψ˙ + η θ)

1 ˙ + T R R · E˙ − 1 q R · ∇R θ = J θσ, q · ∇θ = θσ, −ρ R (ψ˙ + η θ) θ θ

according as we follow the Eulerian or the Lagrangian description. Let ˙ T˙ R R , q˙ R )  = (θ, E, T R R , q R , ∇R θ, E, be the set of variables and take ψ, η, and σ be functions of . Computation of ψ˙ and substitution in the Clausius–Duhem inequality allow us to find that ψ is independent ˙ T˙ R R , q˙ R , ∇R θ while η is related to ψ as of E, η = −∂θ ψ. The Clausius–Duhem inequality then reduces to 1 (−ρ R ∂E ψ + T R R ) · E˙ − ρ R ∂T R R ψ · T˙ R R − ρ R ∂q R ψ · q˙ R − q R · ∇R θ = J θσ ≥ 0. θ We let E˙ and T˙ R R be possibly related to each other but q˙ R , ∇R θ are independent of ˙ T˙ R R . Consequently it follows E, (−ρ R ∂E ψ + T R R ) · E˙ − ρ R ∂T R R ψ · T˙ R R = J θσ E T ≥ 0,

(13.9)

1 − ρ R ∂q R ψ · q˙ R − q R · ∇R θ = J θσq ≥ 0, θ

(13.10)

σ E T and σq being entropy productions to be determined according to the constitutive model, σ = σ E T + σq . Inequality (13.10) models heat conduction. Let ψ depend on q R via ξ = |q R |n , n ≥ 2. Hence ∂q R ψ = n∂ξ ψ|q R |n−2 q R and inequality (13.10) becomes 1 (nρ R ∂ξ ψ|q R |n−2 q˙ R + ∇R θ) · q R = −J θσq ≤ 0. θ Letting J θσq =

1 q2 , κ(θ, J, ξ, |∇R θ|) R

κ > 0,

we obtain τ q˙ R + q R = −κ∇R θ,

(13.11)

13.6 A Decomposition-Free Thermodynamic Scheme

811

where τ = nρ R θ ∂ξ ψ|q R |n−2 . Equation (13.11) is a generalization of the Maxwell– Cattaneo equation where τ plays the role of relaxation time. If n = 2 then τ = 2κρ R θ∂ξ ψ. As to inequality (13.9) we observe that, in terms of the Legendre transform ρ R φ = ρ R ψ − T R R · E, we have − ρ R ∂E φ · E˙ − (ρ R ∂T R R φ + E) · T˙ R R = J θσ E T ≥ 0.

(13.12)

It is worth observing that the free energy φ can be represented in the Eulerian description. Consider the energy T R R · E; by the definition of T R R we have T R R · E = J (F−1 TF−T ) · E = J T · (F−T EF−1 ). Since F−T EF−1 = E A , the Eulerian–Almansi strain, then we conclude that TR R · E = J T · EA , Moreover,

ρ R φ = ρ R ψ − T R R · E ⇐⇒ ρφ = ρψ − T · E A .

˙ E˙ = FT E A F = FT (LT E A + E˙ A + E A L)F = FT E A F. 



Since E A = D then we have the expected result T R R · E˙ = J T · D. We now investigate a class of models consistent with the entropy inequality (13.9) and the objectivity principle.

13.6.1 Duhem-Like Solids In the literature, Duhem-like models are characterized by differential equations of the form x(t) ˙ = f (x(t), u(t))g(u(t)), ˙ where x, u are vectors in Rn . Models of this form, that traces back to Duhem, have been applied to represent friction and mechanical hysteresis (see, e.g. [291]). With this picture in mind we now examine (13.9). The significant variables are ˙ T˙ R R and then we observe that (13.9) can be specified by letting θ, E, T R R , E, ˙ E|)| ˙ E|, ˙ J θσ E T = γ E (θ, E, T R R , E/|

γE ≥ 0

812

or

13 Plasticity

˙ R R |, J θσ E T = γT (θ, E, T R R , T˙ R R /|T˙ R R |)|T

γT ≥ 0.

Hence, inequality (13.9) can be written as an equality in the form

or

˙ ≤0 (ρ R ∂E ψ − T R R ) · E˙ + ρ R ∂T R R ψ · T˙ R R = −γ E |E|

(13.13)

˙ R R | ≤ 0. (ρ R ∂E ψ − T R R ) · E˙ + ρ R ∂T R R ψ · T˙ R R = −γT |T

(13.14)

Likewise, in terms of the free energy density φ, it follows from (13.12) that ˙ ≤ 0, ρ R ∂E φ · E˙ + (ρ R ∂T R R φ + E) · T˙ R R = −γ E |E|

(13.15)

˙ R R = −γT |T ˙ R R | ≤ 0. ρ R ∂E φ · E˙ + (ρ R ∂T R R φ + E) · T

(13.16)

All these differential equations are invariant under the time transformation t → ct,

c > 0,

since so are the coefficients γ E and γT . Hence Eqs. (13.13)–(13.16) are rate independent.

Eulerian Scheme It is natural to ask about the Eulerian analogue of Eq. (13.9) and the subsequent ones. ˆ E A , T) then the Clausius–Duhem inequality results in If we start with ψ(θ, − ρ∂E A ψˆ · E˙ A − ρ∂T ψˆ · T˙ + T · D = θσ E T .

(13.17)

Apart from the relation between E˙ A and D, which is considered shortly, we observe that Eq. (13.17) might be used to write a constitutive relation in a Duhem-like form ˙ and E˙ A are not objective and hence (13.17), involving T˙ and E˙ A . However, both T though formally correct, is inadequate. Instead Eq. (13.9) is objective in that E and T R R are invariant. First recall that E = FT E A F and then 

E˙ = FT E A F,



T R R · E˙ = T · D.

E A = D,

Moreover, since T R R = J F−1 TF−T then 

T˙ R R = J F−1 T F−T . Now, if ψ = ψ(θ, E(E A , F), T R R (T, F)) then we obtain

13.6 A Decomposition-Free Thermodynamic Scheme

∂T ψ = J F−T ∂T R R ψF−1 , ρ∂E ψ · E˙ = ρ∂E A ψ · D,

813

∂E A ψ = F∂E ψFT , 

ρ∂T R R ψ · T˙ R R = ρ∂T ψ· T .

Hence substitution in (13.9) leads to 

(−ρ∂E A ψ + T) · D − ρ∂T ψ· T= θσ E T ,

(13.18)

where we take θσ E T in the form 





θσ E T = γ˜ E (θ, E A , T, D/|D|)|D| or θσ E T = γ˜ T (θ, E A , T, T /| T |)| T |. So we have



(ρ∂E A ψ − T) · D + ρ∂T ψ· T= −γ˜ E |D|, 

(13.19)



(ρ∂E A ψ − T) · D + ρ∂T ψ· T= −γ˜ E | T |.

(13.20)

This shows that the constitutive properties of T˙ R R in terms of E˙ have the strict analogue 

in T in terms of D. The differences between (13.17) and (13.18) are obviously understood if we observe that ψ in (13.18) is a composite function, of E A and T, which depend also on F. In (13.17), instead, ψˆ depends on E A and T but is independent of F.

13.6.2 Hyperelastic and Hypoelastic Solids Look at Duhem-like solids without dissipation, namely, with σ E T = 0. If σ E T = 0 and ∂T R R ψ = 0 then it follows from (13.13) that (ρ R ∂E ψ − T R R ) · E˙ = 0. The arbitrariness of E˙ implies T R R = ρ R ∂E ψ and then T = ρF∂E ψFT . This constitutive equation models hyperelastic solids. We might have obtained (13.13) by assuming ψ = ψ(θ, E) from the start. If this is the case we have ∂T R R ψ = 0 and (13.13) implies

814

13 Plasticity

T R R = ρ R ∂E ψ,

˙ ˙ R R = ρ R ∂E2 ψ(E)E˙ + ρ R ∂θ ∂E ψ θ, T

where ∂E2 ψ is a fourth-order tensor with major and minor symmetries. If instead γ E = 0 but ∂T ψ = 0 (and ∂T R R ψ = 0) then (13.9) can be written in the form (ρ R ∂E ψ − T R R ) · E˙ + ρ R ∂T R R ψ · T˙ R R = 0. ˙ Consequently T˙ R R can be given a linear representation in E. To this end, we recall the representation formula (A.8) of tensors; for any tensor Z with a known value of the inner product Z · N, and N · N = 1, we have Z = (Z · N)N + (I − N ⊗ N)G for any tensor G. Now, assume |∂T R R ψ| = 0 and let N = ∂T R R ψ/|∂T R R ψ|, Z = T˙ R R , ˙ Observe that, by (13.13) with γ E = 0, we have and G = G R R E. 1 ˙ ˙ + (I − N ⊗ N)G R R E, T˙ R R = [(T R R − ρ R ∂E ψ) · E]N ρR where G R R (θ, E, T R R ) is an arbitrary fourth-order tensor-valued function. Hence, letting C R R (θ, E, T R R ) = G R R +

1 ∂T ψ ⊗ (T R R − ρ R ∂E ψ − ρ R GTR R ∂T R R ψ), ρ R |∂T R R ψ|2 R R (13.21)

we have

˙ T˙ R R = C R R E.

(13.22)

This relation describes the constitutive properties of hypoelastic solids [345, 428] with a nonlinear tensor function C R R of θ, E, T R R . By the arbitrariness of G R R it follows that there are infinitely many hypoelastic tensors C R R compatible with a given free energy ψ. If, instead, a hypo-thermoelastic constitutive relation is given in advance in the form (13.22) then (13.13) can be written as (ρ R ∂E ψ − T R R ) · E˙ + ρ R ∂T R R ψ · C R R E˙ = 0,

(13.23)

and the arbitrariness of E˙ implies T R R − ρ R ∂E ψ = ρ R CTR R ∂T R R ψ.

(13.24)

In [310], the existence of a thermodynamic potential ψ satisfying (13.23) is investigated starting from (13.22) with a given tensor C R R . In general, the fourth-order tensor C R R enjoys the minor symmetries, not the major one. We now give some examples of the representation formula (13.22).

13.6 A Decomposition-Free Thermodynamic Scheme

815

Example 1 Let ρ R ψ(θ, T R R ) = ρ R ψ0 (θ) +

1 2

|T R R |2

∫ α(θ, ξ) dξ, 0

where α is a nonzero real-valued function. Then ρ R ∂T R R ψ = α(θ, |T R R |2 )T R R , ρ R ∂E ψ = 0 so that choosing G R R = νI we have C R R (θ, T R R ) = νI +

1 α

−ν

 TR R TR R ⊗ , |T R R | |T R R |

which enjoys both minor and major symmetries. This tensor is positive definite provided that 1/α ≥ ν > 0. In particular, if we choose ν = 1/α it follows that C R R = (1/α)I. Example 2 Let ρ R ψ(θ, E, T R R ) = ρ R ψ0 (θ) + T R R · E − 21 α(θ)|E|2 −

1 2

|E|2

∫ ζβ(θ, ζ) dζ, 0

where α, β are real-valued functions. Hence ρ R ∂T R R ψ = E, ρ R ∂E ψ = T R R − α(θ)E − |E|2 β(θ, |E|2 )E so that choosing G R R = νI we have C R R (θ, E) = νI + βE ⊗ E + (α − ν)

E E ⊗ , |E| |E|

which enjoys both minor and major symmetries. This tensor is positive definite provided that ν ≥ 0, β > 0 and α ≥ ν. In particular, if we choose ν = α it follows (see [310], Sect. C) C R R (θ, E) = αI + βE ⊗ E. The next example involves both |E| and |T R R |. Example 3 Consider the free energy ρ R ψ(θ, E, T R R ) = ρ R ψ0 (θ) + α(θ)T R R · E − 21 β(θ)|E|2 + 21 γ(θ)|T R R |2 , where α(θ) = 0, β(θ) = 0. Hence we have ρ R ∂T R R ψ = α(θ)E + γ(θ)T R R , ρ R ∂E ψ = α(θ)T R R − β(θ)E.

816

13 Plasticity

Consequently, if γ = (1 − α)α/β and G R R = νI we have C R R (θ, E, T R R ) = νI +

β α

−ν

 γT R R + αE γT R R + αE ⊗ |γT R R + αE| |γT R R + αE|

enjoys the major symmetry and is positive definite provided that β/α ≥ ν > 0. If α = 1 as in Example 2 then γ = 0. A similar approach can be developed by starting from (13.12) with σ E T = 0, namely,   (13.25) ρ R ∂E φ · E˙ + ρ R ∂T R R φ + E · T˙ R R = 0, where |∂T R R φ| = 0. Hence, by paralleling previous arguments and letting K R R (θ, E, T R R ) = J R R −

1 ∂E φ ⊗ (E + ρ R ∂T R R φ + JTR R ρ R ∂E φ), ρ R |∂E φ|2

(13.26)

where J R R (θ, E, T R R ) is an arbitrary fourth-order tensor-valued function, we obtain E˙ = K R R T˙ R R .

(13.27)

Example 4 Let ρ R φ(θ, E, T R R ) = ρ R ψ(θ, T R R ) − T R R · E and ρ R ψ(θ, T R R ) = ρ R ψ0 (θ) + 21 α(θ)|T R R |2 +

1 2

|T R R |2

∫ uβ(θ, u) du 0

where α and β are real-valued functions. Hence, we obtain   ρ R ∂T R R φ = α(θ) + |T R R |2 β(θ, |T R R |2 ) T R R − E,

ρ R ∂E φ = −T R R

so that letting J R R = μI we have K R R (θ, E) = μI + β(θ, |T R R |2 )T R R ⊗ T R R + (α − μ)

TR R TR R ⊗ . |T R R | |T R R |

The tensor K R R enjoys the major symmetry and, choosing μ = α, reduces to K R R (θ, E) = α(θ)I + β(θ, |T R R |2 )T R R ⊗ T R R . The resulting hypoelastic constitutive Eq. (13.27) is generated by the well-known Prandtl–Reuss work hardening plasticity theory provided that tr E = tr T R R = 0 (see [310], Sect. C). In the spatial description, using (13.19) with γ˜ E = 0, we obtain an expression equivalent to (13.22), namely,

13.6 A Decomposition-Free Thermodynamic Scheme

817

1 ∂T ψ ⊗ (T − ρ∂E A ψ − ρGT ∂T ψ). ρ|∂T ψ|2 (13.28) After replacing ρ R , E, T R R , C R R with ρ, E A , T, C, respectively, we can translate all previous results into the spatial description. 

T = CD,

C(θ, E A , T) = G +

13.6.3 Dissipative Duhem-Like Solids The Duhem-like structure is considered by letting ∂T R R ψ = 0, or ∂E φ = 0, and allowing the model to be dissipative in that σ E T is nonzero. Moreover, T R R , or E, is still an independent variable. If ∂T R R ψ = 0, and γ E = 0, by (13.13) we have ˙ ≤ 0. (ρ R ∂E ψ − T R R ) · E˙ = −γ E |E| To obtain a representation of T R R we apply the representation formula (A.8) to Z = ˙ E| ˙ and, for simplicity, G = 0, we find ρ R ∂E ψ − T R R with N = E/| TR R = γE

E˙ + ρ R ∂E ψ(θ, E). ˙ |E|

˙ E| ˙ may be viewed as a dissipative This equation is rate independent and the term γ E E/| (viscous) effect. If, for instance, ρ R ψ is a quadratic function of E, ρ R ψ = 21 λ(θ)E · E, then E˙ + λ(θ)E, T = γ˜ E ˙ |E| which is similar to the model of Kelvin–Voigt viscoelastic solids. In the spatial description, if ∂T ψ = 0 we have T = γ˜ E

D + ρ∂E A ψ(θ, E A ). |D|

If instead we consider (13.16) and let ∂E φ = 0 then we have ˙ R R |. (ρ R ∂T R R φ + E) · T˙ R R = −γT |T Applying the representation formula (A.8) with N = T˙ R R /|T˙ R R |, Z = ρ R ∂T R R φ + E, and G = 0 we find the equation E = −γT

T˙ R R − ρ R ∂T R R φ(θ, T R R ), |T˙ R R |

818

13 Plasticity

which is dissipative (γT > 0) and rate independent. If ρ R φ has the quadratic form ρ R φ = 21 κ(θ)T R R · T˙ R R then we find γT

T˙ R R + κ(θ)T R R = −E. |T˙ R R |

(13.29)

Equation (13.29) is formally similar to the equation of a standard linear solid with a repulsive spring in parallel to the Maxwell unit. Letting k (s) = −k ( M ) in (6.23) we find the analogue of (13.29).

13.6.4 Elastic–Plastic Models Elastic–plastic models are characterized by an entropy production σ which depends ˙ or |T˙ R R |. For definiteness, we consider (13.13), with ∂T R R ψ = 0, linearly on either |E| or (13.16), with ∂E φ = 0. Assume |∂T R R ψ| = 0 and let ∂T R R ψ PR R = . ρ R |∂T R R ψ|2 In view of the representation formula (A.8), we can write the constitutive Eq. (13.13) in the form ˙ T˙ R R = C R R E˙ − γ E P R R |E|, where C R R is given in (13.21). In the spatial description, if |∂T ψ| = 0 then (13.19) can be written as 

T= CD − γ˜ E P|D|, where C is given in (13.22) and P=

∂T ψ . ρ|∂T ψ|2

A different class of elastic–plastic models follows from (13.16) by letting ∂E φ = 0, which amounts to letting ρ R ∂E ψ − T R R = 0, and Q R R (θ, E, T R R ) =

∂E φ ρ R ∂E ψ − T R R = . ρ R |∂E φ|2 |ρ R ∂E ψ − T R R |2

In view of (A.8), we obtain ˙ R R |, E˙ = K R R T˙ R R − γT Q R R |T where K R R is given in (13.26).

13.6 A Decomposition-Free Thermodynamic Scheme

819

In a purely formal way, we might define E˙ e = KT˙ R R ,

E˙ p = −γT Q R |T˙ R R |

so that E˙ = E˙ e + E˙ p . Moreover we might view E˙ e and E˙ p as the elastic and plastic strain rates. Yet, there is no decomposition E = Ee + E p of the strain. In the spatial description, we assume ∂E A φ = 0, and then T = ρ∂E A ψ, and let Q(θ, E A , T) =

∂E A φ ∂E A ψ − T = . ρ|∂E A φ|2 |∂E A ψ − T|2

Hence it follows from (13.20) that 



D = K T −γ˜ T Q| T |, where K is obtained from K R R by replacing ρ R , E, T R R , J R R with ρ, E A , T, J. Again in a formal way, we can define 

De = K T,



D p = −γ˜ T Q| T |

and say that D = De + D p ; this however does not mean that the velocity field is decomposed in elastic and plastic parts.

13.6.5 One-Dimensional Models Restrict attention to one-dimensional models associated with strain and stress in the direction e so that E = Ee ⊗ e, T R R = Se ⊗ e. The symbol S for the component of T R R is consistent with the engineering stress considered in the literature as the ratio of the axial force over the reference area [216], p. 74. Let ψ be a function of E, S, parameterized by θ, and θσ be a function ˙ S. ˙ The generalized Clausius–Duhem inequality (13.9) is then written as of E, S, E, ˙ S), ˙ ∂ S ψ R S˙ + (∂ E ψ R − S) E˙ = −J θ σ(E, S, E,

(13.30)

where ψ R is the referential free energy, ψ R = ρ R ψ. At constant temperature, ψ˙ R = ∂ E ψ R E˙ + ∂ S ψ R S˙ and hence integration of (13.30), as t ∈ [t1 , t2 ], along a closed curve in the E − S plane results in t2

t2

t1

t1

˙ = − ∫ S E˙ dt = 0 ≥ ∫[∂ S ψ R S˙ + (∂ E ψ R − S) E]dt



S d E,

820

13 Plasticity



denoting the integral along the closed curve. The positive value of that the closed curve is run in the clockwise sense. Consistent with (13.13) we let ˙ E|, ˙ J θσ = γ E (E, S, sgn E)|



S d E implies

γ E ≥ 0.

Hence Eq. (13.30) is invariant under the time transformation t → ct, c > 0, and hence the associated model is rate independent because so is γ E . Look at time intervals where E˙ = 0. Since ∂ S ψ R = 0 , divide Eq. (13.30) by ∂ S ψ R E˙ to obtain γE S˙ S − ∂E ψR ˙ − sgn E. (13.31) = ˙ ∂S ψR ∂S ψR E Let χ1 :=

S − ∂E ψR , ∂S ψR

χ2 := −

γE . ∂S ψR

(13.32)

Hence the stress–strain slope can be written in the form S˙ ˙ = χ1 + χ2 sgn E, E˙

(13.33)

where χ1 is a function of E, S parameterized by θ, while χ2 can also depend on ˙ sgn E. Since γ E depends on E˙ only through sgn E˙ then the right-hand side of (13.31) has the form S˙ = f (S, E). E˙ Now if a stress–strain curve is piecewise described by the form F(S, E) = 0 then dS ∂E F S˙ = . =− ∂S F dE E˙ ˙ E˙ is the slope of the stress–strain curve and then is the Thus in (13.33) the ratio S/ effective differential elastic modulus. If γ E = 0, and hence χ2 = 0, then d S/d E is the slope of the hypoelastic curve; the assumption dS = χ1 (θ, E, S) > 0 dE means that the slope d S/d E is positive and the value at E depends also on S. Hence we regard the conditions γ E = 0,

∂ S ψ R = 0,

S = ∂E ψR

13.6 A Decomposition-Free Thermodynamic Scheme

821

as characterizing the elastic regime and regard χ1 as the differential elastic modulus. The slope of the stress–strain curve is assumed to be non-negative; since χ1 > 0 it is consistent to assume χ1 + χ2 sgn E˙ ≥ 0,

|χ2 | ≤ χ1 .

(13.34)

The monotonicity condition (13.34) follows from physical arguments about the stress–strain curve but is not required by thermodynamics. To establish models of elastic–plastic, rate-independent materials, we have to specify the functions χ1 and χ2 . By definition, χ1 is fully determined by the free energy ψ while χ2 depends also on γ E . Consequently once a free energy function ψ is selected, different models are determined by different functions γ E . Indeed, since the hysteretic properties of the model depend on γ E then we refer to γ E as the hysteretic function. The possible models are framed within the following scheme: Hyperelastic regime

γE = 0

∂S ψR = 0

Hypoelastic regime

γE = 0

∂S ψ R = 0

Hysteretic regime

γ E = 0

∂S ψ R = 0

S = ∂E ψR dS S − ∂E ψR = dE ∂S ψR γE dS S − ∂E ψR − sgn E˙ = dE ∂S ψR ∂S ψR

The next investigation deals with non-elastic bodies and hence is developed under the assumption that ∂ S ψ R = 0.

A Formal Additive Decomposition of the Strain Rate An additive decomposition of the strain rate, not the strain, follows by letting8 ˙ the function γT ≥ 0 is non-negative and possibly S − ∂ E ψ R = 0 and J θσ = γT | S|; ˙ dependent on E, S, sgn S and parameterized by θ. Hence Eq. (13.30) becomes ˙ ∂ S ψ R S˙ + (∂ E ψ R − S) E˙ = −γT | S|. Dividing by ∂ E ψ R − S we have ˙ E˙ = ξ1 S˙ + ξ2 | S|,

ξ1 =

1 ∂S ψR = , χ1 S − ∂E ψR

ξ2 =

γT . S − ∂E ψR

This splitting mimics the standard relation E˙ = E˙ e + E˙ p of the one-dimensional incremental theory of plasticity and suggests that we introduce an hardening variable S by means of the differential relation 8

This amounts to assuming ∂ E φ R = 0, φ R := ψ R − E S.

822

13 Plasticity

˙ S˙ = [sgn (S − ∂ E ψ)] E˙ p = [sgn (S − ∂ E ψ)]ξ2 | S|, with a zero initial value. This quantity turns out to be always non-negative and strictly related to the entropy production, ˙ = S

J θσ γT ˙ = | S| . |S − ∂ E ψ R | |S − ∂ E ψ R |

(13.35)

In addition, whenever S˙ = 0, we can write dE ˙ = ξ1 + ξ2 sgn S. dS Thus we require that

(13.36)

ξ1 + ξ2 sgn S˙ ≥ 0.

Some remarks are in order about energy dissipation. It is often asserted that the area of the hysteresis loop is related to the amount of energy dissipation. Now, in a cycle from t1 to t2  Sd E = ∫tt21 J θσdt. ˙ then letting J θγ E The left-hand side is the area A of the loop. If we let σ = γ E | E| roughly constant along the cycle we have t2

˙ A  J θγ E ∫ | S|dt = J θγ E 2E, t1

E being the variation of E. Hence 2J θγ E 

A E

˙ then is the approximate (vertical) thickness of the loop. Likewise, if σ = γ E | S| 2J θγT 

A S

is the approximate (horizontal) thickness of the loop. Narrow (wide) loops are given by small (large) values of γ E or γT .

13.6.6 The Helmholtz Free Energy A model is characterized by χ1 , χ2 and χ1 follows from the free energy ψ R . To select ψ R we start from the generic assumption

13.6 A Decomposition-Free Thermodynamic Scheme

823

ψ R (E, S) = L(S − G(E)) + F(S) + H(E), L, G, F, and H being so far undetermined differentiable functions, possibly parameterized by the temperature θ. By (13.32) we find χ1 =

S + L (S − G(E))G  (E) − H (E) , L (S − G(E)) + F  (S)

γE χ2 = −  . L (S − G(E)) + F  (S)

For simplicity we let χ1 = g(E) ≥ 0, g being characteristic of the model. Hence it follows S − F  (S)g(E) − L (S − G(E))[g(E) − G  (E)] = H (E). A solution is given by g(E) − G  (E) = α,

F  (S) = 0,

H (E) = G(E),

L (S − G(E)) =

1 (S − G(E)), α

where α is a constant. Hence χ2 is given by χ2 = −α

γE . S − G(E)

Since g ≥ 0 it is convenient to let α > 0. The properties of the model are then summarized as follows. • The free energy ψ R is characterized by the function G(E) and parameterized by α (and the temperature θ) in the form ψ R (E, S) =

1 [S − G(E)]2 + H(E), 2α

H (E) = G(E);

(13.37)

γE . S − G(E)

(13.38)

• the functions χ1 and χ2 take the form χ1 = G  (E) + α ≥ 0,

χ2 = −α

As a comment, the positive parameter α is the difference between the differential elastic modulus χ1 and the slope of the elastic function G(E). The occurrence of both G and α allows a greater flexibility in the modelling of hysteretic materials.

824

13 Plasticity

13.6.7 Some Hysteretic Models The hysteretic properties are closely related to γ E . Here we show some examples induced by choices of γ E .

Plastic Flow with Asymptotic Strength Let G(E) = 0 so that ψ = S 2 /2α and χ1 = α. Select the hysteretic function γ E in the form S2 , Su > 0. γ E (S) = Su Hence (13.31) can be written as α dS ˙ = (Su − S sgn E). dE Su Consequently α d ˙ ≤0 (S − Su )2 = 2 (S − Su )(Su − S sgn E) dE Su and then hysteresis loops are confined to the open strip |S| < Su . No elastic region exists; at any point E˙ affects the slope of the curve. Moreover |S| < Su implies that d S/d E > 0, which implies the monotonicity of the curve in any region. The model is characterized by the positive parameters α, Su ; since α = χ1 then α is the slope of the elastic stress–strain path, while Su is the ultimate tensile stress or the asymptotic strength as |E| → ∞. To obtain a picture of the hysteresis loops, we consider the system of equations  E˙ = ωE cos ωt, ˙ S˙ = α(Su E˙ − S| E|)/S u, with initial values E 0 , S0 . The rate independence of the model makes the loops to be independent of the angular frequency ω (Fig. 13.1).

Plastic Flow with Nonlinear Elastic Function A more involved model is established by letting the elastic function G(E) be nonzero. Indeed we let G(E) be nonlinear, G(E) = tanh(κE), and we let

κ > 0,

13.6 A Decomposition-Free Thermodynamic Scheme

825

Fig. 13.1 Plastic model with asymptotic strength: hysteresis loops (solid) and asymptotic bounds |S| = Su (dashed) with α = 1, Su = 1.5, E = 1, 2, 3 and starting from (E 0 , S0 ) with E 0 = 0, S0 = 0, −0.1

γE =

1 [S − tanh(κE)]2 , λ

λ > 0.

By (13.37) and (13.38) it follows that ψ R (E, S) =

1 1 [S − tanh(κE)]2 + ln[cosh(κE)], 2α κ

χ1 = κ[1 − tanh2 (κE)] + α,

˙ χ2 = −α[S − tanh(κE)]sgn E.

Accordingly, we have α dS ˙ = κ[1 − tanh2 (κE)] + {λ − [S − tanh(κE)]sgn E}. dE λ The elastic region reduces to the curve S = G(E). As |E| → ∞ we have |S| → (λ + 1)− . The loops rely within the horizontal strip |S| ≤ 1 + λ. Examine the requirement 0 ≤ χ1 − |χ2 | = κ[1 − tanh2 (κE)] + α −

α |S − tanh(κE)|. λ

As E > 0 the requirement is satisfied if |S − tanh(κE)| ≤ λ + 1 − tanh(κE) ≤ λ +

λκ [1 − tanh(κE)] α

826

13 Plasticity

Fig. 13.2 Plastic model with nonlinear hardening. Hysteresis loops (solid) and yield-strength bounds |S| = 1 + λ (dashed) are determined by letting α = 1, κ = 4, λ = 1.5; the amplitude of the oscillations is E = 2.5

and hence if λκ ≥ α. The same conclusion follows if E < 0. The Fig. 13.2 shows some loops determined with λκ/α = 6.

Elastic–Plastic Model For simplicity, we consider a solid undergoing a linear behaviour in the elastic regime. Accordingly we let G(E) = 0 and hence ψ=

S2 , 2α

χ1 = α.

Let Su > S y > 0. The hysteretic function is chosen to be nonzero in the region |S| ∈ [S y , Su ] in the form ˙ = γ E (S, sgn E)

 |S|−Sy |S| if |S| ≥ S and S E˙ > 0, y Su −Sy 0 if |S| < S y or |S| ≥ S y , S E˙ < 0.

Hence it follows  α (S − |S|) if |S| ≥ S and S E˙ > 0, dS u y = Su −Sy dE α if |S| < S y or |S| ≥ S y , S E˙ < 0.

13.6 A Decomposition-Free Thermodynamic Scheme

827

Fig. 13.3 Elastic–plastic model: hysteresis loops (solid) and yield-strength bounds |S| = S y (dashed) with α = 1, S y = 1.5, Su = 2.5, E = 4 and starting from the origin (E, S) = (0, 0)

The loops are placed within the strip S y ≤ |S| ≤ Su . Within the open strip the body behaves elastically. Within the region |S| ∈ [S y , Su ] the material behaves elastically during unloading and plastically during loading (Fig. 13.3).

13.6.8 Models with Additive Decomposition of the Strain Rate The model of E in terms of S as is given in (13.35) allows us to find connections with the one-dimensional incremental theory of plasticity [216, 349]. For simplicity, we first consider a linear strain hardening and a linear dependence in the elastic regime. Hence, we let G(E) = 0 so that ψ = S 2 /2α. Since ∂ E ψ = 0 then by (13.35) we have ξ1 = 1/α and ξ2 = γT /S. The hysteretic function γT is assumed in the form ˙ = γT (S, sgn S)



|S|/ h if |S| ≥ S y and S S˙ > 0, 0 otherwise,

where S y , h > 0 represent the yield strength and the hardening parameter, respectively. Consequently, the stress–strain evolution is given by  dE 1/α + 1/ h if |S| ≥ S y and S S˙ > 0, = 1/α otherwise. dS

828

13 Plasticity

Fig. 13.4 Elastic–plastic model with linear strain hardening: hysteresis loops (solid) and yieldstrength bounds |S| = S y (dashed) are determined by α = 1, h = 1, S y = 3

The elastic region is identified with the horizontal strip {(E, S) : |S| ≤ S y } (Fig. 13.4). The slopes of the stress–strain curves are α, in the elastic region and when S S˙ < 0, and αh/(α + h) < α when |S| ≥ Sy , S S˙ > 0. The strain rate might be decomposed in the form 1˙ E˙ e = S, α

E˙ p =

˙ S/ h if |S| ≥ S y and S S˙ > 0, 0 otherwise.

By (13.35) we define the hardening variable S on [t0 , t], S(t0 ) = 0, via the rate equation  ˙ ˙ γT | S| | S|/ h if |S| ≥ S y and S S˙ > 0, S˙ = = 0 otherwise. |S| Hence S˙ ≥ 0 and

˙ t t ˙ )dτ = ∫ γT (τ )| S(τ )| dτ . S(t) = ∫ S(τ |S(τ )| t0 t0

Consequently S(t) =

1 h

˙ )|dτ , ∫ | S(τ

I(t0 ,t)

˙ ) > 0}. where I(t0 , t) = {τ ∈ (t0 , t) : |S(τ )| ≥ S y and S(τ ) S(τ A nonlinear strain hardening is now modelled by letting again G(E) = 0 and hence ψ = S 2 /2α, ξ1 = 1/α. Letting Su > S y > 0 we choose the hysteretic function

13.7 Constitutive Models of Polymeric Foams

˙ = γT (S, sgn S)



|S|−Sy |S| α(Su −|S|) 1 α

829

if |S| ≥ S y and S S˙ > 0, otherwise.

Hence γT > 0 in the open strip |S| ∈ (S y , Su ) when S S˙ > 0. It follows that dE = dS



Su −Sy α(Su −|S|) 1 α

if |S| ≥ S y and S S˙ > 0, otherwise.

The resulting loops have then the form of Fig. 13.3. The additive decomposition of the strain rate is possible in the form 1˙ E˙ e = S, α

E˙ p =



|S|−Sy ˙ S α(Su −|S|)

0

if |S| ≥ S y and S S˙ > 0, otherwise.

Moreover, the rate of the hardening variable S is given by S˙ = (sgn S) E˙ p = Hence S(t) =



|S|−Sy ˙ | S| α(Su −|S|)

0

if |S| ≥ S y and S S˙ > 0, otherwise.

|S(τ )| − S y ˙ | S(τ )|dτ , α(S I(t0 ,t) u − |S(τ )|) ∫

˙ ) > 0}. where I(t0 , t) = {τ ∈ (t0 , t) : |S(τ )| ≥ S y and S(τ ) S(τ

13.7 Constitutive Models of Polymeric Foams Polymeric foams can be viewed as porous media with a solid matrix while the pores are saturated with a gas. The importance of foams for industrial applications is connected with the energy-absorbing capacity; they undergo large deformation while maintaining a nearly constant stress value. Restrict attention to one-dimensional settings and denote by S and H the engineering stress and the engineering strain9 ; S is the axial force per unit undeformed area (Piola stress) while H = ∂ X u  (L − L 0 )/L 0 , where L and L 0 are the current length and the reference length; by definition we have H > −1. The stress–strain curve of polymeric foams generally shows three qualitative regions: an initial linearly elastic region, next a plasticity-like plateau region, and then a final densification region where the stress rises steeply. The stress–strain relation has been investigated experimentally. We mention two models of S = f (H ) with five parameters, [20, 280] 9

Denoted by σ and ε frequently in the literature.

830

13 Plasticity

f 1 (H ) = a

exp(αH ) − 1 + c[exp(γ H ) − 1], exp(β H ) + 1

f 2 (H ) = a{1 − exp[(−e/a)H (1 − H )m ]} + b



H n , 1− H

where a, c, α, β, γ and a, b, e, m, n are the pertinent parameters. Yet the dynamic loading shows a dependence of S also on the strain rate. To include strain-rate effects, the dependence has been improved in the form [341] S = f (H )h(H, H˙ ),

(13.39)

where f, h may depend on the temperature θ and the (initial) mass density ρ. The literature shows various forms of the functions f, h, possibly f = f 1 , f 2 , and, e.g. h(H, H˙ ) = ( H˙ / H˙ 0 )a+bH and h(H, H˙ ) = 1 + (a + bH ) ln( H˙ / H˙ 0 ), where H˙ 0 is a reference strain rate (frequently H˙ 0 = 10−3 s−1 ). The strain-rate dependence may indicate that also hysteretic effects occur. Hysteresis really happens as is checked experimentally [285]. Here we take the suggestion (13.39) for a constitutive equation S(θ, H, H˙ ) and look for thermodynamically consistent models of solid foams. Following the literature we consider a one-dimensional setting and take the balance equation of energy in the form ρε˙ = S H˙ − ∂x q + ρr. The entropy inequality ρη˙ + ∂x (q/θ) = σ ≥ 0 leads to ˙ + S H˙ − 1 q∂x θ = θσ. −ρ(ψ˙ + η θ) θ Let θ, H, S, ∂x θ, H˙ , S˙ be the set of independent variables and ψ, η, σ the constitutive functions; the joint dependence on H, S, H˙ , S˙ is suggested by the need of modelling hysteretic effects. Upon evaluation of ψ˙ and substitution, we find 1 −ρ(∂θ ψ + η)θ˙ − (ρ∂ H ψ − S) H˙ − ρ∂ S ψ S˙ − ρ∂∂x θ ψ ∂x˙ θ − ρ∂ H˙ ψ H¨ − ρ∂ S˙ ψ S¨ − q∂x θ = θσ. θ

¨ ∂x˙ θ, θ˙ implies that The arbitrariness of H¨ , S, ∂ H˙ ψ = 0, ∂ S˙ ψ = 0, ∂∂x θ ψ = 0, η = −∂θ ψ. Since ψ is independent of ∂x θ then letting ∂x θ = 0 we obtain

13.7 Constitutive Models of Polymeric Foams

831

− (ρ∂ H ψ − S) H˙ − ρ∂ S ψ S˙ = θσmech ≥ 0,

(13.40)

where σmech = σ|∂x θ=0 . Since ψ = ψ(θ, H, S) then at any isothermal closed curve in [t1 , t2 ], H (t2 ) = H (t1 ), S(t2 ) = S(t1 ), we have t2

t2

t2

t1

t1

t1

0 ≤ ∫ θσmech dt = ∫[−ρ∂ S ψ S˙ + (S − ρ∂ H ψ) H˙ ]dt = ∫ S H˙ dt.

t The non-negative value of t12 S H˙ dt along a closed curve implies that the curve is run in the clockwise sense. We now apply Eq. (13.40) to derive a rate equation for the stress S. Let ∂ S ψ = 0 and assume γ ≥ 0. θσmech = γ(θ, H, S, H˙ )| H˙ |, In view of (13.40), we can write (S − ρ∂ H ψ) ˙ γ | H˙ |, S˙ = H− ρ∂ H ψ ρ∂ H ψ

(13.41)

and hence, except at the inversion points where H˙ = 0, S˙ γ S − ρ∂ H ψ − sgn H˙ . = ˙ ρ∂ S ψ ρ∂ S ψ H

(13.42)

It is worth remarking that Eq. (13.41) is invariant under the time transformation t → ct,

c > 0,

if γ is invariant and in particular if γ is independent of H˙ . Here instead we are interested in the dependence of γ on H˙ and hence the present model describes a rate-dependent behaviour. Since γ depends on H˙ then (13.42) is a nonlinear rate equation for S. If, however, we restrict attention to numerical or experimental testing where H˙ is constant then the right-hand side of (13.42) is a function of S, H and, as with (13.31), dS S˙ = ˙ d H H is the slope of the stress–strain curve, parameterized by θ and H˙ . Alternatively, we observe that if S = f (H, y) then, as y is constant, S˙ = ∂ H f H˙ . For the sake of ˙ H˙ so that also a non-constant function H˙ generality we keep using the notation S/ is considered. In view of (13.42), we define

832

13 Plasticity

χ1 = so that

S − ρ∂ H ψ , ρ∂ S ψ

χ2 = −

γ ρ∂ S ψ

(13.43)

S˙ = χ1 + χ2 sgn H˙ . H˙

According to (13.41), the model is determined by means of the two functions ψ(S, H ), γ(S, H, H˙ ), parameterized by θ. We start with the generic assumption ψ(H, S) = L(S − G(H )) + F(S) + H(H ). Hence it follows χ1 =

S + L (S − G(H ))G  (H ) − H (H ) , L (S − G(H )) + F  (S)

γ χ2 = −  . L (S − G(H )) + F  (S)

Let χ1 = g(H ). It follows from (13.43) that S + L (S − G(H ))[G  (H ) − g(H )] − F  (S)g(H ) = H (H ), which holds if 1 L (S − G(H )) = − [S − G(H )], α

and

G(H ) = α(H + 1),

F  (S) =

1 S α

H (H ) = (H + 1)[α − g(H )].

Hence the free energy ψ takes the form H

ψ(H, S) = (H + 1)S − ∫ (1 + ξ)g(ξ)dξ 0

and χ1 = g(H ),

χ2 = −

γ , 1+ H

where γ ≥ 0 and g has to be selected. For definiteness and simplicity, in Fig. 13.5 we show an example of the loops obtained by solving the system of equations

H˙ = 21 ωH sin ωt, γ S˙ = g(H ) H˙ − 1+H | H˙ |,

with γ constant and g(H ) = f 1 (H ); for definiteness we let

13.7 Constitutive Models of Polymeric Foams

833

Fig. 13.5 Hysteresis loops (solid) and the graph of the function g (dashed) with H = 3, γ = 0.2, H0 = 0, S0 = 2.5

f 1 (H ) = 0.5

exp(3H ) − 1 + 0.2[exp(1.5 H ) − 1]. exp(H ) + 1

The rate independence of the model allows the loops to be independent of the angular frequency ω.

Chapter 14

Superconductivity and Superfluidity

Superconductivity and superfluidity have a common origin due to the discovery of perfect conduction in mercury and superfluidity in helium. The uncommon properties are often said to be quantum-mechanical phenomena. However, the effects show on a large scale so that we can describe them as macroscopic phenomena. In this chapter, superconductivity and superfluidity are developed in a continuum framework within the theory of mixtures. In superconductivity, the fluid is viewed as consisting of two constituents: normal electrons (satisfying Ohm’s law) and superconducting electrons (they do not suffer any resistance). Indeed, within the London equations, the superconducting current satisfies a rate equation and hence follows from a free energy potential. In superfluidity, the model involves two chemically reacting fluids, namely, the normal fluid and the superfluid (with zero entropy). The two fluids are viewed as a fluid mixture. A suitably approximated model which traces back to Landau allows the description of first and second sounds. Here the conjecture that second sound is generated by the normal heat flux is made formal by investigating the propagation of discontinuity waves governed by a Maxwell–Cattaneo objective equation.

14.1 Superconductors 14.1.1 Discoveries About Superconductivity An outline is given of the main discoveries about superconductivity, starting from the discovery made by Kamerlingh Onnes in 1911; a detailed review and some

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_14

835

836

14 Superconductivity and Superfluidity

mathematical models are given in [193] along with an exhaustive list of references on the subject.1 Drude’s model of conductivity (1900) regarded metals as containing an electron gas responsible for the electrical conductivity. The known formula of conductivity σ = e2 n/mν is based on the view that the electrons are subject to the acceleration eE/m and that the mean velocity is v = eE/mν, where ν is the frequency of electron collisions with the lattice. Hence the current density J = nev, n being the number of electrons per unit volume, results in J=

ne2 E νm

whence the result for σ = J/E. However the temperature dependences n(θ) and ν(θ) remained quite unknown. This was a motivation for experiments in 1911 by Kamerlingh Onnes on the resistivity 1/σ of metals. Unexpected results appeared but these were unconvincing because the value was affected by the purity. Then Kamerlingh Onnes decided to study mercury and the resistance of the mercury sample dropped sharply at 4.15 K to an unmeasurably small value. It was natural that Kamerlingh Onnes would choose the name superconductivity for this phenomenon of perfect conductivity. Soon after the discovery by Kamerlingh Onnes many other metals were found to exhibit zero resistance when their temperatures were lowered below a characteristic value called the critical temperature,2 θc . In 1933 Meissner and Ochsenfeld studied the magnetic behaviour of superconductors in the presence of a magnetic field and found that, below their critical temperatures, the magnetic flux is expelled from the interior of the superconductor [309]. Moreover, at a fixed temperature θ ≤ θc , these materials lost their superconducting property above a certain time-dependent critical magnetic field, H0c (θ). Meissner and Ochsenfeld also discovered that, in the presence of a suitably weak magnetic field, by cooling the temperature below θc the field was expelled so that H = 0 everywhere in the superconductor; this property is usually referred to as the Meissner effect. Modelling and explanations of superconductivity were developed by F. and H. London (1935) and by Bardeen, Cooper, and Schriffer (1957). In 1962, Josephson [243] predicted a tunnelling current between two superconductors separated by a thin insulating barrier; the Josephson effect was next verified and many physical devices are based on this property. In 1986, Bednorz and Müller [32] showed that superconductivity occurs in some materials at about 30 K. This was another success, in the history of superconductivity, that opened research and applications at (relatively) high temperatures. Relative to the dependence of the critical magnetic induction on the temperature, two different behaviours are shown by superconductors. In type I superconductors 1

In 1913 Kamerlingh Onnes was awarded the Nobel prize in physics for the study of matter at low temperatures and the liquefaction of helium. As remarked by Ginzburg [193], the superconductivity was first observed by Gills Holst who conducted measurements of memory resistance. Kamerlingh Onnes did not mention the name of Holst in the corresponding publication. 2 Usually denoted by T . c

14.1 Superconductors

837

Fig. 14.1 The behaviour of the critical magnetic field with temperature was found empirically to be parabolic: H(θ) = H0c (θ)/H0c (0), ϑ = θ/θc

the critical magnetic induction varies with temperature according to the approximate law H0c (θ) = H0c (0)[1 − (θ/θc )2 ], θ ∈ [0, θc ]. As H0 increases, with 0 ≤ H0 ≤ H0c (0), the temperature at which the material is no longer superconducting decreases. If H0 > H0c (0) then the material never becomes superconducting at any temperature θ (Fig. 14.1). By the 1950s it was clear that there is another class of superconductors then named type II superconductors. These materials are characterized by two critical fields, say Hc1 and Hc2 with Hc1 < Hc2 . As θ < θc , when the applied field is smaller than the lower critical field Bc1 the material is superconducting and there is no flux penetration, as with type I superconductors. When the applied field is between Hc1 and Hc2 the material is in a mixed state, referred to as the vortex state, where the material has perfect conductivity but allows for partial flux penetration. If H > Hc2 then the material is a normal conductor.

14.1.2 The London Equations In 1935, soon after the discovery of the Meissner effect, the brothers Heinz and Fritz London developed a phenomenological theory of superconductivity [281]. The London theory provides a remarkable scheme for the electrodynamics of superconductors. Among the consequences is the prediction of a nonzero penetration depth. Moreover, the London equations are likely to provide an idealized limit model of more involved equations of superconductivity. The London equations are based on the so-called two-fluid model which traces back to Becker et al [31] and Gorter and Casimir [200]. The electrons of a superconducting material are divided in normal (as the electrons in a normal material scatter and suffer resistance to their motion) and superconducting (they cross the metal without suffering any resistance). Below the critical temperature θc the current consists of superconducting electrons and normal electrons. Above the critical temperature

838

14 Superconductivity and Superfluidity

only normal electrons occur. Consistently, the current density J is supposed to be the sum of a normal and a superconducting part, that is, J = Jn + Js , the subscripts n and s denoting quantities pertaining to normal and superconducting electrons. While Jn is assumed to satisfy Ohm’s law, Jn = σE, the superconducting part is assumed to satisfy an appropriate (London’s) equation. To obtain the whole set of London electrodynamic equations, we start with the view that the superconducting electrons are an incompressible, inviscid, charged fluid. The fluid is acted upon by an electric field E and a magnetic induction B. The Lagrangian derivative v` s can be written as v` s = ∂t vs + (vs · ∇)vs = ∂t vs + 21 ∇vs2 − vs × (∇ × vs ). Hence, letting γe = e/m e we can write the equation of motion in the form ∂t vs + 21 ∇vs2 − vs × (∇ × vs ) = − whence

1 ∇ ps + γe (vs × B) + γe E, ρs

∂t vs + 21 ∇vs2 − vs × w = −

where

1 ∇ ps + γe E, ρs

w = ∇ × vs + γe B.

(14.1)

(14.2)

By applying the curl operator to (14.1) and using Faraday’s law ∇ × E = −∂t B and (14.2) we obtain ∂t w = ∇ × (vs × w). Of course ∇ · w = 0. Hence using the divergence and Stokes theorems we find ∫ w · n da = 0,

S0

∫(vs × w) · n da = ∂t ∫ w · t ds S

∂S

for any closed surface S0 and open surface S. These relations are compatible with the assumption w = 0, that is, ∇ × vs + γe B = 0,

(14.3)

at any point and time. Consequently, since Js = n s evs then it follows ∇ × (Js ) + B = 0,

(14.4)

14.1 Superconductors

839

where =

m e2 n

. s

Another equation follows by considering again the equation of motion and using Eq. (14.3) to obtain 1 e E. ∂t vs + 21 ∇vs2 = − ∇ ps + ρs me Now two assumptions are made, namely, the nonlinear term ∇vs2 is neglected and the pressure ps is taken to be zero. Hence, the equation of motion simplifies to ∂t vs =

e E, me

whence ∂t (Js ) = E.

(14.5)

Equations (14.4) and (14.5) are the two London equations of superconductivity. Some comments are in order. The original derivation of (14.5) was based on the equation of motion m e v˙ s = eE, which upon substitution of vs = Js /n s e gives (Js )˙ = E, the dot denoting an unspecified time derivative. As a further comment, look at Faraday’s law along with (14.5). We can write ∂t (Js ) = E,

∇ × E = −∂t B.

Applying the curl operator to the first equation and substituting ∇ × E, we have ∂t [∇ × (Js ) + B] = 0.

(14.6)

In [281] not only the derivative is taken to be zero but also the function is assumed to vanish. If  is constant then (14.3) follows. The whole set of differential equations governing the evolution of the superconducting body is given by Maxwell’s equations, ∇ · D = q, ∇ · B = 0, where J = Jn + Js while

∇ × E = −∂t B, ∇ × H = J + ∂t D,

840

14 Superconductivity and Superfluidity

Jn = σE,

∇ × (Js ) + B = 0,

∂t (Js ) = E.

The scalar  is now taken to be constant. The scheme is complete once the constitutive relations for D and H are made precise.

Penetration Depth and Meissner Effect Assume for simplicity that time variations are slow enough that the displacement current ∂t D is negligible. Hence we assume B = μH and consider the approximate equation ∇ × B = μJ. Consistently, we assume that stationary conditions are allowed so that we take ∂t B = 0 whence ∇ × E = 0, ∇ × Jn = σ∇ × E = 0. Hence, we can write ∇ × (∇ × B) = μ∇ × (Js + Jn ) = μ∇ × Js = −

μ B. 

Thus, the magnetic induction satisfies the differential equation B = αB,

α :=

μn s e2 μ = .  me

(14.7)

This equation allows us to evaluate the magnetic induction in a superconductor. Address attention to the half-space x ≥ 0 and assume that B depends on the single Cartesian coordinate x. The differential equation has the (bounded) solution √ B(x) = B(0) exp(− α x). Thus the magnetic field is exponentially screened from the interior of the sample. Indeed,  √ λL := 1/ α = m e /μn s e2 is called the London penetration depth. The same screen effect holds for the current Js . By ∇ × Js = −B/ it follows that B and Js are orthogonal to each other and Js (x) = Js (0) exp(−x/λL ),

|Js | = (λL /)|B|.

We now see how the London equations allow us to explain the Meissner effect and, meanwhile, that a superconductor is not merely a perfect conductor. Inside a perfect conductor the resistivity is zero and hence E = 0. Consequently, by Faraday’s

14.1 Superconductors

law

841

˙ B, ∫ E · tds = − C

∇ × E = −∂t B,

it follows that the magnetic flux  B through any loop in the material is constant or ∂t B = 0. Now, if a perfect conductor is cooled below its critical temperature, in the presence of an applied magnetic field, the field should be trapped in the interior of the conductor even after the field is removed (∂t B = 0). If instead the field is applied after cooling below θc the field should be expelled from the (perfect) conductor. Yet, the experiments conducted by Meissner and Ochsenfeld showed that, at θ < θc , the magnetic field is expelled whether the field was applied before or after the material was cooled below the critical temperature. The difference between perfect conductors and superconductors occurs when the field is applied before the cooling below θc . Now, the reason why the field is expelled from the superconductor is due to the penetration depth that bounds B (and Js ) to a layer of thickness λL which in turn is a consequence of (14.4). We can also describe the effect by saying that when a sample is placed in an external induction field B0 the sample acquires a magnetization M. The magnetic induction B inside the sample is given by B = B0 + μ0 M. In the superconducting state B = 0 and then it follows that M=−

1 B0 . μ0

Whenever a material is in a superconducting state, the magnetization opposes the external magnetic induction and hence the corresponding susceptibility is negative as it happens for diamagnetic substances. Yet the vanishing of B within the superconductor may be viewed as the result of induction surface currents such that the produced magnetic induction Bs , via ∇ × (Js ) + Bs = 0, makes the whole induction to vanish, B = B0 + Bs = 0.

Thermodynamic Consistency of the London Equations First, we observe that the London equations consist of the Maxwell equations, the splitting J = Jn + Js of the current, and Jn = σE,

∂t Js = E.

The curl of the last one gives (14.6). We now show that this set of equations is thermodynamically consistent. The London equations are then consistent with thermodynamics in that (14.4) follows from (14.6) as a particular case.

842

14 Superconductivity and Superfluidity

The superconductor is assumed to be macroscopically undeformable. Moreover, as with the London equations, we neglect nonlinear terms. The total time derivative J˙ coincides with the partial time derivative ∂t J. A state of the superconductor is the set of values (Js , E, H, θ, ∇θ). The entropy (per unit mass) η, the Gibbs free energy3 φ = ε − θη − P · E/ρ − μ0 M · H/ρ, the normal current Jn , the heat flux q, and the time derivative ∂t Js are assumed to be functions of the state variables. Indeed, the constitutive functions for Jn and ∂t Js are just taken to be given by the London theory. The second-law inequality is taken in the form ˙ + E · J − P · E˙ − μ0 M · H ˙ − 1 q · ∇θ ≥ 0. −ρ(φ˙ + η θ) θ To save writing we let φ be independent of ∇θ. Upon evaluation of φ˙ and substitution we have ˙ − ρ∂J φ · J˙ s + E · J − −ρ(∂θ φ + η)θ˙ − (ρ∂E φ + P) · E˙ − (ρ∂H φ + μ0 M) · H s

1 q · ∇θ ≥ 0. θ

˙ E, ˙ H ˙ implies that The arbitrariness of θ, η = −∂θ φ,

P = −ρ∂E φ,

μ0 M = −ρ∂H φ.

Substitution of J = Jn + Js , Jn = σE, and J˙ s = E/ yields the reduced inequality (−

1 ρ ∂J φ + Js ) · E + σE2 − q · ∇θ ≥ 0.  s θ

The inequality holds if σ ≥ 0 and Js =

ρ ∂J φ,  s

q = −κ∇θ, κ ≥ 0.

Hence, the London theory is thermodynamically consistent and, moreover, the Gibbs free energy takes the form φ=

 2 J + (θ, E, H) 2ρ s

while Fourier law for the heat flux is allowed. We append some comments on the constitutive equation for the electric current J. By 3

In general we view the Gibbs free energy as a Legendre transform of the Helmholtz free energy ψ = ε − θη.

14.1 Superconductors

843

1 J˙ s = E,  it follows

Jn = σE,

J = Jn + Js

1 J˙ = σ E˙ + E. 

Applying the divergence operator we find that the charge density q satisfies the differential equation σ 1 q¨ + q˙ + q = 0. 0 0  Any charge buildup in superconductors decays at the rate exp(−t/τ ), τ > 0. F. London [283] argued that the decay is rapid in that τ is estimated to be about 10−12 sec. Yet the two values of τ are approximately τ+  0 /σ,

τ−  σm e /n 2s e.

Since σ → ∞ as θ → 0 and n s → 0 or θ → θc it follows that τ− becomes large as θ → θc and fluctuations should persist for long times. No such phenomena have ever been observed in superconductors as θ is close to zero or θc . Taking the time derivative of Faraday’s equation ∇ × B = μJ + μ∂t E, we have ∇ × ∂t B = μ∂t (Jn + Js ) + μ∂t2 E. Since ∂t B = −∇ × E, ∂t Js = E/, Jn = σE, if the charge density is zero it follows E = ∂t2 E + μσ∂t E + (μ/)E. For slowly varying fields, we can neglect the time derivatives and consider the approximate equation 1 μn s e2 1 = . E = 2 E, λL λ2L me This equation implies that an electric field penetrates a distance λL , as a magnetic field does. However, by ∂t Js = E/, the occurrence of a static electric field would imply an unbounded current Js , which is not observed experimentally. This motivates alternative approaches to the modelling of superconductors.

14.1.3 An Alternative to London Equations Maxwell’s equations ∇ · B = 0 and ∇ × E = −∂t B allow B and E to be expressed by a vector potential A and a scalar potential ϕ in the form

844

14 Superconductivity and Superfluidity

B = ∇ × A,

E = −∇ϕ − ∂t A.

The second London equation ∇ × Js + B = 0 allows us to view −Js as a vector potential for B. Now, since B = ∇ × A, it follows that A and −Js are equal to within a gradient term, Js = −

1 (A + ∇), 

(14.8)

where, so far,  is any smooth scalar function of space and time. Upon partial time differentiation and substitution of E we have ∂t (Js ) = E + ∇(ϕ − ∂t ).

(14.9)

In stationary conditions, it follows from (14.9) that E = −∇ϕ. This is consistent with the existence of a static electric field in a superconductor, given by the potential ϕ, which does not generate an electric current. The observation that ∂t Js = 0

⇐⇒

E = −∇ϕ

is consistent with the existence of a static electric field in a superconductor, given by a potential ϕ, which does not generate an electric current. This is conceptually different from the London equation ∂t Js = E whereby an electric field produces an electric current. Accordingly, we now take (14.9) as one of the equations of superconductivity. While ∇ × Js + B = 0 involves physical fields, the other equations involve the potentials , ϕ and such fields are admissible provided the physical fields turn out to be invariant. We then look for conditions (gauges) such that the physical fields are left invariant by changes of  and ϕ. As is customary, let the vector potential A and the scalar potential ϕ satisfy the Lorentz gauge ∇ · A + ∂t ϕ = 0. Hence the divergence of (14.8) results in ∇ · Js = ∂t ϕ − . If we let ∂t  = ϕ it follows ∇ · Js = ∂t2  − .

(14.10)

14.1 Superconductors

845

The condition of charge conservation ∇ · Js = 0 is related to admissible potentials  in the form ∇ · Js = 0 ⇐⇒ ∂t2  −  = 0. If the Coulomb gauge ∇ · A = 0 is assumed then (14.8) implies ∇ · Js = −/ and hence the physical admissibility reads ∇ · Js = 0

⇐⇒

 = 0.

Incidentally, with ϕ = ∂t , time differentiation of (14.10) and the divergence of ∂t (Js ) = E yield ∂t2 ϕ − ϕ = ∇ · E. Maxwell’s equation ∇ ·E=

q 0

then provides the well-known D’Alembert equation for the scalar potential ϕ, ∂t2 ϕ − ϕ =

q . 0

Let  be the region occupied by the superconductor. In view of (14.10), the boundary condition on ∂, Js · n = 0, results in ∫(∂t2  − )dv = 0. 

The time derivative and the condition ∂t  = ϕ lead to ∫ q dv = 0, 

whence it follows that the superconductor is globally neutral.

14.1.4 A Nonlocal Model In static situations, experiments show a magnetic penetration depth that is significantly larger than the prediction of London theory, in particular, in dirty samples

846

14 Superconductivity and Superfluidity

with large scattering rates in the normal state. An explanation is provided on the basis of a nonlocal electromagnetic response of the superconductor. The underlying idea is that the quantum state of the electrons forming the superfluid cannot be arbitrarily localized. Pippard [363] introduced the coherence length ξ0 as a measure of the minimum extent of electronic wavepackets and proposed that the local equation Js = −A/ be replaced by the nonlocal equation Js (x) = −

3 4πξ0

 R3

exp(−r/ξ0 )

r · A(y) r dv y , r4

r = x − y, r = |r|.

The form of this equation was motivated by an earlier nonlocal generalization of Ohm’s law. The main point is that the action of the field A is appreciable in a neighbourhood of radius ξ0 . Incidentally, if A does not change appreciably within the sphere |y − x| ≤ ξ0 then 3 Js (x)  − 4πξ0

 R3

exp(−r/ξ0 )

r · A(x) r dv y = −A(x). r4

In the presence of strong scattering, due to dirty samples, the electrons can be localized on the scale of the mean free path . Hence, in [363], the neighbourhood of appreciable influence is formally magnified by letting Js (x) = − where

3 4πξ0

 exp(−r/ξ) R3

r · A(x) r dv y , r4

1 1 1 = + ξ ξ0 β

and β is a constant of order unity. This expression is essentially the same as that obtained within the BCS [28] theory. In the dirty limit  ξ0 , so that ξ  β, if A is appreciable only within r ≤ ξ then it follows that Js (x) = −

β A(x). ξ0

The penetration depth is then increased by a factor

√ ξ0 /β.

14.1.5 Phase Transition Curve The transition curve between the superconducting and the normal state shows that, as θ ≤ θc , the transition is affected by the magnetic field. For simplicity, let E = 0 and hence consider the magnetic Gibbs function per unit mass φ(θ, B) = ε − θη − M · B/ρ,

14.1 Superconductors

847

ρ being the constant mass density. Since M = −ρ∂B φ then

B 1B φ(θ, B) − φ(θ, 0) = ∫ ∂B φ · dB = − ∫ M · dB. ρ0 0

We assume the dependence of M on B is linear, M=

χ B; μ0

indeed conductors and superconductors exhibit diamagnetism so that χ < 0, possibly χ = −1. Let B = B(ξ), ξ ∈ [0, 1]. Hence it follows φ(θ, B) − φ(θ, 0) =

|χ| 2 |χ| 1 ∫ B · B dξ = B . ρμ0 0 2ρμ0

For the type I superconductor, as B varies from 0 to the critical value Bc , we have φs (θ, Bc ) − φs (θ, 0) =

|χ| 2 B . 2ρμ0 c

The transition curve is the graph of the function Bc (θ), θ ∈ (0, θc ]. At the states (θ, Bc (θ)) the superconducting electrons are at equilibrium with normal electrons and hence we let φs (θ, Bc (θ)) = φn (θ, Bc (θ). The normal phase is assumed to be nonmagnetic and then φn (θ, Bc (θ)) = φn (θ, 0). Hence it follows that φn (θ, 0) − φs (θ, 0) =

|χ| 2 B . 2ρμ0 c

Differentiation with respect to θ gives ηn (θ) − ηs (θ) = −

|χ| d Bc . Bc ρμ0 dθ

Now θ(ηn − ηs ) is the latent heat L of transition from superconductor to normal conductor. Hence we have |χ| d Bc . Bc L=− ρμ0 dθ

848

14 Superconductivity and Superfluidity

By the third law of thermodynamics, the entropy change associated with any condensed system undergoing a reversible isothermal process approaches zero as the temperature at which it is performed approaches 0 K. Now, ηn (θ) − ηs (θ) → 0 as θ → 0 requires that any transition curve satisfies d Bc → 0 as θ → 0. dθ

(14.11)

For the type II superconductor, the argument can be repeated except that the upper value of Bc is Bc2 , that is, the value at the transition from the vortex state to the normal state.

14.1.6 Ginzburg–Landau Theory The Ginzburg–Landau theory [194] deals with the intermediate phase between the normal state and the superconducting one in stationary conditions. The approach is variational in character in that it is based on an assumption about the free energy functional and the pertinent equations governing the system are the corresponding Euler–Lagrange equations. Landau [266] had already suggested that the transition should be described by a complex-valued parameter , called the order parameter. The physical meaning of  is specified by saying that ||2 is the number density, n s , of superconducting electrons. Hence  = 0 means that the material is in the normal state, θ > θc , whereas || = 1 corresponds to the state of a superconductor, θ = 0. Incidentally, at first Gorter and Casimir [200] elaborated a thermodynamic potential with a real-valued parameter. Later Ginzburg and Landau argued that the order parameter should be complex valued so as to make the theory gauge invariant. The starting idea is that, at zero magnetic field, the (magnetic) free energy φ0 , as θ is around θc , is written in the form φ0 (θ, ) = −a(θ)||2 + 21 b(θ)||2 . Higher order terms in ||2 are neglected, which means that the model holds for small values of ||. If a magnetic field occurs then the free energy comprises a magnetic energy density B2 /2μ. Letting  be the region occupied by the material we express the global free energy in the form  = ∫ φ(θ, , B)dv = ∫ [φ0 (θ, ) + 



1 2 1 1 ∫ A × Bex · n da, B + |i∇ + es A|2 ]dv − 2μ 2m s μ ∂

14.1 Superconductors

849

where m s and −es are the effective mass and charge of the superelectrons, A is the vector potential associated with B, and  is Planck’s constant. The vector Bex denotes the external magnetic induction on the boundary ∂ and is subject to ∇ × Bex = 0. Assume that the free energy is stationary at equilibrium. The temperature θ is supposed to be fixed and consistently attention is restricted to quasi-static processes whereby E = 0. Since the Ginzburg–Landau theory applies to stationary conditions we let ∇ × B = μJs , where Js is considered instead of J = Jn + Js . Since B = ∇ × A we can write ∇ × (∇ × A) = μJs . The functional  involves  and A as the unknown functions and hence we let B2 = (∇ × A)2 . We then derive the corresponding Euler–Lagrange equations. Observe that ||2 =  ∗  is differentiable at  = 0. It is standard to let ∂ ||2 =  ∗ . Moreover, observe ∇ · ∂∇A 21 (∇ × A)2 = ∇ × (∇ × A). Hence the Euler–Lagrange equations of φ, ∂ φ − ∇ · ∂∇ φ = 0,

∂A φ − ∇ · ∂∇A φ = 0,

can be written in the form 1 (i∇ + es A)2  − a + b||2  = 0, 2m s Js = −i

es e2 ( ∗ ∇ − ∇ ∗ ) − s ||2 A. 2m s ms

(14.12)

(14.13)

Equations (14.12) and (14.13) are called the Ginzburg–Landau equations (of superconductivity). A connection with London equations now follows. Let  = || exp(iα) so that Eq. (14.13) reads Js =

 es e2 1 ||2 ∇α − s ||2 A = − (− ∇α + A) ms ms  es

where  = m s /es2 ||2 . Applying the curl operator we obtain the London equation ∇ × (Js ) + B = 0.

850

14 Superconductivity and Superfluidity

14.2 Superfluids 14.2.1 History of Superfluidity and Properties of Superfluids Superfluidity is the characteristic property of a fluid with zero viscosity which therefore flows without loss of kinetic energy. When stirred, a superfluid forms cellular vortices that continue to rotate indefinitely. Superfluidity occurs in certain substances under special conditions. In particular, it occurs in two isotopes of helium when they are liquified by cooling to cryogenic temperatures. Helium was first liquefied in 1908 by Kamerlingh Onnes who cooled it below the liquid/gas transition temperature of 4.2 K. Later, in 1927, Wolfke and Keesom realized that there is another phase transition at a lower temperature, 2.17 K. This phase transition had shown up via a discontinuity of the specific heat, whose curve as a function of temperature resembles the letter λ, whence the name of λ point for that transition. The two phases of liquid helium were termed helium I and helium II. The remarkable superfluid properties of helium II (strikingly low viscosity) were experimentally established by Kapitza [250] and independently by Allen and Misener [5] in 1938. Shortly afterwards Fritz London suggested that superfluidity could have some connection with Bose–Einstein condensation. London also realized that there might be a strong connection with superconductivity, which had been discovered many years before and which could be seen as superfluidity in the electron gas in a metal. Next London [282] and Tisza [417, 418] suggested that liquid helium could be described by a two-fluid model, the condensed and non-condensed atoms being identified with the superfluid and normal components, respectively; natural helium consists essentially of isotope 4 He satisfying the mentioned properties.4 Some remarkable effects should be considered to set up a realistic model. The thermomechanical effect, discovered in 1938, is shown up as follows. Consider a flask filled with liquid helium. A differential container is placed inside the liquid. Throughout the liquid has a uniform concentration of normal fluid (and superfluid). Upon heating the liquid in the inner container, the concentration of normal component increases in that a fraction of superfluid becomes normal. By diffusion, superfluid from the colder region flows into the hotter (though θ < θλ ) region while a flow of normal fluid in any direction, in the inner container, cannot occur because of (viscosity and) the channel resistance. The added superfluid in the hotter region partly becomes normal to get the equilibrium concentration. We then observe an increased mass of the liquid in the hotter region. The process stops when the pressure in the superfluid component is uniform across the channel. The mechanocaloric effect was discovered in 1939. The apparatus consists of a round flask filled with a fine powder and liquid helium. The flask has an opening at the bottom. The superfluid component flows through the fine powder whereas the normal component, being viscous, cannot flow. As a result, the concentration of 4

Helium has another isotope, 3 He. It becomes a superfluid at temperatures of about 2 mK.

14.2 Superfluids

851

the normal component increases above the powder. A thermometer inside the liquid shows an increase of the temperature. Indeed, the temperature increases up to the equilibrium value corresponding to the new concentration. The fountain effect shows up in the following experience. A tube, with a fine capillary, is filled with a fine powder (superleak) and is immersed in liquid helium. By heating the powder, and hence the liquid in the tube, eventually the superfluid squirts out through the fine capillary at the top. The creeping effect is also named Rollin effect after Bernard V. Rollin who discovered it in 1937. Consider a round flask containing liquid helium. If the flask is lowered into a bath of liquid helium then a (Rollin) film clings to the wall and gradually fills the flask. If, instead the flask is raised above the bath level then it empties out. In both cases, the fluid is able to flow against gravity, in the form of about 30 nm film, with a net flux from the higher level to the lower one. For the liquid film, the capillary forces (Van der Waals attraction) dominate the gravity and viscous forces. The rate of flow turns out to be independent of the height of the barrier and it increases as temperature decreases and hence the superfluid concentration increases. The rate is zero at θ = θλ (absence of superfluid) and is constant below 1.5 K (superfluid concentration constant). The effects described so far seem to indicate that liquid helium should be viewed as a binary mixture of two fluids, the normal fluid and the superfluid. Following this two-fluid model, denote by the subscripts n and s the quantities pertaining to the normal fluid and the superfluid. Hence ρn , vn , and ρs , vs are the mass density, and the velocity of the two fluids. Also let ρ = ρn + ρs be the density of helium. It is worth observing that, in stationary conditions, the two mass fractions ωs , ωn satisfy the conditions ωs + ωn = 1,

ωs → 0 as θ → θλ− ,

ωs → 1 as θ → 0+ .

Hence, as θ ∈ (0, θλ ), there is a free transition from the superfluid to the normal fluid and vice versa [337], Sect. 6.8. We then regard helium as a reacting mixture with a mass growth τs = −τn given by a function of θ and ωs , or θ and ωn . The normal fluid is taken to be viscous, Tn = − pn 1 + 2νn Dn + λn ∇ · vn 1, whereas the superfluid is regarded as an ideal fluid, Ts = − ps 1, qs = 0. With this view, the balance equations can be written as ∂t ρn + ∇ · (ρn vn ) = −τs , ∂t ρs + ∇ · (ρs vs ) = τs , ∂t (ρn vn ) + ∇ · (ρn vn ⊗ vn ) − ∇ · Tn − ρn bn = −ms , ∂t (ρs vs ) + ∇ · (ρs vs ⊗ vs ) + ∇ ps − ρs bs = ms , ∂t (ρn εn ) + ∇ · (ρn εn vn ) − Tn · Dn + ∇ · qn − ρn rn = −ls , ∂t (ρs εs ) + ∇ · (ρs εs vs ) + ps ∇ · vs − ρs rs = ls ,

852

14 Superconductivity and Superfluidity

ms and ls being the growths of linear momentum and energy of the superfluid. The inviscid character of the superfluid component is modelled by assuming that it carries no heat or entropy (ηs = 0). Moreover vs is curl free,5 ∇ × vs = 0. At any point of a boundary, with unit normal n, the tangent component of vs is unrestricted whereas vs · n = 0 and vn = 0. Since we let the two fluids have a common temperature, θn = θs = θ, the secondlaw inequality (9.42) can be written as −ρn (ψ` n + ηn θ`n ) − ρs (ψ` s + ηs θ`s ) − τs (ψs − ψn + 21 (vn2 − vs2 )) 1 −ms · (vs − vn ) + Tn · Dn − ps ∇ · vs − qn · ∇θn ≥ 0, θn the extra-entropy fluxes kn , ks being considered to be zero for the present purposes. For simplicity, we let ψn = ψn (θn , ρn ),

ψs = ψs (θs , ρs ),

qn = −κn ∇θn .

Since ρ`n = −ρn ∇ · vn − τs ,

ρ`s = −ρs ∇ · vs + τs

it follows that −ρn (∂θn ψn + ηn )θ` n − ρs (∂θs ψs + ηs )θ` s + (ρ2n ∂ρn − pn )∇ · vn + (ρ2s ∂ρs ψn − ps )∇ · vs −τs (ψs + ρs ∂ρs ψs − ψn − ρn ∂ρn ψn + 21 (vn2 − vs2 )) 1 κn (∇θn )2 ≥ 0. −ms · (vs − vn ) + 2νn Dn · Dn + λn (∇ · vn )2 − θn

As with the Navier–Stokes–Fourier’s fluid we obtain ηn = −∂θn ψn , ηs = −∂θs ψs ,

pn = ρ2n ∂ρn ψn ,

ps = ρ2s ∂ρs ψs ,

−τs [ψs + ρs ∂ρs ψs − ψn − ρn ∂ρn ψn + 21 (vn2 − vs2 )] − ms · (vs − vn ) ≥ 0, and νn ≥ 0, 2νn + 3λn ≥ 0, κn ≥ 0. Since ρn ∂ρn ψn = pn /ρn and the analogue for ps , let pn ps μn = ψn + , μs = ψs + ρn ρs be the chemical potentials. Since μn , μs are independent of the velocities then, borrowing from (9.81), we find that the two inequalities 5

This is not exactly true, as we comment upon in Sect. 14.2.5.

14.2 Superfluids

853

(μs − μn )τs ≤ 0,

1 τ (v2 2 s n

− vs2 )) + ms · (vs − vn ) ≤ 0

hold separately. The first inequality governs the time dependence of the transition. The second inequality holds with ms = M(vn − vs ) + τs v + 21 τs (un + us ), where M ≥ 0, whence 1 τ (v2 2 s n

− vs2 )) + ms · (vs − vn ) = −M(vn − vs )2 .

As a comment to the equation for ms , we observe that the term τs v vanishes in the barycentric reference. Moreover, since ρn vn + ρs vs = 0 it follows that us and un are antiparallel thus making un + us rather small. It is then likely that M(vn − vs ) is the dominant term and this justifies the customary assumption ms = M(vn − vs ). At equilibrium, the ratio ρs /ρ is a function of temperature, ρs /ρ = f (θ). It is then natural to let τs = τˆs (ρs /ρ − f (θ)),

y τˆs (y) ≤ 0, τˆ (0) = 0.

It is worth looking at other model equations which appeared in the literature. First, it is usually assumed that ηs = 0, which would imply ψs = ψs (ρs ). This assumption is motivated by quantum statistics which suggests that the superfluid is the prevailing phase around absolute zero and there the entropy vanishes according to Nernst’s third law of thermodynamics. Often6 the equations of motion are written in the form ρn ∂t vn + ρn (vn · ∇)vn = −ωn ∇ pn − ρs η∇θ − ms + νvn , ρs ∂t vs + ρs (vs · ∇)vs = −ωs ∇ ps + ρs η∇θ + ms + ρs t,

(14.14) (14.15)

where the present notation is used and t is the tension. According to the literature, the terms ω∇ p and ±η∇θ arise from −∇μ/ρ as the force per unit mass. Indeed, by [10], Eq. (B1), Euler’s equation of motion in a classical ideal fluid is ∂t v + (v · ∇)v = −∇μ. Likewise, by [432], Sect. 2, the effective energy per atom in the condensate state is (the mass times) the chemical potential per unit mass and then7 ∂t vs = −∇μ. Since ∂θ μ(θ, p) = −η, ∂ p μ(θ, p) = 1/ρ, then ∇μ = −η∇θ + ∇ p/ρ. 6

See, e.g. [135] and [34, 221] for the so-called HVBK model. In Sect. 14.2.6, we point out how −∇μ can be viewed as the force acting on the superfluid on the basis of Landau’s approach.

7

854

14 Superconductivity and Superfluidity

Hence Eqs. (14.14), (14.15) are claimed to follow. Yet, within the theory of mixtures we have pα = ρ2α ∂ρα ψα , ηα = −∂θ ψα , ρα ∂ρα μα = ∂ρα pα , ρα ∂θ μα = ∂θ pα − ρα ηα , whence ρα ∇μα = ∇ pα − ρα ηα ∇θ. Now, (14.15) follows by letting ∇μs = −η∇θ + (1/ρ)∇ ps , where ρ and η occur instead of ρs and ηs . In addition we observe that ηs = 0 is a common assumption in theories of liquid helium. If Eqs. (14.14)–(14.15) are assumed then the second-law inequality reduces to 1 (−ms + ρs η∇θ) · (vs − vn ) − 21 τs (vn2 − vs2 ) − qn · ∇θ ≥ 0 θ or

1 −ms · (vs − vn ) − 21 τs (vn2 − vs2 ) − (qn + θρs η(vn − vs )) · ∇θ ≥ 0. θ

This inequality holds if [164] −ms · (vs − vn ) − 21 τs (vn2 − vs2 ) ≥ 0, qn = θρs η(vn − vs ) − κn ∇θ, κn ≥ 0. Though this constitutive scheme, and hence the heat flux θ

ρs ρn ηn (vn − vs ) ρs + ρn

in the normal fluid, is formally consistent with the second-law inequality, to our mind, both the equation of motion in the form ∂t vs = −∇μ and the view of ±ρs η∇θ as growths of linear momentum are hardly convincing on physical grounds.

14.2.2 First and Second Sound In helium below the lambda point, a wave propagates in addition to the customary sound wave and that is why it is called second sound. Second sound is observed via pulse propagation; the speed is close to zero near the lambda point, increasing to approximately 20 m/s around 1.8 K, about ten times slower than normal sound waves. At temperatures below 1 K, the speed of second sound in helium II increases as the temperature decreases [440]. Peshkov, in 1944, detected second sound waves and confirmed that its temperature variations are uncoupled from the pressure variations (first sound) [359].

14.2 Superfluids

855

The rather involved structure of the balance equations for helium II justifies some approximations; here we revisit an approach that traces back to Landau8 [267, 268]. Following are the approximations: the normal fluid is inviscid, Tn = − pn 1; ms = 0 and bs , bn = 0; the two components are at equilibrium in that μs = μn ; there is no thermal expansion so that ∂θ ρ(θ, p) = 0, ∂ p η(θ, p) = 0; ρ = ρ¯ +  and quadratic terms in n , s , vn , vs are neglected. Summation of the continuity equations and of the equations of motion leads to ∂t ρ + ∇ · (ρv) = 0,

∂t (ρv) + ∇ p = 0,

the quantities ∇ · (ρn vn ⊗ vn ) and ∇ · (ρs vs ⊗ vs ) being neglected. Since ρ = ρ( p) then dρ ∂t p. ∂t ρ = dp By linearization dρ/dp is evaluated at p(ρ). ¯ By applying the time derivative and the divergence, respectively, and substituting ∂t ∇ · (ρv) we obtain ∂t2 p −

dp p = 0 dρ

(14.16)

√ whence it follows that perturbations of p and ρ propagate with speed c = dp/dρ (first sound waves). We now take the difference between the equations of motion with ms = 0, bs = 0, ¯ t v, letting bn = 0. Upon linearization, ∂t (ρv)  ρ∂ Vs = vs − vn we find ∂t Vs +

1 1 ∇ ps − ∇ pn = 0. ρ¯s ρ¯n

Since ∂ρα pα = ρα ∂ρα μα and ∂θ εα = θ∂θ ηα then 1 1 1 ∇ pα = (∂ρα pα ∇ρα + ∂θ pα ∇θ) = ∇μα − ∂θ μα ∇θ + ∂θ pα ∇θ ρα ρα ρα = ∇μα − ∂θ (εα − θηα )∇θ = ∇μα + ηα ∇θ whence, in view of the assumptions μs = μn and ηs = 0, 1 1 ρ ∇ ps − ∇ pn = ∇(μs − μn ) + (ηs − ηn )∇θ = −ηn ∇θ = − η∇θ. ρs ρn ρn Consequently, upon linearization, 8

See also [337], Sect. 6.8.1.2.

856

14 Superconductivity and Superfluidity

ρ¯n ∂t Vs = ρ¯η∇θ. ¯

(14.17)

The analysis of the balance of energy allows us to find a further equation for Vs thus leading to a differential equation for the temperature. By neglecting us2 - and un2 -terms we can express the balance of energy in the form ∂t (ρε) + ∇ · (ρεv) + ∇ ·



ρα (εα + pα /ρα )uα + p∇ · v = 0,

α

where ρε 

 α

ρα εα and p 

 α

pα ; hence ε  ε I . Since ρs us + ρn un = 0 then

ρs us = ρs (vs − v) = ρs (Vs + un ) = ρs Vs − (ρs /ρn )ρs us whence it follows the identity ρs us =

ρs ρn Vs . ρ

Hence, upon the approximation ∂t (ρε) + ∇ · (ρεv)  ρ∂ ¯ t ε and linearization the balance of energy yields ρ¯ ∂t ε +

ρ¯s ρ¯n p¯ s p¯ n (ε¯s + − ε¯n − )∇ · Vs + p¯ ∇ · v = 0. ρ¯ ρ¯s ρ¯n

(14.18)

Since p ρ˙ = −ρ p∇ · n then by (9.85) we have  ρ ε˙ + p∇ · v = ρθη˙ + ρ α μα ω˙ α . Moreover ρ ω˙ s = −∇ · (ρs us ) = −∇ ·

ρs ρn Vs . ρ

Hence it follows ρ ε˙ + p∇ · v = ρθη˙ + ρ(μs − μn )ω˙ s = ρθη˙ − ∇ · whence

ρs ρn (μs − μn )Vs ρ

¯ t η − ρ¯s ρ¯n (μ¯ s − μ¯ n )∇ · Vs . ρ¯ ∂t ε + p¯ ∇ · v = ρ¯θ∂ ρ¯

Substitution of ρ¯ ∂t ε + p¯ ∇ · v in the balance of energy (14.18) and the assumption ηs = 0 show that ρ¯s ∂t η − η¯ ∇ · Vs = 0. ρ¯ Now,

14.2 Superfluids

857

∂t η = ∂θ η(θ, p)∂t θ + ∂ p η(θ, p)∂t p,

∂ p η(θ, p) =

1 ∂θ ρ(θ, p). ρ2

The assumption that there is no thermal expansion, ∂θ ρ(θ, p) = 0, and the observation that θ∂θ η(θ, p) is the specific heat at constant pressure, c p , imply that ∂t θ −

ρ¯s θ¯η¯ ∇ · Vs = 0. ρ¯ c p

(14.19)

The divergence of (14.17) and the partial time derivative of (14.19) lead to ∂t2 θ −

ρs θη 2 θ = 0, ρn

(14.20)

where the superposed bar on ρs , ρn , θ, η are understood.  This means that a temperature wave (second sound) propagates with speed c = ρs θη 2 /ρn . Remark 14.1 By (14.17) we might say that the temperature gradient ∇θ acts as a driving force as with the HVBK model. Yet (14.17) involves the velocity difference Vs = vs − vn instead of vs , and ρη∇θ is a force and not a growth. Moreover (14.17) arises because of the assumption μs = μn .

14.2.3 Discontinuity Waves The second sound is a temperature wave. Equation (14.20) arises from (14.17) and (14.19) thus showing that the mechanism underlying second sound is the coupling between temperature and diffusion via ∇θ and Vs , the main physical role being played by the force ρη∇θ in (14.17). Yet we can guess that the observed second sound is generated by the heat flux qn . Hence, we let qn be governed by the (objective) equation 1 (14.21) q` n − Wn qn = − (qn + κn ∇θ), τn > 0, τn where Wn = skwLn . For simplicity we let Tn = − pn 1. The independent variables are θn , ρn , θs , ρs , qn , ∇θ. Assume that ψn = ψn (θn , ρn , qn ),

ψs = ψs (θs , ρs ).

Granted the restrictions pn = ρ2n ∂ρn ψn , ps = ρ2s ∂ρs ψs , ηn = −∂θ ψn , ηs = −∂θ ψs , we find that the entropy inequality simplifies to −ρn ∂qn ψn · (Wn qn −

1 ρn 1 qn ) + ∂qn ψn · κn ∇θ − qn · ∇θ ≥ 0. τn τn θ

858

14 Superconductivity and Superfluidity

The arbitrariness of Wn ∈ Skw and ∇θ imply that ∂qn ψn ⊗ qn ∈ Sym, ∂qn ψn · qn ≥ 0, κn

ρn 1 ∂q ψn = qn . τn n θ

A direct integration of the last equation (for ∂qn ψn ) leads to ψn = ψˆ n (θ, ρn ) +

τn q2 . 2κn ρn θ n

The symmetry of ∂qn ψn ⊗ qn holds. Moreover, substitution of ∂qn ψn results in κn 2 q ≥0 θ n whence κn ≥ 0. Hence the superfluid is governed by the free energy function ψn (θ, ρn , qn ) = ψˆ n (θ, ρn ) +

τn q2 , 2κn ρn θ n

κn > 0.

We now look for discontinuity wave solutions. We let ρn , ρs , vn , vs , θ be continuous functions of x and t, jointly for all x in the space occupied by the body and t ∈ R, whereas ∂t ρn , ∂t ρs , ∇ρn , ∇ρs , ∂t vn , ∂t vs , ∇vn , ∇vs , ∂t θ, ∇θ and all higher order derivatives suffer jump discontinuities across a surface σ but are continuous, jointly in x and t, everywhere else. The external supplies bn , bs , rn and the growths τs , ms , ls are assumed to be continuous across σ. Moreover, the two components are assumed to have the same temperature θ. This makes it redundant the energy equation for the superfluid. Since pn and εn are functions of ρn and θ we have ∇ pn = ∂ρn ∇ρn + ∂θ pn ∇θ and the like for εn . Likewise, ∇ ps = ∂ρs ∇ρs + ∂θ ps ∇θ, while the heat flux qn is governed by (14.21). The state ahead of the wave is taken to be at rest (vn , vs = 0). As is customary, we let [[·]] denote the jump across the wave. By applying the geometrical and kinematical compatibility conditions the balance equations result in9 −u n [[∂x ρn ]] + ρn n · [[∂x vn ]] = 0, −u n [[∂x ρs ]] + ρs n · [[∂x vs ]] = 0, To avoid easy ambiguities among n for normal fluid and n for normal direction, we denote by ∂x , instead of ∂n , the normal derivative; u n is the wave speed.

9

14.2 Superfluids

859

−ρn u n [[∂x vn ]] + n(∂ρn pn [[∂x ρn ]] + ∂θ pn [[∂x θ]]) = 0, −ρs u n [[∂x vs ]] + n(∂ρs ps [[∂x ρs ]] + ∂θ ps [[∂x θ]]) = 0, −ρn u n (∂ρn εn [[∂x ρn ]] + ∂θ εn [[∂x θ]]) + (ρn εn + pn )n · [[∂x vn ]] + n · [[∂x qn ]] = 0. We now look at (14.21). Observe that [[Wn ]] = 21 ([[∂x vn ]] ⊗ n − n ⊗ [[∂x vn ]]) and hence [[Wn qn ]] = 21 [[∂x vn ]] qn · n − 21 ([[∂x vn ]] · qn )n. Moreover, since vn = 0 then [[q` n ]] = [[∂t qn ]] + [[vn · ∇qn ]] = −u n [[∂x qn ]]. Consequently (14.21) leads to 1 κn − u n [[∂x qn ]] − n · qn [[∂x v]] + 21 ([[∂x v]] · qn )n = − [[∂x θ]]n. 2 τn

(14.22)

This shows that the possible waves are affected by the selection of the time derivative, here the Jaumann-like derivative q` n − Wn qn . Yet, the scalar character of the energy equation and the assumption that the normal fluid is inviscid, Tn = − pn 1, allow us to conclude that the possible waves are longitudinal, that is, [[∂n vn ]] and [[∂n vs ]] are parallel to n. For simplicity, assume qn = 0 ahead of the wave. Thus (14.22) implies that a nonzero [[∂x qn ]] is longitudinal, namely,  n. Hence we let [[∂x qn ]] =

κn [[∂x θ]]. u n τn

Likewise it follows that [[∂x vn ]] = [[∂x vn ]]n,

[[∂x vs ]] = [[∂x vs ]]n

where vn = vn · n, vs = vs · n and hence ρn [[∂x vn ]] = u n [[∂x ρn ]],

ρs [[∂x vs ]] = u n [[∂x ρs ]].

We then obtain the following system of equations: ⎡

⎤⎡ ⎤ −u 2n + ∂ρn pn ∂x ρn 0 ∂θ p n ⎣ ⎦ ⎣ ∂x ρs ⎦ = 0. 0 −u 2n + ∂ρs ps ∂θ p s 2 (εn − ρn θ∂ρn ηn )u n 0 −ρn u n ∂θ εn + κn /τn u n ∂x θ

860

14 Superconductivity and Superfluidity

The possible speeds of propagation u n are given by the propagation condition (−u 2n + ∂ρs ps )[(−u 2n + ∂ρn pn )(−ρn u 2n ∂θ εn + κn /τn ) − ∂θ pn (εn − ρ2n θ∂ρn ηn )u 2n ] = 0.

 The immediate solution u n = ∂ρs ps corresponds to a nonzero [[∂x ρs ]] while [[∂x ρn ]] = 0, [[∂x θ]] = 0. The remaining equation (−u 2n + ∂ρn pn )(−ρn u 2n ∂θ εn + κn /τn ) − ∂θ pn (εn − ρ2n θ∂ρn ηn )u 2n = 0 yields the two speeds of other two modes. The modes are decoupled inasmuch as ∂θ pn and ∂θ ps are negligible. In this instance, the speeds un =



∂ρ n p n ,

un =

 κn /ρn τn ∂θ εn

correspond to the nonzero discontinuities [[∂x ρn ]] and [[∂x θ]], respectively.

14.2.4 Mass Fractions and Chemical Potentials The reduced entropy inequality for fluid mixtures holds if the chemical potentials, the mass growth, and the growth of linear momentum satisfy (μs − μn )τs ≤ 0,

1 2 (v 2 n

− vs2 ) + ms · (vs − vn ) ≤ 0.

For any growth τs , the function ms = M(vn − vs ) + 21 τs (vs + vn ) makes the second inequality identically true. The inequality (μs − μn )τs ≤ 0 holds if and only if sgn τs = −sgn (μs − μn ). To determine an appropriate constitutive function for τs , we preliminarily observe that μs = μˆ s (ρs , θ),

μn = μˆ n (ρn , θ).

Let θλ be the threshold temperature; in helium θλ  2.1 K. Restrict attention to the interval 0 < θ ≤ θλ where normal fluid and superfluid coexist. Experiments show that at rest, that is, when vs = vn , the ratio ρn /ρ is an increasing function, say α(θ), subject to α(0) = 0, α(θλ ) = 1. We let μs (ρs , θ) = μ0 (θ) + 21 α(θ)ρs ,

μn (ρn , θ) = μ0 (θ) + 21 (1 − α(θ))ρn .

14.2 Superfluids

861

Hence, we have μs − μn = 21 [α(θ)ρs − (1 − α(θ))ρn ] = 21 ρ[α(θ) − ωn ]. Thus sgn τs = −sgn (μs − μn ) = sgn (ωn − α(θ)). Hence τs > 0 and the mass density of the superfluid increases if ωn > α(θ), and vice versa. If θ → 0 then μs → μ0 (0+ ) and μ(0+ ), since α → 0 and ρn → 0. Likewise, μs = μ0 (θλ ) and μn = μ0 (θλ ) at θ = θλ . Moreover, the physical literature shows that α is well modelled by the function α(θ) = (θ/θλ )δ ,

δ ∈ [1, 1.5].

The thermodynamic restriction on τs and indications of experimental result suggest that we assume γ > 0, τs = −γωs (μs − μn ), so that, upon substitution of μs − μn , we can write τs = −γρωs [α(θ) − (1 − ωs )].

(14.23)

The result (14.23) applies if the two fluids are at rest, namely, they move with the same velocity. The occurrence of a nonzero relative velocity hinders the appearance of the superfluid [49]. To model this feature, appeal is usually made to Landau’s critical velocity of the superfluid, vλ . Let θ∗ be a temperature defined by θ ∗ = υ 2 θλ ,

υ 2 = |vs − vn |2 /|vλ |2 ,

where vλ is Landau’s critical velocity possibly dependent on temperature. It seems more appropriate to think that |vλ |2 is a temperature-dependent parameter; the parameter θ∗ has the dimension of a temperature and depends on the velocities vs , vn via the objective vector vs − vn . Now we formally replace θ in (14.23) by (θ + θ∗ ). Hence it follows τs = −γρωs [α(θ + θ∗ ) − (1 − ωs )]. Since α is an increasing function then the derivative α is positive and hence it follows dτs = −γρωs α < 0. dθ∗ This implies that the supply of superfluid τs decreases, and possibly becomes negative, if θ∗ increases. If |vs − vn | = |vλ | then υ 2 = 1 and θ∗ = θλ . Since α(θ + θλ ) > 1 then τs < 0; at any temperature θ > 0 the occurrence of |vs − vn | = |vλ | forbids the (stable) existence of the superfluid.

862

14 Superconductivity and Superfluidity

In view of the equations of motion, ∂t (ρs vs ) + ∇ · (ρs vs ⊗ vs ) = −∇ ps + ρs g + ms , ∂t (ρn vn ) + ∇ · (ρn vn ⊗ vn ) = −∇ pn + ρn g − ms , we try to describe the characteristic effects of superfluids. As a reasonable assumption, we let the pressures ps , pn be given by constitutive functions of the form ps = ps (ρs , θ),

pn = pn (ρn , θ).

At equilibrium, between superfluid and normal fluid, the fractions ωs and ωn are given by ωn = (1 − ωs ) = α(θ). Now ωs =

ρρs − ρs ρ , ρ2

the prime denoting the derivative with respect to θ. We further assume that ρ is slightly variable, possibly constant, so that sgn ωs = sgn ρs . Hence, we have

sgn ρs = −sgn α .

Along the equilibrium curve, ` η`s = (∂θ ηs + ∂ρs ηs ρs )θ. By the standard thermodynamic restrictions ηs = −∂θ ψs , we find ∂ρs ηs = −

1 ∂θ p s , ρ2s

ps = ρ2s ∂ρs ψs

∂θ ηs =

1 1 ∂θ εs = cv , θ θ

the specific heat cv being assumed to be positive. Upon substitution we can write η`s = (

1 cv ` − 2 ∂θ ps ρs )θ. θ ρs

By the customary assumption that ηs be constant it follows that

14.2 Superfluids

863

∂θ p s =

ρ2s cv . θρs

The negative value of ρs implies that ∂θ ps < 0.

(14.24)

The negative value of ∂θ ps is quite surprising in that we are accustomed to ideal gases where ∂θ p = kρ > 0.

Fountain Effect The motion of the superfluid occurs subject to the force −∂ρs ps ∇ρs − ∂θ ps ∇θ + ρs g + ms . Suppose ∇θ is large enough that −∂θ ps ∇θ is the dominant force. The fountain effect shows that the superfluid moves in the direction of the temperature gradient. This is fully consistent with, and explained by, the condition (14.24). First, if heat is added to the power in the tube then θ increases and ps decreases; the pressure of the superfluid in the tube is lower than that in the bath. The superfluid then flows from the bath to the tube via the fine powder which impedes any flow of the normal fluid. The mass of the liquid in the tube increases and eventually the superfluid squirts out through the fine capillary at the top.

Thermomechanical Effect A flask is filled with liquid helium and a heating coil is placed inside a differential container. When heat is added to the fluid the temperature inside the inner container increases (by θ) and the concentration of the normal fluid increases too because of the phase transition (τs < 0). Moreover, by (14.24), ps decreases. Since ps in the bath is larger than the superfluid flows through the narrow channel from the bath to the heat region. The normal fluid, instead, cannot flow through the channel in view of its viscosity. As a result, an amount of fluid is added to the heated region so that the gravitational pressure ρs gh balances the decrease induced by temperature, ∂θ ps θ. The quantity ρs gh is called thermomechanical pressure head. The height h is given by |∂θ ps | θ. h= ρs g

864

14 Superconductivity and Superfluidity

Mechanocaloric Effect A flask is filled with a fine powder and, above it, liquid helium. The flask has an opening at the bottom. The superfluid component flows through the fine powder and hence the concentration of normal fluid increases above the powder. The temperature of the fluid inside the flask increases, which is sensed by a thermometer placed in the fluid. The temperature increase is due to the fact that energy is conserved in that the superfluid has no thermal energy and hence no energy flows outside the flask. The conserved energy is then distributed over a smaller quantity of liquid helium thus resulting in a higher temperature.

Rollin (or Creeping) Effect The liquid helium exhibits a property of clinging to the walls of the container. To show this effect, we consider a test tube filled with liquid helium. When the test tube is lowered into a liquid helium bath, a fluid film clings to the tube and gradually fills the tube. If, instead, the tube is raised above the bath level it empties out. The experiments show that in these films the capillary forces dominate the gravity and viscous forces. Moreover, if the rate of flow is independent of the height of flow, it increases with drop in temperature and vanishes at lambda point. To understand the phenomenon we first observe that, at the outset, when the container is just filled with helium, a force arises from the interaction between helium atoms and molecules of the container’s wall (Van der Waals force); this draws an helium film upwards, parallel to the wall. Gravity is then overcome. The absence of viscosity favours the flow and hence we can think that the flow is made of the superfluid component. Although the superfluid component alone quits the container, ωs cannot become zero in that the equilibrium at the existing temperature forces the transition from normal fluid to superfluid. When the test tube is lowered into the bath then both flows (inward and outward) occur and the inward flow prevails because of pressure and temperature of the bath. Lower temperatures favour the flow due to the greater fraction of superfluid.

14.2.5 Rotation of Superfluids Suppose that, at θ > θλ , a cylindrical container is made to rotate fast around the cylindrical axis with angular velocity10 ω. The liquid helium, inside the container, rotates accordingly. If the liquid is cooled below θλ and then the container is stopped then the liquid continues to rotate with angular momentum

To avoid ambiguities, the angular velocity is denoted by the vector ω; the scalars ωs , ωn still denote the mass fractions.

10

14.2 Superfluids

865

L = ωs (θ)I ω, where I is the moment of inertia around the cylindrical axis. This means that, upon cooling the temperature to θ < θλ , a fraction ωs (θ) of superfluid appears and the observed rotation can be explained as though the fraction s of the superfluid rotates with the given angular velocity whereas the normal component is at rest; by viscosity, the container makes the normal fluid to be at rest. We now let the container rotate slowly,11 at θ > θλ , with angular velocity ω. Of course L = I ω. We now cool the liquid, below θλ , while rotating the container. We observe that L = ωn I ω. The picture in terms of the two fluids is as follows: the normal component keeps to rotate, with moment of inertia ωn I , whereas the superfluid stays at rest as though the superfluid could not assume arbitrarily small values of angular velocity. This feature is known as the Hess–Fairbank effect . This effect is a direct motivation of the Onsager–Feynman quantization relation. The circulation of vs along a superfluid vortex can assume a multiple of Planck’s constant h divided by the mass of the vortex region,

vs · tds = n

h . m

For a uniform flow, we can write that, along a circle with radius R, vs =

n ν, mR

ν being the unit normal to the vortex (plane) and  = h/2π. This means that there is a critical value  ωc = m R2 of the angular velocity |ω|, below which there are no superfluid vortices. In practice, this is the case if |ω| ωc ; it is customary to take |ω| < ωc /2. Hence if ω < ωc /2 the angular momentum is carried by the normal component, L = ωn (θ)I ω. If, instead, |ω| is around nωc then |L| = I [ωn (θ)|ω| + ωs (θ)nωc ].

11

As we see in a moment, slowly means ω < ωc /2.

866

14 Superconductivity and Superfluidity

The angular velocity of the superfluid is kept constant, nωc , for small variations of ω. The number n is called the winding number. For large values of n, |ω| and nωc are almost equal and hence it will appear as the whole liquid is rotating.

14.2.6 The Ginzburg–Landau Theory Following Landau’s view [268], liquid helium can be described as a ground state constituted by a (Bose–Einstein) condensate of helium atoms where excitations move as quasi-particles. As we see in a moment, a complex macroscopic wave function of the condensate allows us to introduce the mass density and velocity of the superfluid component and the energy flux associated with the elementary excitations. Let ψ(x, t) = ϕ(x, t) exp(iχ(x, t)) be a scalar complex-valued function, ϕ, χ being real-valued functions. The starting point is the time-dependent Schrödinger equation, i∂t ψ = −

2 ψ + V (x, t)ψ, 2m

where V is the potential and m is the mass of the particle. For slow variation of ψ at interatomic distances, the Gross–Pitaevskii equation [213, 364] i∂t ψ = −

2 ψ + (μ + V0 ϕ2 )ψ 2m

is considered where μ is the chemical potential and V0 is the potential of the interatomic forces. Now, ψψ ∗ = ϕ2 is usually regarded as the number density n in the condensate state. Since i∂ϕ2 = iψ ∗ ∂t ψ + iψ∂t ψ ∗ , computation of ψ ∗ ∂t ψ, ψ∂t ψ ∗ , and substitution yields ∂ t ϕ2 = i Hence

 ∇ · [ψ ∗ ∇ψ − ψ∇ψ ∗ ]. 2m

j := −i 21 [ψ ∗ ∇ψ − ψ∇ψ ∗ ] = ϕ2 ∇χ

can be viewed as the particle flux vector so that the balance of mass is obtained in the form m∂t ϕ2 = −∇ · j. Since ϕ2 is the number density then mϕ2 is the mass density. The form of j suggests that we define a velocity

14.2 Superfluids

867

u :=

1  j = ∇χ. 2 mϕ m

Moreover, it seems natural to establish the equation of motion by taking the force as minus the gradient of the potential. Hence, neglecting the nonlinear term in ϕ we have [415] m∂t u = −∇(μ + 21 mu2 ), or, since ∇ × u = 0,

m u˙ = −∇μ,

where u˙ = ∂t u + (u · ∇)u. Standard models which appeared in the literature arise by identifying u with the velocity of the superfluid component. Moreover, this approach seems to be the motivation of the equation of motion where −∇μ is used as the force acting on the superfluid. With some formal variants, this view is now developed to apply the Ginzburg– Landau theory of superconductivity [194] to superfluidity. We start formally with the complex-valued function ψ = ϕ exp(iχ) and make the identification ω s = ϕ2 ,

ϕ2 ≤ 1.

Next we let ∇ × vs = 0, Further, observe

vs =

1 ∇χ. m

|∇ψ|2 = ∇ψ · ∇ψ ∗ = |∇ϕ|2 + ϕ2 |∇χ|2 .

Equilibrium values of ψ are determined as the stationary points of an appropriate functional. Look at the functional L = 21 β 2 |∇ψ|2 − 21 α(θ)|ψ|2 + 41 |ψ|4 whence L = 21 β 2 |∇ϕ|2 + 21 ϕ2 [β 2 |∇χ|2 − α(θ)] + 41 ϕ4 . The corresponding Euler–Lagrange equations 0 = δϕ L = −∂ϕ L − ∇ · (∂∇ϕ L), δχ L = −∂χ L − ∇ · (∂∇χ L)

868

14 Superconductivity and Superfluidity

are the Ginzburg–Landau equations ϕ(β 2 |∇χ|2 − α(θ) + ϕ2 ) − β 2 ϕ = 0, β 2 ∇ · (ϕ2 ∇χ) = 0. These equations are examined on the assumptions that ϕ is uniform, ∇ϕ = 0, and the (velocity) vector v = β∇χ is constant. Hence ∇ · (ϕ2 ∇χ) = 0 holds identically while the other equation reduces to ϕ[ϕ2 − α(θ) + v 2 ] = 0, where v = |v|. Look for solutions ϕ(θ, v). If v 2 ≥ α(θ) then only ϕ(θ, v) = 0 is a real-valued solution; this means that the superfluid cannot occur. If, instead, v 2 < α(θ) then three solutions hold, ϕ1 = 0,

 ϕ2,3 = ± α(θ) − v 2 .

For definiteness, let α(θ) = 1 − (θ/θλ )3/2 . Hence ϕ2,3 hold if 0 < θ < θλ and  ϕ2,3 = ± 1 − (θ/θλ )3/2 − v 2 ,

v 2 < 1 − (θ/θλ )3/2 .

Consequently, in addition to ϕ = 0, the number density ϕ2 = 1 − (θ/θλ )3/2 − v 2 > 0 is allowed if v 2 < 1 − (θ/θλ )3/2 .

Chapter 15

Ferroics

Ferroics is the general framework for ferroelectricity and ferromagnetism. Ferroelectrics are materials where the polarization exhibits nonlinear behaviour and hysteresis. They may have a strong permanent polarization but only if the temperature lies below a characteristic temperature, called the Curie temperature. In this chapter a nonlinear approach to ferroelectrics is investigated and then a general setting is established for the modelling of hysteretic effects. The leading idea is that the entropy production is, per se, a non-negative function here involving the time derivative of the electric field. Also with reference to the literature, the Ginzburg–Landau–Devonshire (GLD) theory is examined to describe the transition between linear paraelectric properties and hysteretic behaviours. In essence, the GLD theory characterizes the transition in terms of an order parameter; for the paraelectric–ferroelectric transition the order parameter is the (scalar) polarization. Next a nonlinear theory is developed for materials where the magnetization exhibits nonlinear behaviour and hysteresis. The leading idea is still to consider the entropy production as a non-negative function, here involving the time derivative of the magnetic field. As with ferroelectrics, the models account for the Curie temperature as a limit temperature for hysteretic effects.

15.1 Evolution Function and Hysteretic Effects in Ferroelectrics One-dimensional models of ferroelectrics are investigated [103, 188, 433, 435] where the time derivative P˙ of the polarization P is given by ˆ ˙ P˙ = P(P, E, E).

© The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_15

869

870

15 Ferroics

A special subclass is represented by the Duhem model1 ˙ P˙ = g(P, E) E˙ + f (P, E)| E|.

(15.1)

Consistent with (15.1), here we follow the view that both the electric field and the electric polarization are among the independent variables. This view is related to the fact that an hysteresis loop is a closed curve surrounding a region in the P − E plane or that P(E) would not be a single-valued function. Moreover, for the sake of generality, the deformation of the material and the dependence on the temperature are allowed to occur. We assume M and J are zero. The entropy inequality takes the form ˙ + ρE · π˙ + T · L − 1 q · ∇θ + θ∇ · k ≥ 0, −ρ(ψ˙ + η θ) θ where π = P/ρ is the electric polarization density, per unit mass. In terms of the free energy φ = ψ − E · π = ε − θη − E · π the entropy inequality can be written as ˙ − ρπ · E˙ + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. − ρ(φ˙ + η θ) θ

(15.2)

The ferroelectric hysteretic behaviour involves both the electric field E and the electric polarization density π. For generality, the temperature θ and the deformation gradient F are taken to affect the response of the material and, by analogy with the physical literature that incorporates the nearest neighbours in the GLD model [384], the dependence on the gradient of the polarization density, ∇π, is also considered [404]. The dependence on π˙ is allowed so that a consistent approach is obtained for the dependence on ∇π while a dependence on E˙ is in order to account for hysteretic effects and a dependence on the stretching tensor2 D describes dissipative ˙ π) effects. Hence we might take F, E, θ, ∇θ, π, ∇π, D , E, ˙ as the set of independent variables for the constitutive functions φ, η, T, q, and k. However by the principle of objectivity the function φ has to be invariant under Euclidean transformations. Hence φ should depend only on Euclidean invariants. For definiteness, in addition to θ we ˙ as appropriate P · ∇P P, ∇θ · ∇θ, E˙ , P take C = FT F, E = FT E, P = F−1 P and ∇P invariant variables; φ and η should depend on scalars involving the set of variables3 ˙ ). P, ∇θ, D , E˙ , P  = (θ, C, E , P , ∇P

1

Many models of this kind are developed also within the ferromagnetic context depending on the choice of the characteristic functions g and f [433]. 2 In this chapter D is the stretching tensor, not to be confused with the electric displacement D = 0 E + P. 3 We might use as a variable the referential gradient ∇ P rather than ∇P P, and the like for ∇R θ and R ˙ and then a simpler expression of the stress tensor would arise. ∇θ. In that case (∇R P )˙ = ∇R P

15.1 Evolution Function and Hysteretic Effects in Ferroelectrics

871

Two remarks are in order. First, the invariance in S O(3) holds for both FT E and F−1 E and the like for P; this common property is due algebraically to the fact that for rotation tensors R−1 = R T . Secondly, E and P are the fields at the frame locally at rest which gives a clear physical meaning to the constitutive equations. Computation of φ˙ and substitution in (15.2) result in ˙P − ρ∂ φ · ∇θ ˙ ˙ − ρ∂∇P ˙ − ρ∂E φ · E˙ − ρ∂P φ · P − ρ(∂θ φ + η)θ˙ − ρ∂C φ · C P φ · ∇P ∇θ ˙ + T · L − P · E˙ − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ∂D φ · D θ

(15.3) ˙ θ˙ implies that ˙ , ∇θ, The arbitrariness of D ∂D φ = 0,

∂∇θ φ = 0,

η = −∂θ φ.

˙ = 2FT D F and Observe that C ˙ = (E ⊗ P) · L + P · E. ˙ P · E˙ = (F−1 P) · (F˙ T E + FT E) Hence substituting ρπ · E˙ with P · E˙ − (E ⊗ P) · L and dividing by θ it follows from (15.3) that 1 1 1 ˙ (T − 2ρF∂C φFT + E ⊗ P) · L − (ρ∂E φ + P ) · E˙ − ρ∂P φ · P θ θ θ ρ 1 T ˙ P) − 2 q · ∇θ + ∇ · k ≥ 0. − ∂∇P P φ · (∇ P − L ∇P θ θ Consider the identities ρ    ρ ρ ˙ ˙ ˙ ∂∇P (∂∇P P φ · ∇P = ∇ · P φ)P − P ∇ · ( ∂∇P P φ) , θ θ θ T T P) ≡ ∂xh P K ∂∂x j P K φL h j = [∇P P(∂∇P ∂∇P P φ · (L ∇P P φ) ] · L

For formal convenience let T P(∂∇P T :=T−2ρF∂C φFT +ρ∇P P φ) + E ⊗ P,

ρ ρ δP = ∂P φ − ∇ · ( ∂∇P P φ), θ θ

where δP is the standard variational derivative. Since φ is independent of D it is natural to let T depend on D or to view T as the dissipative stress. Upon obvious rearrangements we obtain 1 1 1 ˙ − 1 q · ∇θ + ∇ · (k − ρ ∂∇P ˙ P) · E˙ − ρδP φ · P T · L− (ρ∂E φ+P P φP ) ≥ 0. θ θ θ θ2 θ (15.4)

872

15 Ferroics

Hence we let k=

ρ ˙ ∂∇P P φP . θ

The arbitrariness of W = skwL implies T P(∂∇P T + ρ∇P P φ) + E ⊗ P ∈ Sym.

P then it follows that If φ is independent of ∇P skwT = skw(P ⊗ E), that is the classical relation arising from the balance of angular momentum. Back to inequality (15.4) we can write ˙ − 1 q · ∇θ ≥ 0. T · D − (ρ∂E φ + P ) · E˙ − ρδP φ · P θ Letting D = 0 and ∇θ = 0 we have ˙ ≤ 0. (ρ∂E φ + P ) · E˙ + ρδP φ · P

(15.5)

The whole inequality holds if 1 T · D − q · ∇θ ≥ 0. θ

(15.6)

Inequality (15.6) holds in particular with the Navier–Stokes–Fourier scheme. This in turn means that T P(∂∇P T = 2ρF∂C φFT − ρ∇P P φ) − E ⊗ P + T ; T P(∂∇P the stress T is the sum of an electro-elastic part, 2ρF∂C φFT − ρ∇P P φ) − E ⊗ T P, and a dissipative part subject to (15.6). We now examine some consequences of (15.5). First, though the values E and P ˙ cannot be independent of are taken as independent variables the derivatives E˙ and P ˙ can be taken arbitrarily then (15.5) implies each other. If E˙ and P

P, ρ∂E φ = −P

δP φ = 0.

−1 Let ∂∇P P φ = 0. Then δP φ = ∂P φ. Observe that P /ρ = F π. Further, φ depends on E and π only through E and P , respectively. Hence we find

∂E φ = F∂E φ = −π, If φ is a C 2 function then it follows

∂π φ = ρF−T ∂P φ = 0.

15.1 Evolution Function and Hysteretic Effects in Ferroelectrics

∂π ∂E φ = −1,

873

∂E ∂π φ = 0,

which is a contradiction. The contradiction is removed if π and E are not simulta˙ are mutually independent. neously independent variables when E˙ and P By the definition of entropy production we can replace inequality (15.5) with ˙ = −θσ P E , (ρ∂E φ + P ) · E˙ + ρδP φ · P

(15.7)

where σ P E ≥ 0 is the entropy production determined by the evolution of P , E when D = 0, ∇θ = 0. We can then view (15.7) as the evolution equation for E , P . Indeed, by the representation formula (A.7) for vectors with a given longitudinal part, and arbitrary transverse part, we can write ˙ = (P ˙ · e)e + (1 − e ⊗ e)w, P w being an arbitrary vector. Here we let e = δP φ/|δP φ| and use (15.7) to obtain ˙ · e)e = (P ˙ · ρδP φ) δP φ = −[(ρ∂E φ + P ) · E˙ + θσ P E ] δP φ . (P ρ|δP φ|2 ρ|δP φ| Hence (A.7) yields ˙ = −[(ρ∂E φ + P ) · E˙ + θσ P E ] δP φ + (1 − δP φ ⊗ δP φ )w P ρ|δP φ|2 |δP φ| |δP φ|

(15.8)

where w is any vector function of the set of variables . This is the most general (nonlinear) evolution equation for P consistent with the second-law inequality. Instead the simplest model happens when σ P E = 0 and w = 0. In this case (15.8) reduces to ˙ = −[(ρ∂E φ + P ) · E˙ ] δP φ . P ρ|δP φ|2 An intermediate model follows by letting w = 0 and P = −ρ∂E φ. Thus Eq. (15.8) simplifies to ˙ = − θσ P E δP φ. P ρ|δP φ|2 This result may be phrased by saying that δP φ = 0 expresses the equilibrium condition of the system and hence the deviation from equilibrium governs the evolution of the polarization π. This simpler approach leads to a Ginzburg–Landau evolution equation for the polarization which includes the Ginzburg–Landau–Devonshire model as a particular case and is close to the scheme employed in ferromagnetism for the magnetization dynamics [38, 155]. The resulting constitutive theory is enough to account for some features of hysteresis but is inadequate for a full description of hysteretic phenomena (major and minor loops, for instance) where the evolution of

874

15 Ferroics

P on a time interval depends on the evolution of E as well as on the initial values of E and P . ˆ ˙ A simple but fruitful model is obtained by letting ∂∇P P φ = 0 and θσ P E = ξ|E | so that Eq. (15.7) takes the form ˙ = −ξ| ˆ E˙ |. (ρ∂E φ + P ) · E˙ + ρ∂P φ · P

(15.9)

This equation is now investigated in a one-dimensional setting.

15.1.1

One-Dimensional Models of Ferroelectric Hysteresis

With reference to (15.9) we now let F be constant and hence ρ is constant too. Since ˙ = F−1 P˙ and ∂E φ = F−1 ∂E φ, ∂P φ = FT ∂P φ then it follows that ˙ P E˙ = FT E, ˆ T E|, ˙ (∂E  + P) · E˙ + ∂P  · P˙ = −ξ|F

(15.10)

where  = ρφ. Moreover assume E, P are collinear and directed along a principal direction of F. Thus we denote by E, P the components of E, P in their common direction and Eq. (15.10) simplifies to ˙ ≤ 0. (P + ∂ E ) E˙ + ∂ P  P˙ = −ξ| E|

(15.11)

Equation (15.11) is invariant under the time transformation t → c t,

c > 0,

and hence any associated model is rate-independent, if so is the function ξ. It is a consequence of (15.11) that, along any closed curve C, in the E-P plane, as t ∈ [0, T ], T

T

T

0

0

0

˙ dt = ∫[P E˙ + ] ˙ dt = ∫ P E˙ dt; 0 ≥ ∫[(P + ∂ E ) E˙ + ∂ P  P]

(15.12)

this implies that the closed curve C is run in the counterclockwise sense. Except at stationary points where E˙ = 0, we can divide Eq. (15.11) by E˙ to obtain ξ P˙ P + ∂E  ˙ − sgn E. =− ˙ ∂  ∂ E P P

(15.13)

Let θ be constant. If ξ depends on E˙ at most through sgn E˙ then the right-hand side ˙ E˙ is the derivative of P with respect of (15.13) is piecewise a function of P, E and P/ to E, d P/d E, the slope of the E − P curve. Let

15.1 Evolution Function and Hysteretic Effects in Ferroelectrics

χ1 = −

P + ∂E  , ∂P 

χ2 = −

ξ . ∂P 

875

(15.14)

Hence  and χ1 are functions of E and P while ξ is so far to be determined. The polarization P is then given by the differential equation dP ˙ = χ1 + χ2 sgn E. dE

(15.15)

In paraelectric materials a dielectric polarization occurs when an electric field is applied to the material and the material looses the polarization when the electric field is removed. If ξ = 0, and hence χ2 = 0, the model (15.15) describes a paraelectric behaviour where χ1 represents the positive slope of the polarization curve that is (o times) the differential susceptibility. We then require that χ1 > 0,

|χ2 | ≤ χ1 .

(15.16)

Unlike the counterclockwise property (15.12), inequalities (15.16) are not a consequence of the second law; they are assumed as physically sound restrictions so that the polarization be a monotone function of the electric field. We now look for explicit representations of χ1 and χ2 . Since the hysteretic behaviour occurs below an appropriate temperature θC , called Curie temperature, we let ξ = γ(E, P)h(θC − θ). We refer to γ(E, P) as the hysteretic function. We might identify h with the Heaviside function H . Rather, if hysteretic effects expressed by ξ have to become smaller and smaller as θ approaches θC from below, then it is more appropriate to identify h with the positive part of 1 − θ/θC , namely h(θC − θ) = max{0, 1 − θ/θC }. This identification is consistent with the feature that, as θ → θC from below, all loops shrink to lines as is the case in the paraelectric behaviour. While χ1 is fully determined by the potential , χ2 depends also on ξ and hence many different models can be established by using the same energy potential . The function χ2 = −ξ/∂ P  is related to the entropy production (l.h.s. of (15.11)) and hence describes the dissipative and hysteretic properties of the material. To determine χ1 and χ2 and hence the model of ferroelectric materials, we begin with the assumption that there is a characteristic non-decreasing function P such that P = P(E)



χ2 = 0,

(15.17)

the dependence on the parameter (temperature) θ being understood and not written. We refer to P as the anhysteretic function and to the corresponding graph

876

15 Ferroics

{(E, P(E)) : E ∈ R} as the anhysteretic curve [231]. Indeed, by (15.11) we might observe that (15.18) (P + ∂ E ) E˙ + ∂ P  P˙ = 0 is the thermodynamic condition corresponding to ξ = 0. Hence dP P + ∂E  =− = χ1 dE ∂P  represents the non-hysteretic behaviour and χ1 can be viewed as the anhysteretic differential susceptibility. This in turn suggests that we let χ1 be a non-negative function of E only, namely (15.19) χ1 = g(E) ≥ 0. Finally, the required counterclockwise property (15.12) of ferroelectric cycles suggests that ⎧ ⎪> 0 if P < P(E), ⎨ (15.20) χ2 (E, P) = 0 if P = P(E), ⎪ ⎩ < 0 if P > P(E).

The Hysteretic Gibbs Potential We now select the Gibbs potential  in the general form (E, P) = L(P − G(E)) + F(E) − P E,

(15.21)

L, G, and F being so far undetermined differentiable functions, possibly dependent on θ. Thus  := ρψ = L + F is the Helmholtz potential. Upon substitution of ∂ P  and ∂ E  in (15.14) we obtain L (P − G(E))G (E) − F , L (P − G(E)) − E ξ χ2 = − . L (P − G(E)) − E

χ1 =

By (15.19) it follows L (P − G(E))G (E) − F = g(E)[L (P − G(E)) − E], which implies

G = g,

F = Eg(E).

15.1 Evolution Function and Hysteretic Effects in Ferroelectrics

877

Since χ1 is assumed to be positive then G is a monotone increasing function. In order to determine L, we observe that the simplest way to satisfy (15.20) is to let sgn χ2 (E, P) = −sgn [P − P(E)]. Hence ξ ≥ 0 implies that −sgn [P − P(E)] = sgn χ2 = −sgn [L − E] whence

P − P(E) ≥ 0. − G(E)) − E

L (P Thus we take

P − P(E) 1 = > 0, L (P − G(E)) − E α

α being possibly parameterized by θ. It follows that L (P − G(E)) = α[P − G(E)],

P(E) = G(E) +

1 E. α

Summarizing all these results we conclude that 1. to within an additive constant (possibly dependent on θ) the expression (15.21) of  only depends on g and α as follows (E, P) = 21 α(P − G(E))2 + F(E) − P E, where

E

G(E) = ∫ g(y) dy, 0

E

F(E) = ∫ y g(y) dy, 0

2. the characteristic functions χ1 and χ2 are then given by χ1 = g(E),

χ2 = −

ξ , α[P − P(E)]

where the anhysteretic function takes the form E

P(E) = ∫[g(y) + 1/α] dy 0

and g(E) = P (E) − 1/α.

878

15 Ferroics

The Hysteretic Function γ To determine an explicit representation of χ2 , χ2 =

γ(E, P)h(θC − θ) , α(P − P)

we observe that the only thermodynamic requirement on γ is the non-negative valuedness. The anhysteretic susceptibility g, the positive parameter α, and the hysteretic function γ characterize the model of the material. With this in mind we now examine the hysteretic properties associated with a given  and four different functions γ. I. γ(E, P) = γ0 (E)|P − P(E)| where γ0 = β[1 − λ exp(−σ|E|)] and β, σ > 0, λ ∈ (0, 1). Let βh = βh(θ − θC ). It follows from (15.15) that βh dP ˙ = g(E) − [1 − λ exp(−σ|E|)]sgn [(P − P) E]. dE α

(15.22)

This model is characterized by the positive parameters α, β, λ, σ and the anhysteretic function P. Since h ≥ 0 then inequalities (15.16) reduce to g(E) ≥

βh (1 − λ) ≥ 0. α

To show the hysteretic features we now let E oscillate in time with angular frequency ω. Let E be the amplitude of the oscillations, E(t) = E sin ωt. Hysteresis loops in the (E, P)-plane are obtained by determining P(t) such that βh 1 ˙ [1 − λ exp(−σ|E|)]sgn [(P − P)]| E|, P˙ = [P (E) − ] E˙ − α α

E˙ = ωE cos ωt.

This is done by starting from different initial states (E 0 , P0 ). Since the model is rateindependent then ω may be chosen arbitrarily. To check the saturation condition, namely d P/d E → 0 as |E| → ∞, we observe that (P − P) E˙ < 0



lim

|E|→∞

βh dP βh = lim g(E) + ≥ . |E|→∞ dE α α

Hence, even if g → 0 as |E| → ∞ the saturation property may occur (approximately) only if βh /α 1, namely if α is constant and θ ≈ θC . II. γ(E, P) = γ0 (P)|P − P(E)|, where γ0 = β sgn [σ − |P|] and β, σ > 0.

15.1 Evolution Function and Hysteretic Effects in Ferroelectrics

879

Fig. 15.1 Model II: hysteresis loops (solid) and anhysteretic curve (dashed) assuming E = 1, 3 and starting from (E 0 , P0 ) with E 0 = 0, P0 = −1, −0.4, 0.5

This model allows for the saturation of P at large values of |E|. With this purpose, restrict attention to the strip |P| ≤ σ where γ0 > 0. By (15.15) it follows that βh dP ˙ = g(E) − sgn [(P − P)(σ − |P|) E], dE α

(15.23)

where βh = βh(θC − θ) and g = P − 1/α. Again the hysteretic behaviour is modelled via P and the parameters α, β, σ. Starting from different initial states (E 0 , P0 ) in the (E, P)-plane, hysteresis cycles are obtained by solving the equation βh 1 ˙ P˙ = [P (E) − ] E˙ − sgn [(P − P(E))(σ − |P|)]| E|, α α

E˙ = ωE cos ωt.

In Fig. 15.1 cyclic processes with large amplitudes (major loops reaching saturation) and small amplitudes (minor loops) are depicted assuming P(E) = k E and α = βh = 1/2, k = σ = 3. As is apparent, here the anhysteretic curve is given by  P=

P(E) ±σ

if |P(E)| < σ, if P(E) = ±σ,

and then the saturation property is achieved (see also [37]). III. γ(E, P) = γ0 [P − P(E)]2 , γ0 > 0, where g(E) = P (E) − 1/α. Let γh (θ) = γ0 h(θ − θC ). The derivative d P/d E is then given by

880

15 Ferroics

Fig. 15.2 Model III: hysteresis loops (solid), anhysteretic curve P (dashed) and graph of g (short dashed) with E = 1, 2.6. The initial state E 0 , P0 has E 0 = 0 and P0 = −0.2, 0.2

γh dP ˙ = g(E) − [P − P(E)] sgn E. dE α

(15.24)

This is a very simple model. Since g = P − 1/α, it is characterized merely by the positive ratio γ0 /α and the anhysteretic function P. Hysteresis cycles are obtained by solving γh 1 ˙ P˙ = [P (E) − ] E˙ − [P − P(E)]| E|, α α

E˙ = ωE cos ωt.

In Fig. 15.2 cyclic processes with large and small amplitudes are depicted assuming ⎧ 1 ⎪ ⎨ 2 (E − 1) if x < −1, P(E) = E if − 1 ≤ x ≤ 1, ⎪ ⎩1 (E + 1) if x > 1, 2 and α = 5, γh = 5/2. According to Eq. (1.5) in [103], the condition ∞

∫ [ f (ζ) − g(ζ)] exp(−τ ζ) dζ = E

1 exp(−τ E) ≥ 0, ατ

E ∈ [0, ∞)

is both necessary and sufficient for the validity of the assertion that the major loop has a counterclockwise orientation and is the boundary of the set of (attainable) states, i.e. the set of pairs (E, P) accessible, by appropriate variation of E, from the state

15.1 Evolution Function and Hysteretic Effects in Ferroelectrics

881

Fig. 15.3 Model IV: hysteresis loops (solid), anhysteretic curve P (dashed) and graph of g (short dashed) with E = 0.4, 1.4. The initial state E 0 , P0 has E 0 = 0 and P0 = −0.1, 0, 0.1

in which both E and P are zero. In addition, it follows from Eqs. (2.37) and (2.38) of [103] that the relation f ≥ 0 (hence, g ≥ 1/α) is necessary and sufficient for d P/d E to be non-negative. Finally, we stress that the relation f − g ≥ 0 (cfr. [103], Eqs. (1.6) and (3.18)), which ensures that “the energy expended in a complete traversal of a primitive loop is never negative”, is not enough to ensure thermodynamic compatibility. Indeed, previous arguments prove that a stronger condition is required in order to establish an energy potential , namely f − g =  for some  = 1/α > 0. IV. γ(E, P) = γ0 g(E)[P − P(E)]2 , γ0 > 0, where g(E) = P (E) − 1/α. Letting γh (θ) = γ0 h(θC − θ), from (15.15) it follows that γh dP ˙ = g(E)[1 − τh (P − P(E)) sgn E], > 0. τh = dE α The vanishing of g as |E| approaches infinity is a way of modelling the saturation property of hysteresis. Hysteresis cycles are given by ˙ P˙ = g(E)[ E˙ − τh (P − f (E))| E|],

E˙ = ωE cos ωt,

starting from different initial states (E 0 , P0 ). In Fig. 15.3 we use P(E) = 1.5(tanh 2 E + E) and α = 1.5, τh = 1. Then g(E) = 3/ cosh2 2E.

882

15 Ferroics

In summary, Models I and II involve hysteretic functions γ which are proportional to |P − P(E)|. In particular, Model I shows ‘bilinear’ hysteresis loops in connection with a linear anhysteretic function P, whereas Model II accounts for the appearance of the saturation regime, too. In Model III and IV the hysteretic function γ is proportional to [P − P(E)]2 . Model III shows hysteresis loops associated with a piecewise linear anhysteretic curve, whereas Model IV involves a nonlinear function P of the Langevin type. In addition, this model enjoys the saturation condition. The essence of the present approach to the modelling of ferroelectric hysteresis is that the thermodynamic inequality makes the relation between P˙ and E˙ to consist of two parts. One is fully characterized by the free energy. The other one is related to the dissipative character of the hysteresis. Models compatible with thermodynamics can then be determined by appropriate selections of the free energy and of the dissipative part. While the anhysteretic function is fully determined by the free energy potential, the dissipative part of the model, characterized by χ2 , involves both the free energy influence, via the denominator ∂ P , and the non-negative purely-thermodynamic term ξ. For any free energy  then there are a family of dissipative parts characterized by the chosen function ξ.

Relation to Other Approaches Among other schemes of ferroelectric materials, it is worth outlining an approach [404] based on the use of configurational forces [178, 179, 215]. The key assumption is that there is a micro-force system which consists of a vector stress ξ and scalar body forces π (internal) and γ (external). This system of forces is assumed to be subject to the micro-force balance ∫ ξ · n da + ∫ (π + γ)dv = 0, Pt

∂Pt

for each part Pt of the current configuration. Hence, for smooth functions ξ, π, γ it follows that ∇ · ξ + π + γ = 0. Moreover the power of these forces associated with the change of the order parameter, say ρ, is assumed to be [178, 179, 215] ∫ ξ · n ρ˙ da + ∫ (π + γ)ρ˙ dv Pt

∂Pt

and the second law is taken in the form of a dissipation inequality d dt



Pt

ψV dv ≤

∂Pt

ξ · n ρ˙ da +

ψV being the free energy per unit volume.

Pt

γ ρ˙ dv,

15.2 Ginzburg–Landau–Devonshire Theory

883

Following a similar view, Su and Landis [404] consider a micro-force tensor ξ and micro-force vectors π (internal) and γ (external), subject to the equilibrium condition ∫ ξn da + ∫ (π + γ)dv = 0, ∇ · ξ + π + γ = 0, Pt

∂Pt

and assume that the corresponding power associated with the change of the polarization P is ∫ (ξn) · P˙ da + ∫ (π + γ) · P˙ dv. Pt

∂Pt

The second law is eventually taken to be expressed by the inequality ˙ − π · P, ˙ + ξ · ∇P ˙ ψ˙ V ≤ σ · ε˙ + E · D ˙ is identified with ∇ P. ˙ Next upon where σ is the Cauchy stress and, presumably, ∇P the assumption that the independent variables are ε, D, P, ∇P, P˙ it is claimed that ∂P˙ ψV = 0, and

σ = ∂ε ψ V ,

E = ∂D ψ V ,

ξ = ∂∇P ψV

(π + ∂P ψV ) · P˙ ≤ 0.

This inequality is satisfied by letting ˙ π = −∂P ψV − β P, where β is a positive definite tensor. Hence, in view of the equilibrium condition for micro-forces, it follows that ˙ ∇ · ∂∇P ψV − ∂P ψV + γ = β P.

15.2 Ginzburg–Landau–Devonshire Theory Landau theory is a symmetry-based analysis of equilibrium near a phase transition [184]. Landau’s symmetry-based theory was first applied to ferroelectricity by Ginzburg and next by Devonshire [86]. Here we just outline this Ginzburg–Landau– Devonshire (GLD) theory both in the isotropic case (that is appropriate for systems with spatially uniform polarization) and in connection with special anisotropic mate-

884

15 Ferroics

rials (transversely isotropic materials and materials whose lattice has cubic symmetry). Look at (12.17)1 and assume ∂∇ R  ψ = 0. Hence in stationary conditions E = F−T ∂ ψ. Since =

1 J F−1 P, ρR

∂P ψ = ∂ ψ ∂P  =

1 −T F ∂ ψ ρ

then E = ρ ∂P ψ. The effective electric field Eeff may be thought of as the sum of several contributions due to different phenomena. We assume that Eeff = E + Eint + Ean ,

(15.25)

where E is the external applied field, Eint is the electric field due to interaction of the polarization field with itself and Ean is due to anisotropic polarization (which strongly depends on the particle shape). By analogy with ferromagnetism we let Eint = −D π ,

π = P/ρ ,

where D is a symmetric objective tensor. In absence of an external field, ferroelectric bodies tend to be polarized along precise directions which are called easy directions. This is mostly due to the depolarizing effect of spin-lattice coupling in crystals. Therefore, we take this phenomenon into account by adding the field Ean = −A π , where A is a symmetric, positive semi-definite and objective tensor depending on the shape of the material and on its electric anisotropy. Hence, Eeff = E − Lπ ,

L = D + A;

(15.26)

the positive semi-definite tensor L may be referred to as generalized depolarizing tensor. This representation of Eeff leads to the following explicit expression of the free energy density ψ = 0 (C, θ) + 1 (P, θ), where 1 (P, θ) = 21 P · G(P, θ)P,

(15.27)

15.2 Ginzburg–Landau–Devonshire Theory

885

and G(P, θ) is a objective symmetric tensor-valued function such that G(P, θ) + 21 P · ∂P G(P, θ) = L(P, θ). The two main categories of ferroelectric materials are those undergoing a secondorder transition, like triglycine sulphate, and those undergoing a first-order transition, like BaTiO3 and other perovskites. Following are corresponding expressions of the free energy density.

15.2.1 Landau Free Energy Second-order temperature-induced ferroelectric transitions are usually described by taking advantage of the fourth-order Landau–Ginzburg theory. If this is the case then D and A are assumed to be quadratic even functions of P and hence   θ − θC + βP2 1, D(P, θ) = α θC

(15.28)

where θC is the Curie temperature and α, β are positive constants. Anisotropic properties of materials are characterized by their symmetry group. The largest symmetry group in three-dimensional space includes all orthogonal transformations, is denoted by O(3) and is referred to as isotropic. In that case A = 0.

(15.29)

By virtue of (15.27) the free energy density reads ψ = 0 +

α θ − θC 2 β 4 P + P . 2 θC 4

In stationary conditions we find that   E = ρ∂P ψ = ρ β P2 − α∗ (θ) P,

α∗ (θ) = α

θC − θ . θC

The polarization P is parallel or antiparallel to E depending on the sign of the quantity in square brackets. If θ > θC then P is parallel to E. Moreover, in the linear approximation θC , P = 0 χE, χ= αρ(θ − θC ) the susceptibility χ being inversely proportional to θ − θC ; in these conditions the material is said to be paraelectric. The inverse proportionality of χ on θ − θC is said

886

15 Ferroics

Fig. 15.4 Plot of the pitchfork bifurcation diagram with β = α = 1

to represent the Curie–Weiss law. The critical temperature θC is the ferroelectric Curie temperature or the Curie point. For large values of E, and hence of P, so that α∗ βP2 , we have P  (βρE2 )−1/3 E Hence |P| increases slowly as |E|1/3 . Yet, in the actual ferroelectrets P approaches a saturation value for large values of |E|. This means that this simple scheme is inadequate to model the saturation. If θ < θC then the dependence of P on E is more involved.  When the applied external field vanishes, E = 0, the stationary condition reads β P2 − α∗ (θ) P = 0 so that we find P = 0 and infinitely many pairs of nonzero spontaneous solutions, P = ±Ps (θ)e,

Ps (θ) :=



α∗ (θ)/β,

where e is a generic unit vector and Ps (θ) represents the spontaneous polarization modulus in the range 0 < θ < θC . In addition, letting P∗ = Ps (0) =



α/β,

we conclude that 0 ≤ Ps (θ) < P∗ . The function Ps is decreasing on (0, θC ). This suggests that we assume ϕ = P/P∗ as the dimensionless order parameter to describe second-order ferroelectric transitions so that the free energy takes the form ψ = 0 +

β P∗4 θ − θC 2 1 4 ϕ + ϕ . 2 θC 2

In the (θ, P) plane the curve consisting of the two branches P = ±Ps (θ) describes a super-critical pitchfork bifurcation (Fig. 15.4). This means that the spontaneous polarization varies continuously across the critical temperature. We assume that, for a nonzero external field E, P and E have a common direction; denote by P and E the pertinent components. Then we write

15.2 Ginzburg–Landau–Devonshire Theory

887

Fig. 15.5 The perturbed pitchfork bifurcation diagram with β = α = 1 and E = 0.05

  E = ρ β P 2 − α∗ (θ) P,

α∗ (θ) = α

θC − θ . θC

For any given value of E, the corresponding curve in the (θ, P) plane consists of three branches and describes a perturbed super-critical pitchfork bifurcation (Fig. 15.5). The slope of the upper branch grows continuously from zero as the temperature drops below θC and this means that a second-order transition occurs.4 In particular, the differential susceptibility χd (P, θ) varies according to the slope ∂E P =

1 . ρ(3β P 2 − α∗ )

When θ > θC , Ps (θ) is negligible (Fig. 15.5) and we get the Curie–Weiss law χd (θ) := χd (Ps (θ), θ) ≈ −

θC C 1 = = , ∗ ρ α (θ) ρ α(θ − θC ) θ − θC

C=

θC . ρα

When 0 < θ < θC , we have Ps2 (θ) = α∗ (θ)/β so that χd (θ) =

C 1 = . ∗ 2ρ α (θ) 2(θC − θ)

The plot of χd (θ) is given in Fig. 15.6. In addition, χd → ∞

4

as

P2 →

α(θC − θ) 1 = Ps2 (θ). 3βθC 3

In a second-order phase transition the order parameter grows continuously from zero as the temperature drops below θC , [10], Chap. 2).

888

15 Ferroics

Fig. 15.6 The Landau differential susceptibility

Anisotropic Landau Free Energy Though the isotropic Landau energy is widely applied and easy to compute, most ferroelectric materials are anisotropic. For instance, anisotropy in ferroelectrics is due to spin-lattice coupling in cubic crystals (as the perovskite system) and transversely isotropic symmetry in piezoelectric thin films. In the former case the group consisting of all proper rotations leaving invariant a cubic lattice is the rotational octahedral symmetry group. If, in addition, we allow reflections, we arrive at the so-called octahedral symmetry group which is denoted by Oh . Basically, when cubic anisotropy occurs then three privileged mutually orthogonal material directions exist. On the other case, a transversely isotropic material is characterized by symmetry with respect to one selected material direction referred to as principal direction. Properties of a transversely isotropic material remain unchanged by rotations about, and reflections from the planes orthogonal or parallel to, this direction. For the sake of definiteness, in the sequel we give the Landau free energy expressions related to octahedral and transversely isotropic symmetries. Let (i1 , i2 , i3 ) be a principal material direction basis and S j = i j ⊗ i j , j = 1, 2, 3,

be the related structural tensors so that P = 3j=1 P j i j and S j P = P j i j . After performing a suitable change of coordinates (in the frame locally at rest with the body), A can be expressed in diagonal form with respect to this basis. Then we assume A = A0 + A1 H2 (P), where A0 =

3

j=1  j S j ,

A1 =

3

j=1 λ j S j ,

H(P) =

3

j=1 P j S j .

Coefficients  j and λ j , j = 1, 2, 3, characterize the anisotropy of the body. Since Snj = S j , n ∈ N, and S j Sh = 0 when j = h, it follows Hn (P) = By (15.28) we have

3

n j=1 P j S j

15.2 Ginzburg–Landau–Devonshire Theory

889

Fig. 15.7 Plot of the fourth-order energy density 1 = 1 with transversely isotropic symmetry: , λ < 0 (left) and , λ > 0 (right)

θ − θC

L(P, θ)P = (D + A)P = α + β P 2 P + 3j=1 P j ( j + λ j P j2 )i j . θC Upon replacing this expression into (15.27)1 we obtain 1 (P, θ) =

1 2

θ − θC 2 3

α P + j=1  j P j2 + 41 β P 4 + 3j=1 λ j P j4 . θC

If the material is transversely isotropic (it has one easy direction, say e3 ) then 1 = 2 = , 3 = 0 and λ1 = λ2 = λ, λ3 = 0. Hence, 1 =

1 2

θ − θC 2 α P + (P12 + P22 ) + 41 β P 4 + λ(P14 + P24 ) . θC

Provided that θ < θC , the energy density 1 takes nontrivial minima along either the easy direction, if , λ < 0, or the easy plane, if , λ > 0 (Fig. 15.7). When the material lattice has cubic symmetry, then 1 = 2 = 3 = 0 and λ1 = λ2 = λ3 = λ. By introducing proper constants 1 α θ − θC β+λ λ , α12 = − , , α11 = α1 = − α∗ (θ) = 2 2 θC 4 2 Equation (15.27) becomes the customary expression for the cubic system5 O3 1 = α1 P 2 + α11 P 4 + α12 (P12 P22 + P22 P32 + P12 P32 ).

(15.30)

As for (15.30), we observe that α12 is negative, α11 is positive provided that λ > −β and the sign of α1 depends on θ. Then, below the Curie temperature, θ < θC , 1 takes nontrivial minima along the principal material directions provided that 5

See, for instance, [78, 133] and in particular [77], Tables 1 and 2.

890

15 Ferroics

Fig. 15.8 Plot of the fourth-order energy density 1 = 1 with cubic symmetry: λ > 0 (left) and λ < 0 (right)

λ > 0, on the contrary it takes maxima along the same directions when −β < λ < 0 (see Fig. 15.8).

15.2.2 Landau–Devonshire Free Energy According to the Landau-Ginzburg–Devonshire theory, to describe first-order phasetransitions in ferroelectric materials like BaTiO3 and other perovskites a sixth-order approximation of the free energy density is required [133]. With this goal in mind, we assume that D and A are fourth-order even functions of P, whose coefficients possibly depend on θ. In particular, we let   θ − θC + βP2 + γP4 I, D(P, θ) = α θC

(15.31)

where α, γ are positive constants, whereas β ∈ R. If the material is isotropic then (15.29) holds. By virtue of (15.27) and (15.31) the free energy density reads  = 0 + 21 α

θ − θC 2 1 4 1 6 P + 4 βP + 6 γP . θC

When the applied external field vanishes, E = 0, from the stationary condition it follows   θ − θC 0 = ∂P ψ = ρ α + βP2 + γP4 P. θC Now, we separately discuss the cases β ≥ 0 and β < 0.

15.2 Ginzburg–Landau–Devonshire Theory

891

• β ≥ 0. It is easy to check that P = 0 is the unique solution at equilibrium when θ ≥ θC . Otherwise, letting Y = P2 we look for positive solutions of γY 2 + βY − α∗ (θ) = 0, We obtain Y+ (θ) =

−β +

α∗ (θ) = α

θC − θ > 0. θC

β 2 + 4α∗ (θ)γ > 0. 2γ

and then there exist infinitely many pairs of nonvanishing stationary solutions,  P = ±Ps (θ)e,

Ps (θ) :=

−β +



β 2 + 4α∗ (θ)γ 2γ

where e is a generic unit vector and Ps (θ) represents the spontaneous polarization modulus at 0 < θ < θC . In addition, letting  P∗ = Ps (0) =

−β +



β 2 + 4αγ , 2γ

we infer that 0 ≤ Ps (θ) < P∗ , Ps being a decreasing function on (0, θC ]. If this is the case, we conclude that at θ = θC occurs a pitchfork bifurcation, as well as in the case of the fourth-order Landau free energy density. Namely, when β ≥ 0 the properties of the model remain unchanged whether γ > 0 or γ = 0. • β < 0. As in the previous case, P = 0 is an equilibrium solution at any temperature. In addition, letting Y = P2 we look for positive solutions of γY 2 − |β|Y − α∗ (θ) = 0,

α∗ (θ) = α

θC − θ . θC

When 0 < θ ≤ θC then α∗ (θ) ≥ 0 and we obtain only one positive value, Y+ (θ) =

|β| +

β 2 + 4α∗ (θ)γ . 2γ

Hence there exist infinitely many pairs of nonvanishing stationary solutions,  P = ±Ps (θ)e,

Ps (θ) :=

|β| +



β 2 + 4α∗ (θ)γ 2γ

where e is a generic unit vector and Ps (θ) represents the spontaneous polarization modulus at θ < θC . In particular we let

892

15 Ferroics

Fig. 15.9 Plot of the Devonshire bifurcation diagram with α = 1, β = −2, γ = 3, and θC∗ /θC = 4/3

 P∗ = Ps (0) =

|β| +

β 2 + 4αγ , 2γ

On the other hand, when θ > θC then α∗ (θ) < 0 and we get two positive values, Y+ (θ) =

|β| +



β 2 − 4|α∗ (θ)|γ , 2γ

After defining

Y− (θ) =

|β| −

β 2 − 4|α∗ (θ)|γ . 2γ

  β2 θC∗ = 1 + θC > θC , 4αγ

we can also write |β| ± Y± (θ) = 2γ

α(θC∗ − θ) , √ γθC

 Ps± (θ)

:=

|β| ± 2γ

α(θC∗ − θ) . √ γθC

Hence, when θC ≤ θ ≤ θC∗ there exist infinitely many quadruples of nonvanishing stationary solutions, P = ±Ps+ (θ)e, P = ±Ps− (θ)e, where e is a generic unit vector. We observe that the (stable) null solution suffers a jump bifurcation at θ = θC as temperature decreases, whereas the stable branches collapse to 0 at θ = θC∗ as temperature increases. The corresponding diagram is given in Fig. 15.9. We assume that, when the applied external field E does not vanish, P and E have a common direction and consider the pertinent components, P, E. Then we write   E = ρ γ P 3 + β P 2 − α∗ (θ) P,

α∗ (θ) = α

θC − θ . θC

When β < 0 the corresponding curve in the (θ, P)-plane is given by Fig. 15.10.

15.2 Ginzburg–Landau–Devonshire Theory

893

Fig. 15.10 The Devonshire perturbed bifurcation diagram with α = 1, β = −2, γ = 3, and E = 0.05

Fig. 15.11 The Devonshire differential susceptibility: θC∗ /θC = 4/3

In particular, the differential susceptibility χd (P, θ) varies according to the slope ∂E P =

1 . ρ[4γ P 4 + 3β P 2 − α∗ (θ)]

When θ > θC∗ , Ps (θ) is negligible (see Figs. 15.8 and 15.9) and we obtain χd (θ) := χd (Ps (θ), θ) ≈ −

θC C 1 = = , ∗ ρ α (θ) ρ α(θ − θC ) θ − θC

C=

θC . ρα

On the other hand, when 0 < θ < θC∗ , Ps (θ) ≈ P∗ and since β < 0 we have χd (θ) ≈ where

C 1 = , ρ[4γ P∗4 − 3|β|P∗2 − α∗ (θ)] θ + θ0

θ0 β 2 + |β| β 2 + 4αγ 4 2 + 3α > 0. = 4γ P∗ − 3|β|P∗ − α = θC 2γ

The plot of χd (θ) is given in Fig. 15.11. Because of the finite jump at the critical temperature, we conclude that a first-order phase transition occurs. In addition, χd → ∞

as

3|β| ± P → 8γ 2



α(θC∗ − θ) . √ 2 γθC

894

15 Ferroics

The Anisotropic Case Hereafter we construct A in order to account a material lattice with octahedral symmetries. We assume A = A0 + A1 H2 (P) + A2 P 2 H2 (P) + A3 H4 (P), where A0 =

3

j=1  j S j ,

A1 =

3

j=1 λ j S j ,

A2 =

3

j=1 μ j S j ,

A3 =

3

j=1 ν j S j .

The constants  j , λ j , μ j , j = 1, 2, 3, characterize the anisotropy of the body. Since L = D + A, from (15.31) θ − θC

L(P, θ)P = α + β P 2 + γ P 4 P + 3j=1 P j ( j + λ j P j2 + μ j P 2 P j2 + ν j P j4 )i j . θC

Upon substitution of this expression into (15.27)1 we have θ − θC 2 3 α P + j=1  j P j2 θC 4 3 

 1 + 4 β P + j=1 λ j P j4 + 16 γ P 6 + 3j=1 μ j P 2 P j4 + ν j P j6 .

1 (P, θ) =

1 2

When the material lattice has cubic symmetry we have λ j = λ, μ j = μ, ν j = ν and  j = 0, j = 1, 2, 3. In terms of the constants θ − θC , α11 = 41 (β + λ), α12 = − 21 λ, θC = 16 (γ + μ + ν), α112 = − 16 (2μ + 3ν), α123 = −(μ + ν),

α1 = 21 α α111

we then obtain (see, e.g. [77], Table 1) 1 = α1 P 2 + α11 P 4 + α12 (P12 P22 + P22 P32 + P12 P32 ) + α111 P 6 + α112 [P14 (P22 + P32 ) + P24 (P12 + P32 ) + P34 (P12 + P22 )] + α123 P12 P22 P32 . We observe that α111 is positive provided that μ + ν > −γ, whereas the sign of α112 and α123 only depend on the values of μ and ν (see Fig. 15.12).

15.3 Ferromagnetism

895

Fig. 15.12 Plot of the sixth-order energy density 1 = 1 with cubic symmetry: λ > 0, β + λ < 0 and 2μ + 3ν > 0 (left) and 2μ + 3ν < 0 (right)

15.3 Ferromagnetism The characteristic property of ferromagnetism is that, below a certain temperature called the Curie temperature, θC , the material can possess a spontaneous magnetization in the absence of an applied magnetic field. Upon application of a weak magnetic field, the magnetization increases rapidly to a high value called the saturation magnetization, which is in general a function of temperature. Physically this behaviour is viewed as the consequence of the alignment of atomic magnetic moments in response to the applied magnetic field. That is why various approaches to ferromagnetism are based on a micromagnetic picture of the points of the body. The combined effect of the atomic magnetic moments can give rise to a relatively large magnetization for a given applied field. Above the Curie temperature, a ferromagnetic substance behaves as paramagnetic in that its differential susceptibility χ is roughly constant and approaches the Curie–Weiss law (Sect. 15.3.3), namely χ is inversely proportional to θ − θC . Some materials having atoms with unequal moments exhibit a special form of ferromagnetism below the Curie temperature called ferrimagnetism. Upon the assumption that M and H have a common fixed alignment, the evolution equation was first set in the form6 M˙ = α0 μ0 H − α1 (θ − θC )M + α2 M 3 + α3 M, where α0 , α1 , α2 are positive parameters. This non-isothermal model accounts for the transition, at the Curie temperature θC , between the paramagnetic and the ferromagnetic behaviours. Yet the corresponding magnetization curve

6

See, e.g. [274] p. 39, [260].

896

15 Ferroics

H = [a(θ − θC ) + bM 2 ]M, where a = α1 /α0 μ0 , b = α2 /α0 μ0 , shows that sgn H = ±sgn M depending on the sign of a(θ − θC ) + bM 2 . If θ > θC then H and M are parallel. Moreover, if M is small, so that bM 2 a(θ − θC ), then the magnetic susceptibility χ M is roughly constant, 1/a χM = , M  χ M H, θ − θC consistent with the ferromagnetic Curie–Weiss law. If, instead, θ < θC then H = 0 at M = 0 and a M 2 = (θC − θ). b As M 2 < (a/b)(θC − θ) we have sgn M = −sgn H . The value [(a/b)(θC − θ)]1/2 is the spontaneous magnetization. As with ferroelectricity, this curve does not allow for saturation. In 1907 Weiss developed a theory of ferromagnetic domains structure known as mean field theory. In essence he arranged the Langevin potential in order to describe the non-isothermal paramagnetic-ferromagnetic transition and to account for the saturation phenomena [256]. Weiss assumed that in ferromagnetic materials each dipole (elementary magnet) is subject not only to the applied field H but also to a field Hmol , called molecular field by Weiss, such that Hmol = Nw M, Nw being a material constant. Thus the effective magnetic field would be He = H + Nw M. The classical relation between M and H , obtained by statistical argument, is M = Msat L(m H/kθ), where m is the absolute value of the magnetic moment, k is the Boltzmann constant and L is the Langevin function, L(x) = coth x −

1 , x

x ∈ R.

By the Weiss assumption the relation becomes M = Msat L[m(H + Nw M)/kθ]. The Langevin function is increasing and odd and moreover L(x) → ±1 as x → ±∞. As a consequence the inverse L−1 maps (−1, 1) into R. By the Weiss assumption we can then write

15.3 Ferromagnetism

897

H + βM = L−1 (M/Msat ), μθ where Msat is the saturation value, so that M ∈ (−Msat , Msat ), β = Nw , and μ = k/m, k being the Boltzmann constant. Hence H is given by H = μθL−1 (M/Msat ) − β M.

(15.32)

For formal convenience we let ξ = M/Msat and then H = H(ξ),

H(ξ) := μθL−1 (ξ) − β Msat ξ,

If θ → 0+ then

ξ ∈ (−1, 1).

H = −β M,

which is the classical law of diamagnetism or paramagnetism according as β > 0 or β < 0. Experimental evidence shows that there is a transition temperature θC for paramagnetic-ferromagnetic behaviour such that H (0) > 0 as θ > θC ,

H (0) ∈ (−β Msat , 0) as θ < θC .

Consistently, it is natural to let μθC (L−1 ) (0) − β Msat = 0.

(15.33)

This requirement induces a relation among β, μ, and θC . To this end let z=

H + βM μθ

and observe (L−1 ) (ξ) =

1 L (z(ξ))

.

Since L−1 (0) = 0 then z(0) = L−1 (0) = 0. Moreover L (0) = 1/3 and then (L−1 ) (0) = Hence Eq. (15.33) yields μ=

1 L (0)

= 3.

β Msat , 3θC

which relates the parameter μ to the transition temperature θC . Upon substitution in (15.32) we can write

898

15 Ferroics

1 u + 1 −1 L (ξ) − ξ, H= β Msat 3

u :=

θ − θC , θC

ξ=

M . Msat

15.3.1 Thermodynamic Approach to Magnetic Hysteresis The interaction between electromagnetic fields and thermal and electric conduction is developed in Sect. 12.6; here we restrict attention to non-conducting (J = 0, q = 0) and non-polarizable (P = 0) solids. Ferromagnetic hysteretic effects are described ˙ M. ˙ In a three-dimensional by letting the pertinent properties depend on M, H, H, setting of deformable bodies we need to account for the time evolution in a objective way, i.e. via objective derivatives. We assume that H and M are vectors, in S O(3), and hence say that M = F−1 M H = FT H, are invariant vectors.7 Denote by m the magnetic moment per unit mass and hence M = ρm. Consider the balance of energy (2.75), whence ˙ + T · L + ρr, ρε˙ = μ0 ρH · m and the entropy inequality ρη˙ −

ρr = σ ≥ 0. θ

Letting φ = ε − θη − μ0 H · m we can write the entropy inequality in the form ˙ − μ0 M · H ˙ + T · L = θσ ≥ 0. − ρ(φ˙ + η θ)

(15.34)

Observe that ˙ = (H ⊗ M) · L + M · H. ˙ M · H˙ = (F−1 M) · (FT LT H + FT H) ˙ = M · H˙ − (H ⊗ M) · L in (15.34) leads to Substitution of M · H ˙ − μ0M · H˙ + (T + μ0 H ⊗ M) · L = θσ. − ρ(φ˙ + η θ)

(15.35)

˙ M ˙ as independent variThe constitutive properties are based on θ, F, H, M, H, ables; H and M are fields relative to the frame locally at rest with the body. Since φ, η, σ are required to be Euclidean invariants then we assume that φ is a (differentiable) function of 7

We recall that FT H = H R is the reference (material) intensity.

15.3 Ferromagnetism

899

˙)  = (θ, C, H , M , D , H˙ , M where the stretching tensor D has to occur via invariant terms. We compute φ˙ and substitute in (15.35) to find ¨ ˙ ˙ − ρ∂H˙ φ · H¨ − ρ∂M −ρ(∂θ φ + η)θ˙ − ρ∂C φ · C ˙ φ · M − ρ∂D φ · D ˙ + (T + μ0 H ⊗ M) · L = θσ ≥ 0. −(ρ∂H φ + μ0M ) · H˙ − ρ∂M φ · M ¨ ,D ˙ , θ˙ occur linearly. Now H¨ = F¨ T H + 2F˙ T H ˙ + FT H ¨ and the The quantities H¨ , M 2 2 ¨ . Maxwell’s equations involve ∂ H and ∂ M in ∇ × ∂t E = −μ0 (∂ 2 H + like for M t t t ∂t2 M). Since ∂t E can be taken arbitrarily then we may ascribe arbitrary values to ∂t2 H and ∂t2 M; the corresponding value of ∇ × ∂t E satisfies Maxwell’s equation and meanwhile does not affect the other terms in the inequality. Since the arbitrariness ˙ and θ˙ we conclude that holds also for D ∂H˙ φ = 0,

∂M ˙ φ = 0,

∂D φ = 0,

η = −∂θ φ.

˙ = 2FT D F then the remaining inequality can be written in the form Since C ˙ = θσ ≥ 0. [T − 2ρF∂C φFT + μ0 H ⊗ M] · L − (ρ∂H φ + μ0M ) · H˙ − ρ∂M φ · M (15.36) The arbitrariness of W = skwL implies T + μ0 H ⊗ M ∈ Sym,

skwT = skw[μ0 M ⊗ H],

which coincides with the classical condition arising from the balance of angular momentum. Since φ is independent of D then we set D = 0 in (15.36) to obtain ˙, 0 ≤ θσ M H = −(ρ∂H φ + μ0M ) · H˙ − ρ∂M φ · M

(15.37)

where σ M H is the value of σ for any  with D = 0. Consequently [T − 2ρF∂C φFT + μ0 H ⊗ M] · D = θσT , σT := σ − σ M H . Let

ˆ T(θ, F, H, M) := 2ρF∂C φFT − μ0 H ⊗ M,

T := T − Tˆ

and then T (θ, F, H, M, D ) is subject to T · D = θσT . Mathematically we might assume T · D ≥ −θσ M H ; in that case inequality (15.36) holds with negative values of T · D . More naturally, we assume

900

15 Ferroics

T · D = θσT ≥ 0; the classical viscous terms are examples of σT > 0. We now investigate inequality (15.37). If M is not an independent variable then ∂M φ = 0 and inequality (15.37) simplifies to 0 ≤ θσ M H = −(ρ∂H φ + μ0M ) · H˙ , where σ M H is a non-negative function of . The magnetization M is then given by a constitutive function. First let M be independent of H˙ . The right-hand side is then linear in H˙ and hence can take positive and negative values if ρ R ∂H φ + μ0M = 0. Consequently the inequality holds if and only if σ M H = 0,

μ0M = −ρ∂H φ.

(15.38)

Since φ(θ, C, H ) = φ(θ, C, FT H) then ∂H φ = F∂H φ,

∂H φ = F−1 ∂H φ.

Replacing M = F−1 M in (15.38) we obtain the expected (spatial) relation μ0 M = −ρ∂H φ. If instead M is allowed to depend on H˙ then letting M = ρ∂H φ + μ0M we have M · H˙ = θσ M H ≥ 0,

μ0M = −ρ∂H φ + M .

We now let both M and H be independent variables with ∂M φ = 0, ∂H φ = 0. ˙ in terms of the inner product ρ∂M φ · Equation (15.37) can be used to represent M ˙ ˙ M = −θσ M H − (ρ∂H φ + μ0M ) · H . By means of (A.7), letting e = ∂M φ/|∂M φ| we have ˙ ] ∂M φ + (1 − ∂M φ ⊗ ∂M φ )w, ˙ = −[θσ M H + (ρ∂H φ + μ0 M) · H M ρ|∂M φ|2 |∂M φ| |∂M φ| w being any vector dependent on θ, F, H, M, D , H˙ . If σ M H = 0 then Eq. (15.37) reads ˙ = 0. (ρ∂H φ + μ0M ) · H˙ + ρ∂M φ · M ˙ is required to equal −(ρ∂H φ + μ0M ) · H˙ . By the repThe inner product ρ∂M φ · M ˙ and e = ∂M φ/|∂M φ| and observing that resentation formula (A.7), letting u = M ˙ ˙ · e = − (ρ∂H φ + μ0M ) · H M |ρ∂M φ|

15.3 Ferromagnetism

901

˙ in the form we can represent M ˙ 1 ˙ = − (ρ∂H φ + μ0M ) · H ∂M φ + (1 − ∂M φ ⊗ ∂M φ)w, M 2 |ρ∂M φ| |∂M φ|2 where w is any vector function of (θ, F, H, M, D , H˙ ). If w = GH˙ then we can write ˙ 1 ˙ = − (ρ∂H φ + μ0M ) · H ∂M φ + (1 − ∂M φ ⊗ GT ∂M φ)H˙ . M |ρ∂M φ|2 |∂M φ|2

(15.39)

Subject to the assumption G is a function of θ, F, H, M, Eq. (15.39) is quite a general rate-type relation for M . ˙ be among the independent variables and As a further scheme we may let M , M moreover let (15.37) hold with σ M H = 0. For definiteness we assume θσ M H = ν H |H˙ |,

ν H = ν H (θ, C, H , M ) ≥ 0.

We now show how this scheme is appropriate to the modelling of hysteresis.

15.3.2 One-Dimensional Models of Ferromagnetic Hysteresis ˙ = F−1 M ˙ M ˙ We now let F be constant and hence ρ is constant too. Since H˙ = FT H, −1 T and ∂H φ = F ∂H φ, ∂M φ = F ∂M φ then it follows from (15.37) that ˙ ˙ + ∂M φ · M ˙ = −ν H |FT H|. (ρ∂H φ + μ0 M) · H

(15.40)

Moreover assume H, M are collinear and directed along a principal direction of F. Thus we denote by H, M the components of H, M in their common direction. Equation (15.37) then simplifies to (∂ H φ + μM) H˙ + ∂ M φ M˙ = −ν| H˙ |,

(15.41)

where μ = μ0 /ρ and ν = ν H |F|/ρ is a positive-valued function of θ, F, H, M. The temperature θ and the deformation gradient F are considered as known parameters. The approach developed here parallels the one in Sect. 15.1.1 for ferroelectric hysteresis. Integration of (15.41) along any closed curve in the H − M plane, as t ∈ [t1 , t2 ], results in t2

t2

t2

t1

t1

t1

˙ ˙ = ∫[μM H˙ + φ]dt = μ ∫ M H˙ dt; 0 ≥ ∫[(∂ H φ + μM) H˙ + ∂ M φ M]dt this implies that the closed curve is run in the counterclockwise sense.

902

15 Ferroics

Except at inversion points (where H˙ = 0), by (15.41) we have M˙ ν ∂ H φ + μM − sgn H˙ . =− ˙ ∂M φ ∂M φ H

(15.42)

If ν depends on H˙ at most through sgn H˙ then the right-hand side of (15.42) is ˙ H˙ equals the derivative of M with respect piecewise a function of M, H and then M/ to H , d M/d H , the slope of the H − M curve. Letting χ1 = −

∂ H φ + μM , ∂M φ

χ2 = −

ν ∂M φ

(15.43)

we can write the differential equation dM = χ1 + χ2 sgn H˙ dH

(15.44)

for the unknown M in terms of H . If χ2 = 0 then χ1 represents the slope of the curve M(H ) of a paramagnetic substance; possibly the slope is not constant and depends on the values of M and H . We can then view χ1 as the positive, differential, magnetic susceptibility. We then require that χ1 > 0,

|χ2 | ≤ χ1 .

The second term χ2 sgn H˙ describes hysteretic effects in that the slope changes depending on the sign of H˙ . These inequalities look physically sound assumptions but are not required by thermodynamics. To account for the Curie temperature θC , above which hysteresis does not occur, we let h(θC − θ) = max{0, 1 − θ/θC } and assume ν = γ(H, M) h(θC − θ). Since ν ≥ 0 then γ(H, M) is required to be non-negative. As we see in a while, γ(H, M) governs the hysteretic properties; that is why it is referred to as the hysteretic function. By definition, χ1 is fully determined by the Gibbs potential φ whereas χ2 depends also on ν = γh. Hence different models are obtained by using the same function φ. The function χ2 accounts for the entropy production and hence affects both the dissipative and the hysteretic properties of the material. ˙ | H˙ | and hence is invariant under the time Equation (15.41) is linear in H˙ , M, transformation t → ct, with c > 0. Any model, characterized by φ and γ, is then rate-independent.

15.3 Ferromagnetism

903

We now establish some general features that serve as guidelines for the elaboration of ferromagnetic models. First we assume that a function M = M(H ) determines M when χ2 = 0. As such, the function M is free from hysteretic effects; it is then referred to as anhysteretic function and we say that {(H, M(H )) : H ∈ R} is the anhysteretic curve. Consequently dM = χ1 dH is the anhysteretic, differential, magnetic susceptibility; moreover  we let χ1 be a function of H . To obtain the counterclockwise property given by M H˙ dt ≤ 0 we may require that χ2 satisfies ⎧ ⎨ > 0 if M < M(H ), χ2 (H, M) = 0 if M = M(H ), ⎩ < 0 if M > M(H ).

(15.45)

To select the Gibbs potential φ we look for functions in the form φ(M, H ) = (M − (H )) + ϒ(H ) − μM H ; the functions , , ϒ are so far undetermined, differentiable functions of H, M, parameterized by θ and F. We observe that since φ = ψ − μM H then  + ϒ is just the Helmholtz free energy ψ. Upon substitution of ∂ H φ and ∂ M φ we obtain χ1 (H ) =

 (M − (H )) (H ) − ϒ (H ) ,  (M − (H )) − μH

χ2 = −

 (M

γh . − (H )) − μH

The first condition of (15.43) results in [ (M − (H )) − μH ]χ1 (H ) =  (M − (H )) (H ) − ϒ (H ) holds if

= χ1 ,

ϒ = μH χ1 (H ).

Since χ1 > 0 then is a monotone increasing function of H . To determine  we observe that the requirement (15.45) is obeyed if χ2 (H, M)[M − M(H )] ≤ 0. Since ν = γh ≥ 0 it follows that

904

15 Ferroics

M − M(H ) ≥ 0. − (H )) − μH

 (M

This condition in turn is satisfied if, for any positive parameter α, possibly dependent on F and θ, M − M(H ) 1 =  (M − (H )) − μH α whence  (M − (H )) = α[M − (H )],

M(H ) = (H ) +

1 μH. α

Hence  = 21 α[M − (H )]2 , to within an additive constant possibly dependent on F and θ. To sum up, thermodynamically-consistent models of one-dimensional ferromagnets are obtained by letting φ(M, H ) = 21 α[M − (H )]2 + ϒ(H ) − μM H, where

H

ϒ(H ) = μ ∫ y χ1 (y)dy,

γ(H, M)h(θC − θ) , α(M(H ) − M)

M(H ) = ∫ [χ1 (y) + μ/α]dy.

0

while χ2 =

H

(H ) = ∫ χ1 (y)dy,

0

H 0

Hence models are characterized by the (anhysteretic) differential permittivity χ1 (H ) > 0 and the parameter α > 0, and the hysteretic function γ(H, M) ≥ 0.

Soft Iron Models Let γ(H, M) = γ0 [M(H ) − M]2 ,

γ0 > 0.

Define γh (θ) = γ0 h(θC − θ). Hence we express M = d M/d H in the form M = χ1 (H ) + τ [M(H ) − M] sgn H˙ ,

(15.46)

15.3 Ferromagnetism

where

905

χ1 = M −

μ , α

τh =

γh . α

This is a very simple model; it is characterized by the positive ratio γ0 /α and the anhysteretic function M. For definiteness we look for the modelling of the class of ferromagnetic soft materials, particularly those called isoperms, like, e.g. Fe-Ni-Al isoperms or Mn-Zn ferrites. Equation (15.46) is the analogue of a model for soft materials [104, 439]. To get a direct connection with the literature we let g = χ1 ,

f = M,

and write the differential equation in the form dM = g(H ) − τh (M − f (H ))sgn H˙ , dH

τh > 0.

(15.47)

The differential equation (15.47) is consistent with thermodynamics; by (15.46) f and g are related by μ (15.48) g = f − , α where α is an arbitrary positive constant, possibly dependent on θ. As a particular case, we let g be piecewise constant and, correspondingly, f is piecewise linear, as devised in [103, 104]. For instance, assume μ = 1, α > 2, and

M(H ) =

⎧ 1 ⎪ ⎪ ⎨ 2 (H − 1) + 1 if H < −1,



H if − 1 ≤ H ≤ 1, ⎪ ⎪ ⎩ 1 (H + 1) − 1 if H > 1, 2

g(H ) =

1 − 1/α

if − 1 ≤ H ≤ 1,

1 2 − 1/α if |H | > 1,

In this case, hysteresis cycles are obtained by solving the system 

H˙ = ωH cos ωt     M˙ = M (H ) − 1/α H˙ + τh M(H ) − M | H˙ |.

Figure 15.13 shows cyclic processes with large and small amplitudes, corresponding to α = 5 and τh = 0.3. According to [103], the condition ∞

∫ [ f (ζ) − g(ζ)] exp(−τ ζ) dζ = H

1 exp(−τ H ) ≥ 0, ατ

H ∈ [0, ∞)

is both necessary and sufficient to warrant that the major loop has a counterclockwise orientation and is the boundary of the set of (attainable) states, i.e. the set of pairs

906

15 Ferroics

Fig. 15.13 Soft iron hysteresis loops (solid), anhysteretic curve M (dashed) and graph of g (short dashed) with H = 1, 2.5. The initial states are H0 = 0 and M0 = −0.1

(H, M) accessible, by appropriate variation of H , from the state in which both H and M are zero. Remark 15.1 As stated in [103], f ≥ 0, and hence g ≥ 1/α, is necessary and sufficient for d M/d H to be non-negative; the stronger condition f − g ≥ 0 (and hence, 1/α ≥ 0) is needed to ensure that the energy expended in a complete traversal of a primitive loop is non-negative ([103] Eqs. (1.6) and (3.18)). This condition is not enough to ensure thermodynamic consistency. Moreover, the requirement f − g = 1/α >  > 0, in [103], implies that f − g cannot vanish, not even at the limit as |H | goes to infinity and this prevents the model to exhibit the saturation property. A model allowing for the saturation property is obtained as follows. Let γ(H, M) = γ0 g(H )[M(H ) − M]2 , γ0 > 0, g(H ) = M (H ) − 1/α. Hence

dM = g(H )[1 + τh (M(H ) − M) sgn H˙ ]. dH

The vanishing of χ as |H | approaches infinity is a way of modelling the saturation property. Hysteresis cycles are obtained by solving the system

15.3 Ferromagnetism

907

2

M

1

-1

0

1

H

-1

-2

Fig. 15.14 Soft iron hysteresis loops with the saturation property (solid): anhysteretic curve M (dashed) and graph of g (short dashed) with H = 0.4, 1.4. The initial state H0 , M0 has H0 = 0 and M0 = −0.1, 0, 0.1



H˙ = ωH cos ωt     M˙ = g(H ) H˙ + τh M(H ) − M | H˙ | ,

and starting from different initial states (H0 , M0 ). For instance, in Fig. 15.14 we use M(H ) = 1.5(tanh 2H + H ) and α = 1.5, τh = 1. Then g(H ) = 3/(cosh 2H )2 .

15.3.3 Curie–Weiss Law We give a simple physical motivation of the Curie–Weiss law in ferromagnetism. In the physical literature the Weiss field is imagined as the magnetic field H E whose effect would be to line up ionic and atomic magnetic moments so that a paramagnetic substance becomes the corresponding ferromagnetic substance. The orienting effect of the Weiss field is opposed by th thermal agitation. At the Curie temperature θC the interaction energy of the moment m, μ0 m HE is about the thermal energy kθC . Let HE = λM where λ is said to be the Weiss field constant. The magnetization M is taken to be proportional to the magnetic intensity H and inversely proportional to the temperature θ. Thus we consider the whole magnetic intensity H + λM and say that C M = , H + λM θ

χ=

Letting θC = λC we find the Curie–Weiss law

M C = . H θ − λC

908

15 Ferroics

χ=

C . θ − θC

This indicates that the slope of the magnetization curve decreases inasmuch as the temperature θ increases. The singularity at θ = θC is regarded as the indication that a non-zero magnetization occurs without any applied field H .

Chapter 16

Phase Transitions

A phase denotes a region of a body with physically uniform properties. Typical phases are the familiar states of matter, i.e. solid, liquid, and gas; in shape-memory alloys the phases are martensite and austenite. Phase transitions denote transformations of the substance in a phase to another phase. To describe a phase transition we need a physical quantity which governs the transformation; for instance, the temperature or the pressure. The description of phase transitions is realized essentially within two schemes: a sharp interface across which the constitutive properties suffer a sudden change or a diffuse space region where the pertinent fields vary continuously. First the jump conditions are derived for the interface among two phases. Next details are determined for the liquid–vapour transition, thus finding the continuity of the Gibbs free energy at the interface and Clapeyron’s equation. The brine channels formation is also developed. The transition in a finite layer is investigated through the mixture model. As a relevant application the solidification-melting of binary alloys is described. Finally, phase transitions in shape-memory alloy are modelled in terms of an order parameter related to the concentration of the mass fraction of martensite.

16.1 Jump Conditions at Interfaces We first investigate phase transitions by considering different phases, e.g. solid and fluid, separated by a surface of zero thickness. Physical quantities such as density are assumed to be discontinuous across the interface. The conservation laws of mechanics and thermodynamics have their corresponding jump condition across the interface. In this respect we review the jump conditions developed in Sect. 2.17.2 and examine the physical consequences on the coexistence of different phases at opposite sides of the interface (Fig. 16.1). © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0_16

909

910

16 Phase Transitions

Fig. 16.1 The sharp interface σ separates two different phases in the regions Pt+ and Pt−

A time-dependent region Pt is convected by the body at hand. A surface (interface) σ is the common boundary of the two sub-regions Pt− , Pt+ ; the surface σ need not be convected. Denote by m the unit normal vector to σ from Pt− to Pt+ and, for any field f , let [[ f ]] denote the jump f − − f + of f across σ with f − , f + the limit values at the pertinent point from Pt− and Pt+ . We observe that Pt = Pt− ∪ σ ∪ Pt+ and ∂P = ∂Pˇ + ∪ ∂Pˇ − with ∂Pˇ ± = ∂P ± \ σ. t

t

t

t

t

Consider a generic balance equation d ∫ ρφdv = ∫ bdv + ∫ hnda, dt Pt Pt ∂Pt where ρ is the mass density. Observe that, symbolically, ∫ dv = ∫ dv + ∫ dv,

Pt

Pt−

Pt+

∫ da = ∫ da + ∫ da.

∂Pt

ˇ− ∂P t

ˇ+ ∂P t

By the generalized transport theorem we can write d ˙ + ∫(ρφU )− da, ∫ ρφdv = ∫ ρφdv dt P − σ Pt− t

d ˙ − ∫(ρφU )+ da, ∫ ρφdv = ∫ ρφdv dt P + σ Pt+ t

where U = (ν − v) · m while ν is the velocity of the point of σ and v is the limit of the velocity of the point of the body. Summing the two relations we arrive at ˙ + ∫[[ρφU ]]da = ∫ h nda + ∫ h nda. ∫ ρφdv

Pt

σ

ˇ− ∂P t

ˇ+ ∂P t

Assume Pt is a pillbox region around σ and hence ∂Pˇ t− and ∂Pˇ t+ approach σ with n = −m at ∂Pˇ t− and n = m at ∂Pˇ t+ . If ρφ˙ is bounded then the limit as the thickness of the pillbox approaches zero yields

16.1 Jump Conditions at Interfaces

911

∫{[[ρφU ]] + [[h]]m}da = 0. σ

The arbitrariness of σ, which is due to the arbitrariness of Pt , and the continuity of the integrand over σ at both sides imply that [[ρφU ]] + [[h]]m = 0 If, instead, the balance is expressed by an inequality, d ∫ ρφdv ≥ ∫ bdv + ∫ hnda, dt Pt Pt ∂Pt then it follows that [[ρφU ]] + [[h]]m ≥ 0. The balances of mass, linear momentum, and energy are obtained by letting φ = 1, v, ε + 21 v2 ;

h = 0, T, Tv − q,

where T ∈ Sym. Hence we obtain the jump conditions [[ρU ]] = 0,

(16.1)

ρU [[v]] + [[T]]m = 0,

(16.2)

ρU [[ε + 21 v2 ]] + m · [[Tv − q]] = 0.

(16.3)

The quantity ρU represents the relative (to the body) mass flow rate and Eq. (16.1) means that ρU is continuous across σ. Equations (16.2) and (16.3) are expected to be invariant under a change of frame. Let V be the velocity of a new frame of reference and hence let v = u + V. We have ρU [[ 21 v2 ]] + m · [[Tv]] = ρU [[ 21 u2 + u · V]] + m · [[Tu]] + m · [[TV]]. By (16.2) we have ρU [[u]] = −[[T]]m and hence, since T ∈ Sym, ρU [[u · V]] + m · [[TV]] = 0. Consequently (16.3) yields ρU [[ε + 21 u2 ]] + m · [[Tu − q]] = 0, and then it is invariant under a change of frame. In general ρU is a parameter of the transition. If we know, or fix, U then we determine ρ; the invariance of ρU implies

912

16 Phase Transitions

ρ− = ρ+

U+ . U−

Now some consequences of (16.2)–(16.5) are derived on the assumption that the stress tensor is isotropic; we let T = − p1, thus possibly identifying p with the pressure. Entropy inequality and entropy jump The balance of entropy can be given the form ρr d ∫ ρηdv = ∫ dv − ∫ (q/θ + k) · n da + ∫ ργ dv dt Pt Pt θ Pt ∂Pt where γ is the entropy production per unit mass. If k = 0 and η, ˙ r, γ are bounded, and even more if γ = 0, then passing to the limit of zero thickness we have q ρU [[η]] − [[ ]] · m = 0. θ

(16.4)

If, rather, the balance is d ρr q ∫ ρηdv ≥ ∫ dv + ∫ · n da dt Pt Pt θ ∂Pt θ then across the interface

q ρU [[η]] − [[ ]] · m ≥ 0. θ

(16.5)

16.2 Phase Transitions at Sharp Interfaces with T = − p1 Denote by v = v · m and q = q · m the components of v and q along m. Let v = ν + u. Hence we have [[v]] = [[u]],

[[ 21 v2 ]] = ν · [[u]] + [[ 21 u2 ]].

Observe that [[ν]] · m = 0 and hence, by the definition of U = (ν − v) · m, we obtain [[U ]] = −[[v]]. Notice that p [[ pv]] = [[ pν]] − [[ ρ(ν − v)]], ρ

p [[ pv]] · m = [[ p]]ν · m − [[ ]]ρU ρ

16.2 Phase Transitions at Sharp Interfaces with T = − p1

913

If T = − p1 then the jump conditions (16.2), (16.3) can be written in the form ρU [[u]] = [[ p]]m,

p ρU [[ε + 21 u2 ]] − [[ ]]ρU − [[q]] = 0, ρ

whence ρU [[ε +

p 1 2 + 2 u ]] − [[q]] = 0. ρ

(16.6)

(16.7)

Hence to within the kinetic term 21 u2 , relative to the observer at the interface, the balance of energy implies that the jump of q equals ρU times the jump of enthalpy. If U = 0 then the interface is convected by the motion of the body. In this case [[ p]] = 0. The transverse parts of v and q are unrestricted while the longitudinal parts v and q are subject to p[[v]] + [[q]] = 0,

[[q/θ]] ≤ 0.

Now let β = ρU = 0. The jump [[v]] is longitudinal and β[[v]] = [[ p]]. The unknowns [[v]], [[θ]], [[ p]], [[q]], [[η]] occur nonlinearly. Let [[θ]] = 0. Equations (16.6) can be written in the form β[[ε + 21 v2 ]] = [[ pv + q]],

β[[v]] = [[ p]],

β[[θη]] − [[q]] ≥ 0.

If, further, [[ p]] = 0 then [[v]] = 0, [[ pv]] = 0 and β[[ε]] = [[q]],

β[[θη]] − [[q]] ≥ 0.

Consequently β[[ε − θη]] ≤ 0 whence U [[ψ]] ≤ 0.

(16.8)

Thus if [[θ]] = 0, [[ p]] = 0, then U > 0 is allowed if ψ− ≤ ψ+ , namely if the Helmholtz free energy decreases across the interface. Things are more involved if [[θ]] = 0, [[ p]] = 0. For any pair of fields α, w, with α a scalar and w a scalar or a tensor of any order, [[αw]] = α+ [[w]] + [[α]]w+ + [[α]][[w]], [[αw]] = 0

=⇒

[[w]] = −

[[w]] =

[[αw]] − [[α]]w+ , α+ + [[α]]

[[α]]w+ . α+ + [[α]]

Hence [[ρU ]] = 0 implies [[ρ]] = −

ρ+ [[U ]] ρ+ [[v]] = . U+ + [[U ]] U+ − [[v]]

914

16 Phase Transitions

The specific volume v, such that ρv = 1, satisfies [[v]] = −

[[ρ]] . ρ+ (ρ+ + [[ρ]])

It then follows that [[ρ]] = and −

ρ+ [[ p]] , βU+ − [[ p]]

[[v]] = −

[[ p]] β2

(16.9)

[[ p]][[ρ]] 1 . = [[ p]][[v]] = − 2 β ρ+ (ρ+ + [[ρ]]

By (16.9) we have [[ p]][[v]] ≤ 0,

[[ p]][[ρ]] ≥ 0,

and [[ pv]] = ( p+ + [[ p]])[[v]] + v+ [[ p]] = (v+ −

p+ + [[ p]] )[[ p]]. β2

Consider the balance of energy and observe [[ pv]] = ( p+ + [[ p]])

[[ p]] , β

β[[ 21 v2 ]] = β 21 [[v]]2 =

[[ p]]2 , 2β

where, for formal convenience, we have taken v+ = 0. Hence we have β[[ε]] = [[ pv]] − 21 β[[v]]2 + [[q]] = ( p+ + 21 [[ p]])

[[ p]] + [[q]]. β

Consequently the possible values of β satisfy the equation [[ε]]β 2 − [[q]]β − ( p+ + 21 [[ p]])[[ p]] = 0.

(16.10)

The values of ρU so determined are effective properties of the interface if q ρU [[η]] − [[ ]] ≥ 0. θ

(16.11)

16.2.1 The Stefan Model Among the models of phase transitions the Stefan model is well-known as the origin of an initial boundary value problem for a parabolic partial differential equation. The model is claimed to describe a solid–fluid phase transition [177, 434]; due to its

16.3 Liquid–Vapour Transition

915

simplicity it follows from (16.2)–(16.5) as a purely thermal problem. Let [[T]] = 0 and hence [[v]] = 0, [[v2 ]] = 0. If [[v]] = 0 then the requirement [[ρU ]] = 0 results in [[ρ]] = 0, which is reasonable in the ice-water transition. The domain Pt is the interval [0, d]. Let s(t) ∈ [0, d] be the position of the interface and then U = s˙ . Hence Pt− = [0, s), Pt+ = (s, d] are occupied by the fluid and solid phases. Let x ∈ [0, d] and assume Fourier’s law q± = −k± ∂x θ in both phases. While (16.2) holds identically Eqs. (16.3)–(16.5) become ρU [[ε]] + [[k∂x θ]] = 0,

ρU [[η]] + [[(k/θ)∂x θ]] ≥ 0.

The requirement on U [[η]] is ignored. Since [[v]] = 0 we can view the two regions as rigid and hence ρ[[ε]] equals the latent heat λ. The temperature is assumed to be continuous across the interface. The whole problem for the function θ(x, t) then is represented by c− ∂t θ − k− ∂x x θ = 0, x ∈ (0, s); λ˙s + [[k∂x θ]] = 0,

c− ∂t θ − k− ∂x x θ = 0, x ∈ (s, d),

θ(s− ) = θ(s+ ),

t ∈ (0, T ),

along with the initial boundary conditions k− ∂x θ(0, t) = h 1 (t), k+ ∂x θ(d, t) = h 2 (t), 0 < t < T ; θ(x, 0) = (x), x ∈ (0, d),

and s(0) = b ∈ [0, d]

16.3 Liquid–Vapour Transition The two phases are viewed as viscous and heat-conducting fluids. We then adopt the Navier–Stokes–Fourier model T = − p(ρ, θ)1 + 2μD + λ(∇ · v)1,

q = −κ∇θ.

Then Eqs. (16.2)–(16.5) become ρU [[v]] = [[ p]]m − [[2μD + λ(∇ · v)1]]m, ρU [[ε + 21 v2 ]] = [[ pv]] · m − [[2μDv + λ(∇ · v)v]] · m − [[κ∂n θ]],

916

16 Phase Transitions

κ ρU [[η]] + [[ ∂n θ]] ≥ 0, θ where ∂n = m · ∇. For simplicity assume D+ = D+ m ⊗ m and hence ∇ · v+ = D+ . It follows that a solution exists with D− = D− m ⊗ m and v− given by v+ and ρU [[v]] = [[ p]] − [[(2μ + λ)D]]. If [[ p]] is known or, possibly, [[ p]] = 0 then the following equation determines [[ε]] if [[κ∂n θ]] is known. We find ρU [[ε + 21 v2 ]] = [[ pv]] − [[(2μ + λ)Dv]] − [[κ∂n θ]]. This shows that the system is determined if in any subregion we determine the fields ρ, v, θ via the differential equations given by the balance equations ρ˙v = −∇ p + 2μ∇ · D + λ∇(∇ · v) + ρb, ρε˙ = − p∇ · v + 2μD · D + λ(∇ · v)2 + ρr − ∇ · q. Assume the fluids are inviscid or look for transitions where D = 0 and ∂n θ = 0 in both phases. The jump conditions reduce to ρU (v− − v+ ) = p− − p+ ,

(16.12)

2 2 − v+ )} = p− v− − p+ v+ , ρU {ε− − ε+ + 21 (v−

(16.13)

ρU (η− − η+ ) ≥ 0.

(16.14)

To fix ideas we let the region ahead (+) contain the liquid and the region behind (−) contain the vapour. Denote by the subscripts V and L the quantities pertaining to the vapour and the liquid phases. Hence ηV > ηL implies U > 0 and this means that the interface moves toward the liquid phase or that the mass of the vapour phase increases. To apply this approach to phase transitions we need detailed constitutive functions of the two phases. Vapour The vapour is modelled as an ideal gas (see Sect. 5.3). Hence we have1 pV = K θρ =: ρ2 ∂ρ ψV (θ, ρ), 1

ψV (θ, ρ) = K θ ln

The subscript V denotes quantities pertaining to the vapour.

ρ θ + cv θ(1 − ln ) + ε0 , ρ0 θ0

16.3 Liquid–Vapour Transition

917

ηV (θ, ρ) := −∂θ ψV = cv ln

θ ρ − K ln , θ0 ρ0

εV := ψV + θηV = cv θ + ε0 ,

where K = R/M. In terms of θ and p we have p θ θ p + cv θ − c p θ ln + ε0 , ηV (θ, p) = c p ln − K ln . p0 θ0 θ0 p0 (16.15) The Gibbs free energy φ = ψ + p/ρ and the entropy η = −∂θ φ have the form ψV (θ, p) = K θ ln

φV (θ, p) = ψV (θ, p) +

p p θ = K θ ln + c p θ(1 − ln ) + ε0 , ρ p0 θ0

ηV (θ, p) = c p ln

(16.16)

θ p − K ln . θ0 p0

The internal energy ε and the enthalpy h = ε + p/ρ have the form εV = cv θ + ε0 ,

h V = c p θ + ε0 .

16.3.1 Continuity of the Gibbs Free Energy The latent heat (of vapourization) λ is the amount of energy that is needed to vapourize the unit mass of liquid at constant temperature. Consider the balance of energy for an inviscid fluid ρε˙ = − p∇ · v − ∇ · q + ρr. Since ∇ · v = v/v ˙ then we can write ε˙ = − p v˙ − v∇ · q + r. Two models are now examined. First we let the transition occur in a layer of finite thickness and we consider the path from the + side to the − side as functions of time. Next we look at transitions across sharp interfaces. Transitions across layers of finite thickness Consider a process from a state ahead to the corresponding state behind, across the interface. If d is the duration of the process, integrating with respect to time in the interval [0, d] we have d

d

0

0

∫(ε˙ + p v)dt ˙ = ∫(r − v∇ · q)dt = λ.

918

16 Phase Transitions

If, at constant temperature, P is the integral of p(θ, v), namely ∂v P(θ, v) = p(θ, v) then we have d

∫(ε˙ + p v)dt ˙ = εV + P(θ, vV ) − εL − P(θ, vL ); 0

if p is constant then P(θ, v) = pv. It is understood that εV , P(θ, vV ), εL , P(θ, vL ) are evaluated at the same point, but opposite sides of the interface, at constant temperature. Let θ be constant within the layer. If the entropy production and the extra-entropy flux vanish then we can write θη˙ = r − v∇ · q.

(16.17)

Integration in time, over [0, d], of (16.17) associated with the process placed at the interface, implies that the latent heat is also equal to θ(ηV − ηL ). Consequently, at constant temperature, we can write the two representations λ = θ(ηV − ηL ) = εV + P(θ, vV ) − εL − P(θ, vL ). Consider now the Gibbs free energy in the form φ(θ, v) = ε(θ, v) − θη(θ, v) + P(θ, v). We then have φV = εV − θηV + P(θ, vV ) = εL + P(θ, vL ) + λ − (θηL + λ) = φL

(16.18)

and can state the following Theorem 16.1 Across a transition layer liquid–vapour, where the extra-entropy flux and the entropy production vanish while the temperature is constant, the Gibbs free energy φ satisfies the equality φV = φL . Transitions across sharp interfaces Within an error ρU [[ 21 u2 ]], Eq. (16.7) can be written as ρU [[ε +

p ]] − [[q]] = 0, ρ

(16.19)

where q = q · m. If k = 0 and ργ vanishes, or is bounded, then (16.5) holds and then ρU [[η]] = [[q/θ]]. If [[θ]] = 0 then

ρU [[θη]] = [[q]].

16.3 Liquid–Vapour Transition

919

Substitution of [[q]] in (16.19) yields ρU [[φ]] = 0,

φ=ε+

p − θη. ρ

(16.20)

Hence U = 0 implies [[φ]] = 0. This proves the following Theorem 16.2 At a non-stationary, sharp interface if the extra-entropy and the entropy production vanish then, to within an error [[ 21 (v − ν)2 ]], the continuity of the temperature, [[θ]] = 0, implies the continuity of the Gibbs free energy, [[φ]] = 0. We observe that the result [[φ]] = 0 is consistent with the equality of chemical potentials (9.76) in chemical reactions. Moreover, while ηV (θ, p) is expressed by (16.15) for any state of the vapour, the relation λ(θ, p) = θ[ηV (θ, p) − ηL (θ, p)] holds for the states (θ, p) at the interface. For definiteness, at p = 1 atm, θ = 373.15 K we have λ = 40.67 kJ/mol,

ηV − ηL = 109 J/mol K.

Clapeyron’s equation For a fluid the entropy η and the specific volume v are given by the Gibbs free energy φ via v = ∂ p φ(θ, p). η = −∂θ φ(θ, p), These relations are assumed to hold for both vapour and liquid. Let θ, p be constant (continuous) across the interface. The condition φV (θ, p) = φL (θ, p)

(16.21)

results in the function p = p(θ) ˆ characterizing the coexistence line. Differentiation of (16.21) with respect to θ yields ∂θ φV + ∂ p φL pˆ = ∂θ φV + ∂ p φL pˆ . Since ∂θ φV = −ηV , ∂θ φL = −ηL , ∂ p φV = vV , ∂ p φL = vL then pˆ = −

∂ θ φ V − ∂θ φ L ηV − η L = . ∂ p φV − ∂ p φ L vV − v L

Moreover, since ηV − ηL = λθ then

920

16 Phase Transitions

pˆ =

λ . θ(vV − vL )

(16.22)

Equation (16.22), known as Clapeyron’s equation, holds for any phase transition between continua where the stress is in fact a pressure tensor (T = − p1). Hence we can apply (16.22) to the transition ice-water provided we describe the stress as a pressure tensor. With the restriction to the pressure tensor, Eq. (16.22) applies to first-order phase transitions that take place at constant θ and p while the first-order derivatives of the Gibbs free energy, and hence η and v, suffer a jump discontinuity. Two simple consequences are now derived from Clapeyron’s equation (16.22). First we consider the water-vapour transition around the boiling point ( p B , θ B ). Since vV = 1677 vL = 1.043 l/Kg, we may neglect vL as compared to vV . Moreover we may view the vapour as an ideal gas. Hence we have pvV = K θ,

p (θ) =

λp λ = , θvV (θ, p) K θ2

where p is the pressure along the coexistence line. Assume λ is constant. Upon integration of the differential equation we have ln

1 λ 1 p − , =− pB K θ θB

 λ 1 1  − p = p B exp − . K θ θT

Secondly, if stress in ice is modelled by T = − p1 then Clapeyron’s equation applies also to the water-ice transition and we have p (θ) =

λ , θ(vL − v I )

v I being the ice specific volume. Since vL < v I then p (θ) < 0. That is why, in the classical θ − p diagrams, the water-ice (coexistence) line has a negative large slope while the water-vapour line has a positive small slope. In addition, if the pressure on the ice is increased then the transition temperature decreases. This explains the experiment performed by placing a string on a block of ice and hanging two masses from the ends of the string. Sometime later, it is discovered that the string has passed through the ice without cutting it in half: the pressure caused by the string makes the ice melt just below the string, the string slides down, and the water freezes again just above the string. Liquid At the interface liquid and vapour coexist and, by (16.18), at any point of the interface we have φL = φV . We can then view φL − φV = 0 as the condition of equilibrium between liquid and vapour.

16.3 Liquid–Vapour Transition

921

Denote the state of liquid and vapour by the pair (θ, p). The continuity of φ at the interface and the form (16.16) of φV suggests that we let φL (θ, p) = φV (θ, p) − κ f (θ, p) = K θ ln

p θ + c p θ(1 − ln + ε0 − κ f (θ, p), p0 θ0

where κ > 0. Hence we obtain the entropy, ηL (θ, p) = −∂θ φL = ηV (θ, p) + κ∂θ f (θ, p) = c p ln

θ p − K ln + κ f (θ, p), θ0 p0

and the enthalpy, h L (θ, p) = φL (θ, p) + θηL (θ, p) = c p θ + ε0 − κ[ f (θ, p) − θ∂θ f (θ, p)] Incidentally, since ηV − ηL = λ/θ then we find a hint for the determination of f , namely λ κθ∂θ f (θ, p) = ; θ the knowledge of λ(θ, p) at the interface determines ∂θ f (θ, p) at the interface. Liquid–vapour coexistence line For definiteness we let f (θ, p) = G( p) − F(θ) and require that f = 0 at the interface. Otherwise we might say that the continuity condition φL (θ, p) = φV (θ, p) results in 0 = f (θ, p) = G( p) − F(θ). Anyway the coexistence line is supposed to take the form G( p) = F(θ),

(16.23)

where (θ, p) is any state at the interface. We now determine an approximate form of (16.23) to represent the water vapour coexistence line (or vapourization curve) from the triple point to the critical point. Denote by T, B the triple point and the atmospheric boiling point. It is required that G( p B ) = F(θ B ), G( pT ) = F(θT ), where θT = 273.16 K, θ B = 373.15 K, pT = 611.657 Pa, p B = 101.325 · 103 Pa. Since ∂θ f (θ, p) = −F (θ), ∂ p f (θ, p) = G ( p), then we have

922

16 Phase Transitions

ηV (θ, p) − ηL (θ, p) = κF (θ), Let G( p) = ln

p , p0

vV − vL = κG ( p).

F(θ) = α −

β , θ−γ

where p0 is a reference pressure. The requirements G( pT ) − F(θT ) = 0, imply ln

β pT = 0, −α+ p0 θT − γ

G( p B ) − F(θ B ) = 0

ln

pB β = 0. −α+ p0 θB − γ

Hence it follows α = ln

pT pB θB − γ + ln , p0 pT θ B − θT

Consequently ln

β = ln

p B (θ B − γ)(θT − γ) . pT θ B − θT

p p B θ B − γ θ − θT . = ln pT pT θ B − θT θ − γ

(16.24)

The parameter γ should be determined by a best fit procedure. An immediate simple estimate of γ is obtained by letting the curve p = p(θ) take the experimental value at the critical point. Letting θC and pC be the values of θ and p at the critical point we find from (16.24) that θ B − θT ln( pC / pT ) θB − γ = =: δ < 1 θC − γ θC − θT ln( p B / p B ) whence γ=

θ B − δθC . 1−δ

Since θC = 647.09 K and pC = 22.064 · 106 Pa it follows that δ = 0.549 and then γ = 39.684 K. Hence, letting p0 = pT we find α = 17.042, β = 3978.855. The coexistence line is then given the approximate formula ln with p in Pa and θ in K.

3978.855 p = 17.042 − , 611.657 θ − 39.684

16.3 Liquid–Vapour Transition

923

Evolution of the transition The nonlinear system (16.12)–(16.14) gives insights into the effects of pressure and temperature on the transition. First we assume U = 0 whereby the relative mass flowrate is nonzero. By (16.9), β 2 [[v]] = −[[ p]]. Since generally [[v]] = 0 then [[ p]] = 0 and [[v]][[ p]] < 0 or [[ρ]][[ p]] > 0. In solid → liquid or liquid → vapour transitions most frequently [[v]] > 0. Hence the transition is favoured by applying a smaller pressure in the final phase. In the ice → water transition [[ρ]] > 0 and hence the transition is favoured with a greater pressure in the water phase. Most often the transition occurs with a continuous pressure across the interface. We check how the jump conditions allow for transitions with [[ p]] = 0. Let U = 0, that is the interface is convected by the body. Hence [[v]] = −[[U ]], [[ p]] = β[[v]], [[ p]] = −β 2 [[v]], hold with [[v]] = 0 and [[ p]] = 0 while [[v]] and [[ρ]] are unrestricted. Since [[ p]] = 0 then the energy equation (16.10) holds identically. Finally, the entropy inequality is satisfied if q [[ ]] ≤ 0. θ

16.3.2 Viscous Phases and Surface Tension Based on the general jump conditions (16.1)–(16.5) we now evaluate the effects of viscosity of the phases and the curvature of the interface. The two phases are viscous fluids described by T± = − p± 1 + T ± , where T is the viscous stress; we let T = 2μD + λ(tr D)1 + Tˆ , where Tˆ stands for possibly nonlinear or higher-grade terms. The jump conditions (16.2), (16.3) become T ]]m, ρU [[v]] = [[ p]]m − [[T T v]] + m · [[q]]. ρU [[ε + 21 v2 ]] = [[ pv]] − m · [[T This shows that the jump of v and ε are affected by the values of T and q at the sides of the interface and these in turn are induced by the solution v, v, θ to the differential equations

924

16 Phase Transitions

T · D + r − v∇ · q, v˙ = v∇ · v, ρ˙v = −∇ p + ∇ · T , ε˙ = − p v˙ + vT where T , ε, q are meant to be replaced by the corresponding constitutive equations so that eventually we get a system of differential equations in the unknowns v, v, θ. We now wonder about the possible effect of T on the latent heat. Again we let ε, p, v be the values during the process (transition) from the liquid to the vapour phase or across the interface from the +side to the −side. If d is the duration of the process then d

d

d

0

0

0

T · Ddt + ∫(r − v∇ · q)dt. ∫(ε˙ + p v)dt ˙ = ∫ vT Let P be the integral of p, ∂v P(θ, v) = p(θ, v). Then if, as we assume, the transition happens at constant temperature we have d

d

0

0

T · Ddt + ∫(r − v∇ · q)dt. εV + P(θ, vV ) − εL + P(θ, vL ) = ∫ vT Likewise by the entropy inequality we have q v r T · D − 2 q · ∇θ η˙ − ( − v∇ · ) = vT θ θ θ the right-hand side being the entropy production (per unit mass). Hence 1 T · D + (r − v∇ · q) η˙ = vT θ and then integrating we have d

d

0

0

T · Ddt + ∫(r − v∇ · q)dt. θ(ηV − ηL ) = ∫ vT It is then consistent to define the latent heat in the form λ = η(ηV − ηL ) = εV + P(θ, vV ) − εL + P(θ, vL ), as with perfect fluids. Consequently we find again that φV = φ L in the liquid–vapour transition. In some models [15] the jump conditions are modified or improved by assuming that a surface tension σ at the interface produces an additional pressure σκ, κ being the mean curvature of the interface. Formally we may use the present formulation by letting T m include σκm.

16.4 Solid–Fluid Transition

925

16.3.3 The Phase Rule A phase containing n constituents is characterized by the mass fractions ω1 , ..., ωn−1 . To account for the mass conservation it is convenient to consider the dependences on the masses, m 1 , m 2 , ...m n . If  phases may occur then we may have (n − 1) different concentrations and 2 degrees of freedom, the temperature θ and the pressure p. Let φˆ i (ωi1 , ωi2 , ..., ωi(n−1) ) = φi (m i1 , m i2 , ..., m in ) be the Gibbs free energy of the ith phase. The  Gibbs free energy of the whole system is the sum of the partial free energies, φ = h φh , and is assumed to have a minimum at equilibrium. If the kth constituent suffers the phase change i → j then 0 = φ˙ i + φ˙ j = ∂m ik φi m˙ ik + ∂m jk φ j m˙ jk . By the mass conservation, m˙ ik = −m˙ jk . Hence (∂m ik φi − ∂m jk φ j )m˙ ik = 0. The arbitrariness of m˙ ik implies that ∂m ik φi = ∂m jk φ j ,

i = j = 1, 2, ..., ; k = 1, 2, ..., n.

Thus there are n( − 1) equations with 2 + (n − 1) unknowns, θ, p, {ωik }. The degree of variability, or the number of degrees of freedom, ν is given by ν = 2 + (n − 1) − n( − 1) = 2 + n − . The result ν = 2 + n −  is referred to as the (Gibbs) phase rule. By way of example consider water in equilibrium with its vapour. We have n = 1 and  = 2; hence ν = 1. We can choose arbitrarily the temperature but then the (equilibrium) pressure is uniquely determined. At the triple point water is at equilibrium with ice and vapour. In this case n = 1 and  = 3. Thus ν = 0. This means that there is no freedom of choice of the variables; the three phases can coexist only for a fixed value of the temperature and a fixed value of the pressure.

16.4 Solid–Fluid Transition To model fluid–solid transitions we need to specify the constitutive equations for the stress. Fluids are reasonably viewed as thermoelastic materials where T = − p1 and p = p(θ, ρ). Solids are more involved. Ice models may be represented by

926

16 Phase Transitions

T = − p1 + μ( p)D + T , where T is a function of the Cotter–Rivlin derivative of D [14, 249]. Steel instead is modelled by a stress–strain relation with plastic effects [360]. If we denote generically with T+ the stress tensor in the solid phase while T = − p− 1 in the fluid phase then the jump conditions (16.2)–(16.3) become ρU [[v]] = −[[T]]m = p− m + T+ m, ρU [[ε + 21 v2 ]] = −m[[Tv − q]] = p− v− · m + m · T+ v + [[q]] · m. If, instead, we let the fluid phase be viscous then T = − p1 + 2μD + λ(tr D)1 in the region behind (−). The solid might be modelled as linearly elastic and incompressible. We let E = symH = 21 [∇R u + (∇R u)T ] and assume (see Sect. 4.1.1) ∂E ψ = 2μ(θ)E ˆ E,

tr E = ∇R · u = 0, tr E˙ = ∇R · u˙ = 0.

Hence E, T R R = − p1 + 2μE

μ = ρ R μ, ˆ

and, consistent with the linear approximation, we let T = − p1 + 2με,

tr ε = 0.

In any case, to determine ρU, [[v]], [[ε]] we need knowing the possible jumps [[T]] and [[q]]. Whenever the stress is qualitatively different in the two phases (e.g. pressure tensor in the fluid, elastic or viscous stress in the solid) the sharp interface looks inadequate. For instance we should have a stress jump [[T]] that leads to − p− 1 by starting from a non-isotropic T+ . This motivates the recourse to placing a solid–fluid (or liquid–vapour) transition in a layer of finite thickness.2 Order parameter and phase field The transition layer of finite thickness, instead of a sharp interface, is common within the phase field models. In the Landau theory of phase transitions [419] the free energy is taken as a function of the temperature and an order parameter, say e. The free energy ψ is then taken as a power expansion of e. At a sharp interface the function ψ depends on the values of e and θ at the interface. To account for non-local spatial effect and hence for a finite thickness of the interface the order parameter has to be a field, usually denoted as phase field in the literature. There are cases where the phase field is advantageous for the modelling without taking a precise physical meaning. For instance the phase field is merely used to 2

Sometimes named mushy region in the literature.

16.4 Solid–Fluid Transition

927

interpolate experimental data [444]. Further, a functional is assumed to depend on a phase variable, in addition to precise physical fields, and the corresponding differential equation is obtained by requiring the extremum property of the functional [9, 45]. In other cases the role of phase field is played by the concentration or mass fraction. In these cases though the field is subject to balance equations, in particular the consequences of the balance of mass. This view is pursued in the next section.

16.4.1 Transition in a Finite Layer—A Mixture Model The solid–fluid transition is taken to occur in a layer of finite thickness. The two phases are regarded as the constituents of a mixture. Further, owing to the phase transition, the mixture is subject to mass exchanges between the constituents. Denote by the indices F, S the quantities pertaining to the fluid and solid phases. Hence ρ F , ρS are the mass densities, ρ = ρ F + ρS is the mass density of the mixture, and ω = ρ F /ρ is the fluid mass fraction. If p is the pressure applied to the mixture we denote by p F the partial pressure in the fluid and pS that in the solid. The barycentric velocity v of the solid–fluid mixture is v = ω v F + (1 − ω)vS . Let u F , uS be the diffusion velocities and h F , hS the mass fluxes, u F = v F − v,

uS = vS − v,

h F = ρωu F ,

hS = ρ(1 − ω)uS ;

by definition hS + h F = 0. The phase transition results in an exchange of mass between the two phases as it happens in chemical reactions. This is expressed by the balances of mass, ∂t ρ F + ∇ · (ρ F v F ) = τ ,

∂t ρS + ∇ · (ρS vS ) = −τ ,

where τ is the mass of fluid produced per unit volume and unit time. These balances can be expressed in terms of ρ and ω in the form ρω˙ = −∇ · h F + τ ,

ρ˙ = −ρ∇ · v.

(16.25)

The balances of linear momentum and energy are considered for the whole mixture in the standard form for continua, ρ˙v = ∇ · T + ρb,

ρε˙ = T · L − ∇ · q + ρr.

The entropy inequality is assumed by taking the entropy flux in the general form j = q/θ + k,

928

16 Phase Transitions

ρη˙ ≥

q ρr − ∇ · ( + k). θ θ

Hence we have the Clausius–Duhem inequality in the form ˙ + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) θ The constitutive equations are established for the functions ψ, η, h F , T, q, k in terms of the set of variables  = (θ, ρ, ω, ∇θ, ∇ω, D, ∇∇ω) while ω˙ and ρ˙ are determined by (16.25). We assume ψ() is differentiable. Computing ψ˙ and substituting in the Clausius–Duhem inequality we have ˙ − ρ∂ ψ · ∇ω ˙ − ρ∂ ψ · D ˙ −ρ(∂θ ψ + η)θ˙ − ρ∂ρ ψ ρ˙ − ρ∂ω ψ ω˙ − ρ∂∇θ ψ · ∇θ ∇ω D ˙ + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ∂∇∇ω ψ · ∇∇ω θ ˙ = ∇∇ ω˙ − 2sym∇∇ωL, ˙ = ∇ ω˙ − LT ∇ω, (and the like for ∇θ), ∇∇ω Recall that ∇ω and ρ˙ = −ρ∇ · v. Then we can write −ρ(∂θ ψ + η)θ˙ − ρ∂ω ψ ω˙ − ρ∂∇ω ψ · ∇ ω˙ − ρ∂∇∇ω ψ · (∇∇ ω˙ − 2sym∇∇ωL) +[T + ρ2 ∂ρ ψ1 + ρ∇θ ⊗ ∂∇θ ψ + ρ∇ω ⊗ ∂∇ω ψ] · (D + W) ˙ − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ∂∇θ ψ · ∇ θ˙ − ρ∂D ψ · D θ ˙ D, ˙ and W implies ˙ θ, The arbitrariness of ∇∇ ω, ˙ ∇ θ, ∂∇∇ω ψ = 0,

∂∇θ ψ = 0,

∂D ψ = 0,

η = −∂θ ψ,

T + ρ∇ω ⊗ ∂∇ω ψ ∈ Sym. Let T = T + ρ2 ∂ρ ψ1 + ρ∇ω ⊗ ∂∇ω ψ. Upon dividing by θ we can write the remaining inequality in the form 1 ρ 1 ρ − ∂ω ψ ω˙ − ∂∇ω ψ · ∇ ω˙ + T · D − 2 q · ∇θ + ∇ · k ≥ 0. θ θ θ θ The identity ρ ρ ρ ˙ + [∇ · ( ∂∇ω ψ)]ω˙ − ∂∇ω ψ · ∇ ω˙ = −∇ · ( ∂∇ω ψ ω) θ θ θ allows us to write 1 1 ρ ρ ˙ ≥ 0, − δω ψ ω˙ + T · D − 2 q · ∇θ + ∇ · (k − ∂∇ω ψ ω) θ θ θ θ

16.4 Solid–Fluid Transition

where

929

ρ θ δω ψ = ∂ω ψ − ∇ · ( ∂∇ω ψ). ρ θ

In light of (16.25)1 we have 1 1 ρ − δω ψ ω˙ = − δω ψ τ + δω ψ ∇ · h F , θ θ θ while

1 δω ψ 1 δω ψ ∇ · h F = ∇ · ( δω ψ h F ) − h F · ∇ . θ θ θ

Hence the inequality can be written in the form 1 1 1 1 δω ψ ρ T · D − 2 q · ∇θ − δω ψ τ − h F · ∇ + ∇ · (k + δω ψ − ∂∇ω ψ ω) ˙ ≥ 0. θ θ θ θ θ θ This inequality holds if

ρ 1 k = − δω ψ + ∂∇ω ψ ω˙ θ θ

and T · D ≥ 0,

δω ψ τ ≤ 0,

q · ∇θ ≤ 0,

hF · ∇

δω ψ ≤ 0. θ

Since ψ is a function of (θ, ρ, ω, ∇ω) then ψ may depend on ∇ω only through |∇ω|. Consequently ∇ω ⊗ ∂∇ω ψ =

∂|∇ω| ψ ∇ω ⊗ ∇ω ∈ Sym |∇ω|

and hence T ∈ Sym. Moreover, the result for T implies that the stress T has the form T = −ρ2 ∂ρ ψ1 − ρ

∂|∇ω| ψ ∇ω ⊗ ∇ω + T , |∇ω|

where T may be any (dissipative) stress function of D subject to T · D ≥ 0. To compute δω ψ we observe that, by the theory of mixtures, ρψ =



α ρα (ψα

+ 21 uα2 ),

where α labels the peculiar (partial) terms; each ψα determines the partial pressure in the form pα = ρ2α ∂ρα ψα . Here we neglect the diffusive terms uα2 and consider ψ F = ψ F (θ, ρ F ), ψS = ψS (θ, ρS ) whence ψ = ωψ F (θ, ρω) + (1 − ω)ψS (θ, (1 − ω)ρ).

930

16 Phase Transitions

It follows that ∂ω ψ = ψ F − ψ S +

pF pS − = φ F − φS ; ρF ρS

φ F = μ F and φS = μS are the Gibbs free energies or chemical potentials of the two phases. As with chemical reactions, τ δω ψ ≤ 0 means that equilibrium (i.e. τ = 0) occurs when δω ψ = 0 that is when φ F = φS . Very often in the literature the governing equations are derived by starting from a (Ginzburg–Landau) functional for the entropy [45] or the free energy [54, 215] with a density function dependent on the parameter ω and a term |∇ω|2 to account for an interface energy. Here we can do the analogue, though consistent with the mixture model, by considering ψ in the form ψ = ω[ψ F (θ, ρω) + 21 γ F |∇ω|2 ] + (1 − ω)[ψS (θ, ρ(1 − ω)) + 21 γS |∇ω 2 |]. If γ F = γS = γ then

ρ θ δω ψ = φ F − φS − ∇ · ( γ∇ω). ρ θ

For formal simplicity let γ F = γS = γ. Inequality τ δω ψ ≤ 0 leads to ρ θ τ = −κ(θ, ρ, φ){φ F (θ, ρω) − φS (θ, ρ(1 − ω)) − ∇ · ( γ∇ω)}, ρ θ

κ≥0 (16.26)

while h F · ∇δω ψ/θ ≤ 0 leads to ρ θ h F = −M∇{φ F (θ, ρω) − φS (θ, ρ(1 − ω)) − ∇ · ( γ∇ω)}, ρ θ

M ≥ 0. (16.27)

We now let ρ, θ, γ be constants. If τ  ρω˙ then (16.26) simplifies to ω˙ = −

κ κγ f (ω) + ω, ρ ρ

f (ω) := φ F (θ, ρω) − φS (θ, ρ(1 − ω)),

namely the Ginzburg–Landau equation. If instead we let ρω˙  −∇ · h F then (16.27) leads to 1 ∇ · M∇[ f (ω) − γω], ω˙ = ρθ namely the Cahn–Hilliard equation. The |∇ω|2 term is considered as a negative contribution to the entropy density (e.g. − 21 2ω |∇ω|2 in [45]). We might then take γ as proportional to the temperature θ, γ = θγ ∗ , so that

16.4 Solid–Fluid Transition

931

ψ = f (θ, ω) + 21 θγ ∗ |∇θ|2 ,

η = −∂θ f − 21 γ ∗ |∇ω|2 ,

ε = ψ + θη = f (θ, ω),

thus getting a scheme consistent with standard approaches [54, 215].

16.4.2 Evolution of the Mass Fraction If the phase field is just the mass fraction then we have to account for the whole set of balance equations. The Ginzburg–Landau and Cahn–Hilliard equations may be of interest when the phase field is not merely the mass fraction or the volume fraction. The evolution of the mass fraction ω is governed by the balance of energy. We start from the equation given by the theory of mixtures along with the approximations Tα = − pα 1, pα = ωα p, and the neglect of the interaction terms εˆα . Hence we can write ρα ε`α = − pωα ∇ · vα − ∇ · qα + ρα rα . Since

ρ`α v`α =− = ∇ · vα − vα τα vα ρα

then we have ρα ε`α = − pωα

 v`α vα

 + vα τα − ∇ · qα + ρα rα

whence ε`α + pωα v`α = − pωα vα2 τα − vα ∇ · qα + rα . Observe that 1 1 1 1 ω` α = ω˙ α + uα · ∇ωα = − ∇ · (ρα uα ) + uα · ∇ωα + τα = − ωα ∇ · (ρuα ) + τα . ρ ρ ρ ρ

Hence, at constant pressure, pvα ωα pvα ∇ · (ρuα ) − τα . pωα v`α = pω`α vα − p ω` α vα = pω`α vα + ρ ρ Notice that h α = εα + pωα vα is the enthalpy per unit mass. Hence upon substitution and neglect of the diffusive term pvα ∇ · (ρuα ) we obtain ρα h` α  −∇ · qα + ρα rα .

(16.28)

The sum of (16.28) over α gives the total heat supplied per unit volume. We now examine how (16.28) governs the time dependence of the mass fraction. To this

932

16 Phase Transitions

purpose we still neglect the diffusive terms (h` α  h˙ α ) and sum over α to obtain3 

˙ = −∇ · q + ρr,

α ρα h α

q=



α qα ,

ρr =



α ρα r α .

Go back to fluid and solid constituents. Assume h F = h F (θ, ρ F ), ρ F = ρω,

h S = h S (θ, ρS ), ρS = ρ(1 − ω).

If ρ F  ρS then it is reasonable to assume ρ is constant and hence ˙ ρ˙ F = ρω,

ρ˙ S = −ρω. ˙

Otherwise we have ρ˙ F = ω ρ˙ − ∇ · (ρ F u F ) + τ and we might exploit ρ F instead of ω to describe the evolution of the transition. ˙ Take ω to describe the transition and observe that (with ρ˙ F  ρω) ˙ ρ F h˙ F + ρS h˙ S = ρ(ρ F ∂ρ F h F − ρS ∂ρS h S )ω. Hence we have ω˙ =

1 (ρr − ∇ · q). ρ(ρ F ∂ρ F h F − ρS ∂ρS h S )

(16.29)

16.5 Solidification-Melting of Binary Alloys The transition solidification-melting is considered for alloys. For formal simplicity we restrict attention to binary (isomorphous) alloys and consider the continuum as a mixture of the two components A, B; as an example, we may have in mind the copper-nickel alloy [16]. The phase diagram shows three regions, call them Liquid, Liquid-Solid, Solid. The phase boundary which limits the liquid phase is called the Liquidus line; the line which limits the solid phase is called the Solidus line (Fig. 16.2). Let θ A , θ B be the melting, or solidification, temperatures of A, B and take θ A < θ B . Consider the alloy, at a given composition, in the liquid phase, θ > θL . As the temperature θ decreases up to θL , on the Liquidus line, a first transition occurs where the liquid alloy ABl separates into A liquid and B solid, thanks to the corresponding absorption of energy. Once the transition is completed we have a mixture of A liquid and B solid. Upon decreasing the temperature the mixture reaches the state on the Solidus line, θ = θS . By subtracting energy, the transition occurs between A liquid B solid and the solid alloy ABs. 3

The literature shows various schemes where the evolution of the transition is given by the enthalpy, see e.g. [355].

16.5 Solidification-Melting of Binary Alloys

933

Fig. 16.2 a Liquidus-solidus diagram: the temperature θ is plotted versus c, the concentration of B. b Cooling curve: the temperature θ is plotted versus time t with constant heat removed

Both transitions can be modelled as a chemical reaction within a mixture of appropriate constituents. The transition at the Liquidus is viewed as a reaction between ABl, Al, Bs, with exchanges of atoms of B between ABl and Bs. Likewise, the transition at the Solidus is viewed as a reaction between Al, Bs, ABs with exchanges of atoms of A between Al and ABs. For definiteness here we describe the transition at the Liquidus and hence the chemical reaction between ABl, Al, Bs. For simplicity we describe the mixture by means of the balances for a single body; the balance of energy and the entropy inequality are then taken in the standard form ρε˙ = T · L − ∇ · q + ρr, ˙ + T · L − 1 q · ∇θ + θ∇ · k ≥ 0. −ρ(ψ˙ + η θ) θ For a more detailed description of the mixture we take advantage from the properties arising from the theory of mixtures. In the transition ABl ⇐⇒ Al, Bs there is a mass exchange of B between ABl and Bs. Denote by the subscripts ABl, Bs, Al the quantities pertaining to ABl, Bs, Al and let ρ = ρ ABl + ρ B s + ρ Al ,

ρv = ρ ABl v ABl + ρ B s v B s + ρ Al v Al .

We then consider the balances of mass, in the barycentric frame, in the form ρ˙ ABl = −∇ · (ρ ABl u ABl ) − ρ ABl ∇ · v + τ , ρ˙ B s = −∇ · (ρ B s u B s ) − ρ B s ∇ · v − τ , ρ˙ Al = −∇ · (ρ Al u Al ) − ρ Al ∇ · v,

934

16 Phase Transitions

where τ is the mass supply of B in ABl. Moreover the mass fractions ωα = ρα /ρ are governed by ρω˙ ABl = − ∇ · (ρ ABl u ABl )+τ , ρω˙ B s = −∇ · (ρ B s u B s ) − τ , ρω˙ Al = −∇ · (ρ Al u Al ). The constitutive properties of the mixture are described by the set of independent variables  = (θ, {ρα }, D, ∇θ, {∇ρα }, {uα }, {ρ˙α }, ). We let the free energy ψ, the entropy η, the stress T, the heat flux q, the extra-entropy flux k be continuous functions of . Indeed, by the theory of mixtures, the free energy ψ is related to the peculiar free energies {ψα } of the constituents by ρψ =



α ρα (ψα

+ 21 uα2 ).

Here we take the approximation that the quantities {uα2 } are negligible and hence we let  α = ABl, Bs, Al. ρψ = α ρα ψα (θ, ρα , ∇θ, ∇ρα , D, ρ˙α , ∇∇ρα ), Moreover ψ is taken to be continuously differentiable. For a compact writing we first compute ρψ˙ in the case where ψα = ψα (ρα ). Then ψ˙ =



˙

α ωα ψα

=



˙ α ψα αω

+ ωα ρ˙α ∂ρα ψα .

Since ρ˙α = −∇ · hα − ρα ∇ · v + τα then ρψ˙ =



˙ α ψα + ρα ∂ρα ψα ρ˙ α ) = α (ρω



α ψα (τα − ∇ · hα ) +



α ρα ∂ρα ψα (τα − ∇ · hα − ρα ∇ · v).

We let ρ2α ∂ρα ψα = pα be the partial pressure and p = mixture while μα = ψα + pα /ρα , hα = ρα uα . It follows ρψ˙ =



α [−μα ∇



α pα

the pressure in the

· hα + τα μα − p∇ · v].

With this in mind we now compute ρψ˙ in more involved models. First we ignore spatial interaction effects via gradients of mass densities. Hence we take ψ=



α ωα ψα (θ, ρα );

the dependence on D and ∇θ would be ruled out by the Clausius–Duhem inequality; we omit the details. The Clausius–Duhem inequality can then be written in the form

16.5 Solidification-Melting of Binary Alloys

˙ p∇ · v− −ρ(∂θ ψ+η)θ+



α [τα μα

935

1 − μα ∇ · hα )]+T · L − q · ∇θ + θ∇ · k ≥ 0. θ

The arbitrariness of θ˙ implies that η = −∂θ ψ. Moreover, since L = D + W then the arbitrariness of W implies T ∈ Sym. Now observe that p∇ · v + T · D = (T + p1) · D. Divide the remaining inequality by θ, −

1 1 1 1 τα μα + μα ∇ · hα + (T + p1) · D − 2 q · ∇θ + ∇ · k ≥ 0, θ α θ α θ θ

and use the identity 1 1 μα μα ∇ · hα = ∇ · μα hα − hα · ∇ θ θ θ to obtain −

 1 1 1 μα 1 + (T + p1) · D − 2 q · ∇θ + ∇ · (k + μα hα ) ≥ 0. τα μα + α hα · ∇ α θ θ θ θ θ

This inequality is satisfied if  μα ≤ 0, α τα μα ≤ 0, θ   while k = − α μα hα /θ and q · ∇θ ≤ 0. As expected α τα μα ≤ 0 is consistent with (9.75). Here it follows that T = − p1 + T , T · D ≥ 0,



α hα

·∇

τ (μ ABl − μ B s ) ≤ 0.

(16.30)

We may conclude that the transition occurs at the temperature θL such that μ ABl (θL ) = μ B s (θL ) and does not occur at θ B > θL . This in turn means that the alloy AB, with θ A < θ B , has a temperature of solidification which is lower than that of the metal B. This in turn shows that μ ABl is a decreasing function of temperature. Moreover, inequality (16.30) implies that τ ≥ 0 if and only if μ ABl − μ B s ≤ 0 and vice versa; matter flows toward the phase at a lower chemical potential. Inequality (16.30) indicates the direction of the transition; the value of τ is given by the balance of energy as with (16.29). Strictly analogous conclusions hold if we consider the transition at the Solidus line between Al, Bs, and ABs. Spatial interaction may be modelled by allowing for a dependence of ψα also on ∇ρα , ρ˙α , ∇∇ρα . We find that ∂ρ˙ α ψα = 0, ∂∇∇ρα ψα = 0 and

936

16 Phase Transitions

 ρα  ρα 1 ∂ρα ψα ρ˙α − α ∂∇ρα ψα · (∇ ρ˙α − LT ∇ρα ) α ψα (τα − ∇ · hα ) − α θ θ θ 1 1 + T · L − 2 q · ∇θ + ∇ · k ≥ 0. θ θ  Applying the standard identity to α (ρα /θ)∂∇ρα ψα · ∇ ρ˙α we can write the inequality −



 ρα  1 1 1 ψα (τα − ∇ · hα ) − α δρα ψα ρ˙ α + [T + α ρα ∇ρα ⊗ ∂∇ρα ψα ] · L − 2 q · ∇θ θ α θ θ θ  ρα +∇ · (k − α ∂∇ρα ψα ρ˙ α ) ≥ 0, θ

where δρα ψα = ∂ρα ψα − (θ/ρα )∇ · [(ρα /θ)∂∇ρα ψα ]. Now we replace ρ˙α and define pα = ρ2α δρα ψα ,

μα = ψα + ρα δρα ψα .

(16.31)

Letting T = T + p1 we have −

  ρα 1 1 1 μα (τα − ∇ · hα ) + [T + α ρα (∇ρα ⊗ ∂∇ρα ψα )] · L − 2 q · ∇θ + ∇ · (k − α ψα ρ˙ α ) ≥ 0 θ α θ θ θ

Applying the standard identity to



α (μα /θ)∇

· hα we find that

  1 1 μα 1 T + α ρα (∇ρα ⊗ ∂∇ρα ψα )] · L − 2 q · ∇θ + [T α μα τα − α hα · ∇ θ θ θ θ  μα  ρα +∇ · (k − α ∂∇ρα ψα ρ˙α + α hα ) ≥ 0. θ θ  Again we find the condition α μα τα ≤ 0 though with the improved definition (16.31)2 of the chemical potential μα . The arbitrariness of W = skwL implies T +  α ρα (∇ρα ⊗ ∂∇ρα ψα ) ∈ Sym. If we assume T ∈ Sym then it follows ∂∇ρα ψα ∝ ∇ρα ; this implies that ψα depends on ∇ρα through |∇ρα |. −

16.6 Brine Channels Formation in Sea Ice The presence of salt (solute) in water (solvent) results in the freezing point depression. This phenomenon is referred to as cryoscopic effect and depends only on the solute concentration. During sea ice formation, salt is excluded from the solid phase. The dissolved salt makes the water of the ocean more dense. The salinity of the liquid phase thus increases and this lowers the freezing point. When the salinity is over 0.05 the solution is named brine. Nevertheless, at the end of the freezing process the salinity of the solution never reaches the unit value; the salt concentration has an upper limit which depends on temperature [263]. At room temperature (θr  291 K) the solubility limit

16.6 Brine Channels Formation in Sea Ice

937

Fig. 16.3 Plot of the ice-saline water coexistence line (freeze line) and solubility line: ξb = 0.05 is the minimum salinity of the brine, ξl = 0.233 and ξs = 0.27 denote the solubility limits

of salt is about 0.27. This system then offers two phase transitions: ice—salt water and salt water—salt. Let ρs , ρw be the mass densities of salt and water and ξ be the salt concentration in water, ρs ξ= . ρs + ρw As ξ > ξl equilibrium holds between saline water and saline water + salt through the solubility line (Fig. 16.3). Ice—saline water transition We let T = − p1 in both regions and hence consider the jump conditions in the form (16.6), namely ρU [[v]] = [[ p]]m,

ρU [[ε + 21 v2 ]] = [[ pv + q]],

ρU [[η]] − [[q/θ]] ≥ 0.

We let U > 0 indicate that the ice region progresses. We may have the static solution U = 0 in which case [[ p]] = 0, [[v]] = 0 and the solution holds with [[q]] = 0, [[θ]] = 0 while φi = φsw . If instead [[θ]] = 0 and [[ p]] = 0 then, by (16.8) it follows that > 0 if ψi < ψsw . The coexistence line ice—saline water is the solution of the equality of free energies or chemical potentials, φi = φsw . It follows (experimentally) that the line is represented by θ(ξ) = θτ − α0 ξ,

α0 = (θτ − θσ )/ξl  91.3 K

938

16 Phase Transitions

The salinity lowers the freezing temperature up to ωs = ξl  0.23 and θ = θσ  252 K. Transition saline water − saline water and salt The solubility line between saline water and saline water plus solid salt is found to be expressed by ξ ∈ [ξl , ξs ], θ(ξ) = θσ + β0 (ξ − ξl ), where β0  975 K. Mixture model of the transition If the two phases are not separated by a sharp interface then it is more appropriate to model saline water and ice as a mixture of three constituents occupying a time dependent region  [160]. Denote by the subscripts s, w, i the quantities pertaining to salt, water, and ice. Accordingly ρs , ρw , ρi are the mass densities of salt, water, and ice and ρ = ρs + ρw + ρi is the mass density of the mixture. Let ωs , ωi be the concentrations of salt and ice, ωs =

ρs , ρ

ωi =

ρi . ρ

Since ξ = ρs /(ρs + ρw ) then 1 ρi ρs + ρw ωi 1 = + = + ωs ρs ρs ωs ξ whence ωs = ξ(1 − ωi ). In a bounded (small) region it is reasonable to consider the constraint due to the mass of salt by letting Ms ≤ ξs M s + M ω + Mi the solubility limit in uniform conditions. Let vs , vw , vi be the velocity fields. The barycentric velocity v is written in the form v = ωs vs + ωi vi + (1 − ωs − ωi )vw . Let us , ui the diffusion velocities and hs , hi the mass fluxes,

16.6 Brine Channels Formation in Sea Ice

939

us = vs − v, ui = vi − v, hs = ρωs us , hi = ρωi ui . In the transition ice—saline water the salt is conserved (in the phase of saline water). The salt then is subject to ρω˙ s = −∇ · hs , while ∫ ρωs dv = constant. 

It follows that 0=

d ∫ ρωs dv = ∫ ρω˙ s dv. dt  

Hence we have ∫ hs · nda = 0,

∂

which is guaranteed by hs · n = 0 at ∂. Ice and water suffer a phase transition which is modelled as a chemical reaction. First we consider the balance of mass of ice, ∂t ρi + ∇ · (ρωi vi ) = τ , τ being the mass of ice produced per unit time and unit volume. Otherwise we can write ρω˙ i = −∇ · hi + τ . The continuity equation for water then can be written ρω˙ i = ∇ · hw + τ . The transition among saline water and saline water + salt is simply the formation of brine channels from the salt in excess of the solubility.

16.6.1 Transition Models via Minimization of Functionals We now outline a variational approach developed in the literature. Let  be a spatial domain occupied by two constituents A, B constant mass densities and temperature. The equilibrium is assumed to be associated with the minimum of a functional F (ω) = ∫ F(ω, ∇ω)dv, 

940

16 Phase Transitions

F being a free energy [9, 68, 215] and ω is the mass fraction of a constituent, say A. For definiteness F is usually taken as the Landau–Ginzburg functional where F(ω, ∇ω) = f (ω) + 21 κ|∇ω|2 . Observe that d F = ∫[∂ω f ω˙ + κ∂∇ω F · ∇ ω]dv ˙ = ∫ δω F ωdv ˙ + ∫ ∂∇ω F · n ωda, ˙ dt   ∂ where δω = ∂ω − ∇ · ∂∇ω is the variational derivative. The assumption n · ∇ω = 0 at ∂ makes the boundary term vanish if the Landau–Ginzburg functional is selected. Hence equilibrium is governed by the Euler–Lagrange equation δω F = 0. If the mass of the single constituents is conserved then equilibrium is governed by ∂ω f (ω) − ∇ · (κ∇ω) = λ, λ being the Lagrange multiplier connected with the mass conservation. By the minimum property it follows that d F /dt ≤ 0. Now, 0≥

d F = ∫(δω F)ω˙ dv, dt 

holds if ω˙ = −K (ω, θ)δω F, in that

(16.32)

d F = − ∫ K (δω F)2 dv. dt 

If we let ω be governed by

we assume

K ≥ 0,

ω˙ = −∇ · h

h = − Kˆ (ω, θ)∇δω F,

(16.33) Kˆ > 0,

and find d F = − ∫ δω F (−∇ · h)dv = − ∫ δω ∇ · ( K˜ ∇δω F)dv = − ∫ K˜ (δω F)2 dv dt    provided n · ∇δω F = 0 at ∂. Two remarks are in order. First, it is claimed [9, 215] that δω F is the (relative) chemical potential. This can be justified, e.g. by letting F = f as the Gibbs free energy in the form ([449], Chap. 17)

16.7 Phase-Field Model of Liquid-Solid Transitions

f =



941

α ωα μα ,

μα being the αth chemical potential. As n = 2, ω1 = ω, ω2 = −ω and then ∂ω f = μ1 − μ2 if μα is independent of ωα . This independence looks an approximation. Secondly, the mass fractions are governed by the differential equations ρω˙ α = −∇ · (ρα uα ) + τα as is shown in Chap. 9. Hence (16.32) and (16.33) are used in place of the right Eqs. (9.27) and (9.69).

16.7 Phase-Field Model of Liquid-Solid Transitions A simple model of phase transitions reduces significantly the set of fields describing the continuum by assuming that the body is undeformable. The two phases are incompressible solid and liquid; the mass density ρ is a parameter. The independent fields are then the temperature θ and a phase field ϕ; to fix ideas ϕ = 0 in the solid phase, ϕ = 1 in the liquid phase. The region occupied by the body, , is time independent. The balance of energy is of purely thermal character in the form ρε˙ = ρr − ∇ · q and the second-law inequality is ρη˙ + ∇ · (

ρr q + k) − = σ ≥ 0. θ θ

Consequently we have ˙ − 1 q · ∇θ + θ∇ · k = θσ ≥ 0. − ρ(ψ˙ + η θ) θ

(16.34)

We look for a scheme where the phase field ϕ is governed by a rate-type equation. Let ψ, η, q, k, ϕ˙ and σ be continuous functions of the independent variables θ, ϕ, ∇θ, ∇ϕ, ϕ, ˙ ∇∇ϕ. Moreover let ψ be differentiable. Computing ψ˙ and substituting in (16.34) we find −ρ(∂θ ψ + η)θ˙ − ρ∂ϕ ψ ϕ˙ − ρ∂∇θ ψ · ∇ θ˙ − ρ∂∇ϕ ψ · ∇ ϕ˙ − ρ∂ϕ˙ ψ ϕ ¨ − ρ∂∇∇ϕ ψ · ∇∇ ϕ˙ −

1 q · ∇θ + θ∇ · k ≥ 0. θ

942

16 Phase Transitions

˙ and θ˙ implies that The arbitrariness of ϕ, ¨ ∇∇ ϕ, ˙ ∇ θ, ∂ϕ˙ ψ = 0,

∂∇∇ϕ ψ = 0,

∂∇θ ψ = 0,

η = −∂θ ψ.

Consider the remaining inequality and divide by θ to have ρ 1 ρ − ∂ϕ ψ ϕ˙ − ∂∇ϕ ψ · ∇ ϕ˙ − 2 q · ∇θ + ∇ · k ≥ 0. θ θ θ In view of the identity ρ   ρ  ρ − ∂∇ϕ ∇ ϕ˙ = −∇ · ∂∇ϕ ψ ϕ˙ + ∇ · ∂∇ϕ ψ ϕ˙ θ θ θ we can write ρ 1 ρ − δϕ ψ ϕ˙ − 2 q · ∇θ + ∇ · (k − ∂∇ϕ ψ ϕ) ˙ ≥ 0, θ θ θ where δϕ ψ = ∂ϕ ψ − (θ/ρ)∇ · ((ρ/θ)∂∇ϕ ψ). Hence the extra-entropy flux k is given by ρ k = ∂∇ϕ ψ ϕ˙ θ while q satisfies the heat conduction inequality q · ∇θ ≤ 0 and hence we take the Fourier-like law q = −κ(θ, ϕ)∇θ, κ(θ, ϕ) ≥ 0. The sought function ϕ˙ is subject to δϕ ψ ϕ˙ ≤ 0. The simplest function ϕ˙ is ϕ˙ = −u(θ, ϕ) δϕ ψ,

u(θ, ϕ) ≥ 0.

Both κ and u are parameterized by ρ. If ψ = ψ0 (θ, ϕ) + 21 h(θ)|∇ϕ|2 then δϕ ψ = ∂ϕ ψ0 − θ∇ · [(h(θ)/θ)∇ϕ] and ϕ˙ = −[∂ϕ ψ0 − hϕ − θ∇ϕ · ∇(h/θ)]. Unless h = cθ, the variational derivative δϕ ψ contains a term ∇ϕ · ∇θ.

16.7 Phase-Field Model of Liquid-Solid Transitions

943

A physically motivated function ψ0 is determined in terms of the internal energy ε. Since ψ = ε − θη = ε + θ∂θ ψ then we have ε 1 ∂θ ψ − ψ = − , θ θ

ε 1 ∂θ ( ψ) = − 2 . θ θ

whence, for a chosen θ0 , θ

ψ(θ, ϕ, ∇ϕ) = −θ ∫ θ0

ε(ζ, ϕ, ∇ϕ) ˜ dζ + θψ(ϕ, ∇ϕ); ζ2

ˆ the occurrence of θψ(ϕ, ∇ϕ) leaves the internal energy ε invariant. Let θ M be the melting temperature and ε(θ, ϕ, ∇ϕ) = ε(θ, ˆ ϕ) + 21 g(θ)|∇ϕ|2 . Hence, letting θ0 = θ M we can express ψ in the form  θ ε(ζ,  ˆ ϕ) ˆ ψ(θ, ϕ) = θ − ∫ dζ + ψ(ϕ, ∇ϕ) − 21 θG(θ)|∇ϕ|2 , 2 ζ θM ˆ where ψ(ϕ, ∇ϕ) is so far an undetermined function while θ

G(θ) = ∫

θM

g(ζ) dζ ζ2

and hence h(θ) = −θG(θ). For a definite model we need the function ε(θ, ϕ). Let εS , εL be the internal energy density of solid and liquid phases and then assume ε(θ, ˆ ϕ) = εS (θ) + f (ϕ)λ(θ) = εL (θ) + [ f (ϕ) − 1]λ(θ), where λ(θ) = εL (θ) − εS (θ) and λ(θ M ) is the latent heat. The function f (ϕ) : [0, 1] → [0, 1] describes the transition and is subject to f (0) = 0, f (1) = 1; possibly f (ϕ) = ϕ. Hence ψ0 (θ, ϕ) can be written in the form  θ εL (ζ)  ˜ ψ0 (θ, ϕ) = θ − ∫ dζ − [ f (ϕ) − 1]Q(θ) + ψ(ϕ, ∇ϕ) , 2 θM ζ where

θ

Q(θ) = ∫

θM

λ(ζ) dζ. ζ2

˜ Let ψ(ϕ, ∇ϕ) = 21 2 |∇ϕ|2 + p(ϕ). The rate equation for ϕ then is given the form

944

16 Phase Transitions

ϕ˙ = u[θ f (ϕ)Q(θ) + θ2 ϕ − θ p (ϕ) + ∇ · (G(θ)∇ϕ)].

(16.35)

Further details are necessary to determine space-time properties of temperature and phase field. Assume the internal energy in the liquid depends linearly on the temperature and hence let εL (θ) = εL (θ M ) + c(θ − θ M ), where c is the specific heat (at constant volume). The balance of energy, ρε˙ = ∇ · κ∇θ, results in ˙ = −∇(κ∇θ). ρ{[c + ( f (ϕ) − 1)λ (θ)]θ˙ + λ(θ) f (ϕ)ϕ}

(16.36)

Equations (16.35) and (16.36) are the system of equations for θ and ϕ. Geometrical aspects of the interfaces are modelled via the function p(ϕ). At the melting temperature, G and Q vanish and the differential equation for ϕ shows that the stationary ϕ is given by 2 ϕ − p (ϕ) = 0.

16.8 Phase Transitions in SMA A shape-memory alloy (SMA) is an alloy that can be deformed when cold and returns to the pre-deformed shape when heated. Nickel-titanium (Ni-Ti) is the most common alloy showing the SMA effect. This alloy can exist in two different phases, with three different crystal structures (phases): twinned martensite, detwinned martensite, and austenite. Martensite exists at lower temperatures, and austenite exists at higher temperatures. Austenite, say A, is a solid phase. Martensite can exist in two solid states: selfaccommodated or twinned and oriented or detwinned, M+ , M− . The thermomechanical phase transformation produces two effects, the shape memory effect and the pseudo-elastic effect. Pseudo-elasticity results in a hysteretic behaviour under a tensile test. As the temperature grows loops move away from the origin in the strainstress plane. In the one-dimensional pseudo-elastic regime a mix of just two phases occurs, A and M+ in loading, A and M− in unloading. Transitions are described by an order parameter ϕ such that ϕ = 0 in A, ϕ = 1 in M+ and ϕ = −1 in M− . To simplify the description we use the parameter ϕ2 ∈ [0, 1] and hence we do not distinguish the content of martensite, M, among M+ and M− . The transition from martensite to austenite is induced by temperature and stress and no diffusion is involved. The SMA effect occurs because a temperature- or stressinduced transformation reverses deformation. Austenite transforms into martensite upon cooling; conversely, upon heating austenite transforms into austenite. Upon loading austenite transforms to martensite, upon unloading martensite reverts to austenite. If large stresses are applied a plastic behaviour happens. If the material is

16.8 Phase Transitions in SMA

945

Fig. 16.4 SMA major hysteresis loops at θ1 > θA (dashed) and θ2 > θ1 (solid)

unloaded before the plastic deformation, the hysteresis loop shows a reversible cycle (Fig. 16.4). A SMA is viewed as a mixture of austenite and martensite phases. Let ρ A and ρ M be the mass densities of austenite and martensite and hence ρ = ρ A + ρM the mass density of the alloy. We then let ϕ2 =

ρM , ρ

ϕ2 ∈ [0, 1],

be the mass fraction of martensite. Restrict attention to homogeneous one-dimensional models by assuming that the reference mass density ρ R keeps constant and the pertinent fields are uniaxial, namely T R R = s e ⊗ e and E = e e ⊗ e, e being a fixed unit vector; the scalar fields s and e are the engineering (Piola) stress and the engineering strain. The referential free enthalpy ρ R φ = ρ R ψ − s e is taken as a function of s, e parameterized by the temperature θ. For formal convenience we consider the referential densities ε R = ρ R ε,

η R = ρ R η,

ψR = ρR ψ

and q R = q R · e. The Clausius–Duhem inequality (2.43) can be written in the forms4 ˙ + s e˙ − 1 q R ∂ X θ ≥ 0, −(ψ˙ R + η R θ) θ

˙ − s˙ e − 1 q R ∂ X θ ≥ 0, −(φ˙ R + η R θ) θ

depending on whether we base on the Helmholtz free energy ψ R or the Gibbs free energy φ R . The transition from the martensitic to the austenitic phases may be induced by the temperature or the stress. 4

The present approach is consistent with a zero extra-entropy flux.

946

16 Phase Transitions

Fig. 16.5 Temperature-stress diagram: temperature induced (a) and stress induced (b)

To describe the temperature-induced transition at constant stress s = s¯ we assume the alloy is in the martensitic phase up to θ = θ¯M . Increasing the temperature produces an increase of the austenite fraction and the process is complete at θ > θ¯M . Decreasing the temperature leaves the alloy unchanged up to θ¯A < θ¯M . Further decreasing of the temperature produces an increase of the martensite content at θ¯A and the alloy goes back to being fully martensitic at lower temperatures (see Fig. 16.5a). A similar process occurs during stress-induced transitions at constant tem˜ as shown in Fig. 16.5b. perature θ = θ, The occurrence of two distinct temperature thresholds, in the first case, and stress, in the second one, indicate that these processes have a hysteretic character. Yet a constitutive hysteretic relation between ϕ2 and θ, but independent of e and s, may seem quite drastic. For definiteness we now neglect heat conduction and consider the Clausius–Duhem inequality in the form ˙ + s e˙ = − f ≥ 0 −(ψ˙ R + η R θ) where f = J θσ. In [371] ψ is a function of e, θ, and α, as the analogue of ϕ2 , while f is proportional to |α| ˙ with coefficients dependent on sgn α. ˙ By analogy with the hysteretic models of plasticity and ferroics we think that a model of SMA should involve a nonlinear relation between s˙ and e. ˙ We now let α = ϕ2 and assume ˙ α,  = (θ, α, s, e, θ, ˙ s˙ , e) ˙ is the set of independent variables. By the standard arguments we find that ψ R is in ˙ α, fact independent of θ, ˙ s˙ , e. ˙ Hence the Clausius–Duhem inequality simplifies to ˙ α, ˙ s˙ , e). ˙ (∂θ ψ R + η R )θ˙ + ∂α ψ R α˙ + ∂s ψ R s˙ + (∂e ψ R − s)e˙ = − f (θ, α, s, e, θ, If, as is reasonable, we let θ = θ0 be constant then we have ∂α ψ R α˙ + ∂s ψ R s˙ + (∂e ψ R − s)e˙ = − f 0 ,

16.8 Phase Transitions in SMA

947

where f 0 = f (θ0 , α, s, 0, α, ˙ s˙ ). Hence s, e, α are required to satisfy a nonlinear first-order equation, ˙ ∂s ψ R s˙ ∂α ψ R α˙ + f 0 (α) + . e˙ = s − ∂e ψ R s − ∂e ψ R To simplify the relation among the unknowns, we assume that e = eel + epl where eel and epl are given by e˙el =

∂s ψ R s˙ , s − ∂e ψ R

e˙pl =

˙ ∂α ψ R α˙ + f 0 (α) . s − ∂e ψ R

Hence eel plays the role of an elastic part of e while epl is the hysteretic part of e. Moreover s − ∂e ψ R ∂s ψ R can be viewed as the elastic modulus [19]. The strain epl is related to the martensitic phases. In the deformation-induced transformation mechanisms (or tensile tests) the martensite M+ (M− ) is produced if s is positive (negative). Hence we characterize epl and ϕ by assuming ϕ takes the same sign of s and epl , namely ϕ s ≥ 0, ϕ epl ≥ 0. To get detailed models we have merely to select the functions ψ R and f 0 . A different approach follows by considering the Clausius–Duhem inequality in the form (16.37) φ˙ R + η R θ˙ + e s˙ = f ≥ 0 and assuming from the very beginning that a model of SMA should involve a decomposition of the deformation e into an elastic and a plastic part. For ease in writing, we now let α = ϕ2 and assume e = eel + epl ,

eel = g(s), epl = d(θ)γ(α)sgn s

(16.38)

where g : R → R and γ : [0, 1] → [0, 1] are such that g(0) = 0, g (s) > 0, γ(0) = 0, γ (α) ≥ 0, γ(1) = 1, γ (1) = 0, γ (1) < 0.

The set of independent variables is then ˙ α,  = (θ, α, s, θ, ˙ s˙ ). ˙ α, ˙ s˙ . Hence, By the standard arguments we find that φ R is in fact independent of θ, using (16.38), the Clausius–Duhem inequality (16.37) simplifies to

948

16 Phase Transitions

˙ α, (∂θ φ R + η R )θ˙ + ∂α φ R α˙ + [∂s φ R + g + dγ sgn s]˙s = f (θ, α, s, θ, ˙ s˙ ). If we let θ = θ˜ be constant then we have ˜ sgn s]˙s = f˜ ≥ 0. ∂α φ˜ R α˙ + [∂s φ˜ R + g + dγ ˜ α, s), d˜ = d(θ) ˜ and f˜(α, s, α, ˜ α, s, 0, α, ˙ s˙ ) = f (θ, ˙ s˙ ). where φ˜ R (α, s) = φ R (θ, Hence, s and α are required to satisfy the following nonlinear first-order differential equation ˜ sgn s f˜(α, s, α, ˙ s˙ ) ∂s φ˜ R + g + dγ s˙ + . (16.39) α˙ = − ∂α φ˜ R ∂α φ˜ R which describes the hysteretic stress-induced phase transition at constant temperature after properly selecting the functions φ˜ R and f˜ (Figs. 16.6 and 16.7). Otherwise, to describe temperature-induced transitions we let the stress s = s¯ be constant; the Clausius–Duhem inequality (16.37) leads to ˙ α, ˙ s˙ ). (∂θ φ¯ R + η¯ R )θ˙ + ∂α φ¯ R α˙ = f¯(θ, α, s, θ, ˙ α) ˙ = f (θ, α, s¯ , where φ¯ R (θ, α) = φ R (θ, α, s¯ ), η¯ R (θ, α) = η R (θ, α, s¯ ) and f¯(θ, α, θ, ˙ α, θ, ˙ 0). Hence, θ and α are required to satisfy the following nonlinear first-order differential equation

Fig. 16.6 Stress-induced transitions at a fixed temperature θ˜ > θ∗ : from left to right A → M+ (◦), from right to left M+ → A ()

Fig. 16.7 Stress–strain loop in stress-induced transitions at a fixed temperature θ˜ > θ∗ : from left to right A → M+ (◦), from right to left M+ → A ()

16.8 Phase Transitions in SMA

949

Fig. 16.8 Temperature-induced transitions at a fixed stress s¯ > 0: from right to left A → M+ (◦), from left to right M+ → A ()

Fig. 16.9 Stress–strain loop in temperature-induced transitions at a fixed stress s¯ > 0: from left to right A → M+ (◦), from right to left M+ → A ()

α˙ = −

˙ α) ∂θ φ¯ R + η¯ R ˙ f¯(θ, α, θ, ˙ θ+ ¯ ¯ ∂α φ R ∂α φ R

(16.40)

which describes the hysteretic temperature-induced phase transition at constant stress after properly selecting the functions φ¯ R , η¯ and f¯ (Figs. 16.8 and 16.9).

Appendix A

Notes on Vectors and Tensors

A.1

Vector and Tensor Algebra

The space under consideration is a three-dimensional Euclidean point space E . Points are elements of E and are denoted by x. Granted a choice of origin, we identify all points with the corresponding vectors from the origin. A vector space over the reals R (or the complex numbers C) is a set V together with two operations satisfying the properties u + v = v + u, u + (v + w) = (u + v) + w, ∃0 : u + 0 = u, ∃ − u : u + (−u) = 0

αβu = α(βu), 1u = u, (α + β)u = αu + βu, α(u + v) = αu + αv, for all u, v, w ∈ V and α, β ∈ R. The length |u| of a vector u has the property of a norm, |u| ≥ 0, |u| = 0 ⇐⇒ u = 0; |u + v| ≤ |u| + |v|; |αu| = |α| |u|, |α| denoting the absolute value. The length |u| is also called the Euclidean norm or l2 - norm. The inner product and cross product of vectors u, v are denoted by u · v, By definition

u × v.

u · v := |u| |v| cos θ,

θ being the angle between u and v; it follows that the inner product so defined satisfies u · v = v · u,

(αu + βv) · w = αu · w + βv · w,

u · u ≥ 0, = 0 ⇐⇒ u = 0.

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0

951

952

Appendix A : Notes on Vectors and Tensors

The magnitude (or length) |u| of u is related to the inner product by |u| = (u · u)1/2 . Throughout the symbol u2 stands for u · u = |u|2 . Let n be the unit vector, perpendicular to the plane containing u and v, such that the angle θ between u and v, in the counter-clockwise sense, is subject to θ ∈ (0, π); the direction of n is given also by the right-hand screw rule applied to u, v, n. The cross product u × v is defined by u × v := |u| |v| sin θ n if θ ∈ (0, π) and u × v = 0 if θ = 0, π. A Cartesian coordinate frame for E consists of a chosen origin together with a positively oriented orthonormal basis {e1 , e2 , e3 }. Throughout latin subscripts range over 1, 2, 3. Hence the orthonormal property is characterized by ei · e j = δi j , δi j denoting the Kronecker delta. The positively oriented property of the basis vectors implies that ei · (e j × ek ) = i jk , where i jk is the alternating symbol, i jk

⎧ ⎨ 1, if i, j, k are an even permutation of 1, 2, 3 = −1, if i, j, k are an odd permutation of 1, 2, 3 ⎩ 0, otherwise.

It follows that the alternating symbol is invariant under cyclic permutations, i jk =  jki = ki j . Throughout the Einstein summation convention is understood. Since {ei } is a basis then every vector u admits a unique representation, u = u j ej, the summation over j = 1, 2, 3 being understood. The scalars {u j } are called the Cartesian components of u. By the orthonormal character of the basis it follows that u i = u · ei . Likewise, letting u = e j × ek we have e j × ek = [ei · e j × ek ]ei = i jk ei .

Appendix A : Notes on Vectors and Tensors

953

In terms of the components, the inner and cross products are given by u · v = (u i ei ) · (u j e j ) = u i vi u × v = (u j e j ) × (vk ek ) = u j vk e j × ek = i jk u j vk ei . Accordingly, (u × v)i = i jk u j vk and the mixed product w · (u × v) is given by w · (u × v) = i jk wi u j vk . Since i jk wi u j vk =  jki u j vk wi = ki j vk wi u j then w · (u × v) = u · (v × w) = v · (w × u), a mixed product is invariant upon a cyclic permutation of the vectors or upon the interchange of the dot and cross products. Some identities involving the cross product and the alternating symbol are often useful in vector calculus.1 First ei × (e j × ek ) = δik e j − δi j ek . Hence u × (v × w) = (u · w)v − (u · v)w. Now, ei · (e j × ek ) ei · (e p × eq ) = (e j × ek ) · (e p × eq ) = e p · eq × (e j × ek ) = e p · (δqk e j − δq j ek ).

Hence it follows the identity i jk i pq = δ j p δkq − δ jq δkp . Consequently, i jk i jq = 2δkq ,

i jk i jk = 6.

Since e j × ek = [ei · (e j × ek )]ei = i jk ei 1

The detailed proofs are given in [325], pp. 6431–6440.

(A.1)

954

Appendix A : Notes on Vectors and Tensors

then  pjk e j × ek =  pjk i jk ei = 2δ pi ei = 2e p . As a consequence it follows the identity ei = 21 i jk e j × ek . A tensor, S say, is a linear mapping of vectors to vectors, Su = v. The linearity is expressed by S(αu + βv) = αSu + βSv, for all vectors u, v and scalars α, β. The sum of tensors, S + T, and the multiplication by a scalar, αS, are defined via the action on vectors, i.e. (S + T)u = Su + Tu,

(αS)u = αSu,

for all vectors u. The zero tensor 0 and the identity tensor 1 are defined by 0u = 0,

1u = u

for all vectors u. The dyadic product u ⊗ v of two vectors u and v is a tensor defined by (u ⊗ v)w = (v · w)u for all vector w. Let e be a unit vector. Since (e ⊗ e)u = (u · e)e for any vector u, then the tensor e ⊗ e maps each vector u to the projection (u · e)e of u in the direction of e. Likewise, (1 − e ⊗ e)u = u − (u · e)e,

e · (1 − e ⊗ e)u = 0.

For any vector u, the tensor 1 − e ⊗ e maps u to the projection of u perpendicular to e. Hence e ⊗ e and 1 − e ⊗ e are tensors providing the projections onto e and onto the plane perpendicular to e. Incidentally, since 1u = u = (u · ei )ei = (ei ⊗ ei )u then we have the representation of the identity, 1 = ei ⊗ ei , the summation over i being understood.

Appendix A : Notes on Vectors and Tensors

955

We now determine the components of a tensor. Given a tensor S and any vector u let v = Su. Hence vk ek = S(u j e j ) = u j Se j . Inner multiplying by ei we obtain vi = (ei · Se j )u j . Hence we have the component form of v = Su as vi = Si j u j upon defining the components of S as Si j := ei · Se j . Conversely, to obtain the tensor in terms of the components we observe v = vi ei = (Si j u j )ei = (Si j e j · u)ei = (Si j ei ⊗ e j )u whence it follows Su = (Si j ei ⊗ e j )u. By the arbitrariness of u we conclude that S = Si j ei ⊗ e j . The matrix S = [Si j ] is the matrix associated with the tensor S, relative to the selected basis {ei }. The transpose of a tensor S is the unique tensor ST such that u · Sv = v · Su for all vectors u and v. To find the connection between the matrices of S and ST we observe ei · ST e j = e j · Sei and hence (ST )i j = S ji ,

ST = Si j e j ⊗ ei .

The matrix associated with ST is the transpose of that associated with S. Applying the definition of transpose we find that (S + T)T = ST + TT ,

(ST )T = S,

1T = 1.

956

Appendix A : Notes on Vectors and Tensors

This is so because, by definition, v · (S + T)T u = u · (S + T)v = u · Sv + u · Tv = v · ST u + v · TT u = v · (ST + TT )u,

v · (ST )T u = u · ST v = v · Su, v · 1T u = u · 1v = v · u, whence (S + T)T u = (ST + TT )u, (ST )T u = Su, 1T u = u and the conclusions follow. A tensor S is symmetric if S = ST and is skew if S = −ST . Let Sym be the set of symmetric tensors and Skw be the set of skew tensors. The identity S = 21 (S + ST ) + 21 (S − ST ) and the observation that S + ST ∈ Sym, S − ST ∈ Skw show that any tensor admits a (unique) decomposition into symmetric and skew parts, S = sym S + skw S. The uniqueness of the decomposition follows by observing that if ˆ + B, ˆ A, A ˆ ∈ Sym, B, Bˆ ∈ Skw S=A+B=A then

ˆ S + ST = 2A = 2A,

ˆ S − ST = 2B = 2B,

ˆ B = B. ˆ whence A = A, Given two tensors S, T the product ST is defined by (ST)u := S(Tu), for all vectors u. To determine the matrix associated with the product ST we observe ei · STe j = ST ei · Te j = Sik ek · Th j eh = Sik Th j ek · eh = Sik Tk j , whence (ST)i j = Sik Tk j . Accordingly, the matrix associated with ST is obtained from S and T via the standard row-column rule for computing the product of matrices. The symbol Sm denotes the m-times product, Sm = S S · · · S. m times

Appendix A : Notes on Vectors and Tensors

957

In particular S2 = S S. To determine the action of the transpose of the product we observe that v · (ST)T u = u · (ST)v = u · S(Tv) = (Tv) · ST u = v · TT ST u, and then (ST)T = TT ST . In applications we may find tensors in the form A S AT . Since v · ASAT u = v · A(SAT u) = (SAT u) · AT v = AT u) · ST AT v = u · AST AT v, whence (ASAT )T = AST AT . As a consequence S ∈ Sym =⇒ ASAT ∈ Sym,

sym(ASAT ) = A(symS)AT .

The tensors may involve a dyadic product. Now, S(u ⊗ v)w = S[(u ⊗ v)w] = S(v · w)u = (v · w)Su = [(Su) ⊗ v]w, (u ⊗ v)Sw = (u ⊗ v)(Sw) = u v · Sw = u ST v · w = (u ⊗ ST v)w, t · (u ⊗ v)T w = w · (u ⊗ v)t = w · u t · v = t · (v ⊗ u)w. Accordingly, S(u ⊗ v) = (Su) ⊗ v,

(u ⊗ v)S = u ⊗ (ST v),

(u ⊗ v)T = v ⊗ u.

There is a 1-1 correspondence between skew tensors and axial vectors . If  ∈ Skw then the axial vector a of  is defined by u = a × u for all vectors u. To determine the direct correspondence between  and a we observe that, by definition,  pq u q =  piq (a )i u q whence  pq =  piq (a )i . Conversely, multiplying by  j pq we have

958

Appendix A : Notes on Vectors and Tensors

 j pq  pq =  j pq  piq (a )i = −2 δi j (a )i = −2(a ) j , whence (a ) j = − 21  j pq  pq . The trace of a tensor S is a linear operation based on the definition tr (ei ⊗ e j ) = ei · e j . Hence tr S = tr [Si j ei ⊗ e j ] = Si j tr (ei ⊗ e j ) = Sii and hence tr S = tr S. Further, tr 1 = tr (ei ⊗ ei ) = 3. If S ∈ Skw then tr S = 0. A tensor S is said to be traceless if tr S = 0. For any tensor S, S0 := S − 13 (tr S)1 is traceless and S0 is said to be the deviatoric part (or deviator) of S. The identity S = [S − 13 (tr S)1] + 13 (tr S)1 means that each tensor admits the decomposition S = S0 + 13 (tr S)1. Given two tensors S, T the inner product S · T is defined by2 S · T = tr (ST T). It follows S · T = (ST T) j j = S Tji Ti j = Si j Ti j . As a consequence, S · T = T · S,

1 · S = tr S.

Some properties are useful about the inner product of tensors. First, S ∈ Sym, W ∈ Skw

=⇒

S · W = 0;

in the geometrical language, S and W are orthogonal. This follows by observing that 2

Often the notation S : T is used instead of S · T.

Appendix A : Notes on Vectors and Tensors

959

S · W = tr (ST W) = tr (SW) = tr (WS) = −tr (WT S) = −W · S = −S · W. Let |S| :=

√  S · S = i, j Si2j .

Since S = sym S + skw S and sym S · skw S = 0 then |S|2 = |sym S|2 + |skw S|2 . Likewise, by the decomposition of S via the traceless part we have |S|2 = |S0 |2 + 13 (tr S)2 . A tensor S is invertible if there is a tensor S−1 such that SS−1 = S−1 S = 1, S−1 being called the inverse of S. If S and T are invertible then right multiplication of (ST)−1 ST = 1 by T−1 and S−1 gives (ST)−1 = T−1 S−1 . Also by

it follows that

ST (S−1 )T = (S−1 S)T = 1 S−T := (S−1 )T = (ST )−1 .

The determinant of a tensor S is defined by [216] det S =

Su · (Sv × Sw) u · (v × w)

for any basis u, v, w. Owing to the geometrical meaning of the mixed product, | det S| is the ratio of the volume of the parallelepiped defined by the vectors Su, Sv, and Sw to the volume of the parallelepiped defined by u, v, and w. For simplicity we let u, v, w be the orthonormal triad and define det S = whence

Sei · (Se j × Sek ) ei · (e j × ek )

960

Appendix A : Notes on Vectors and Tensors

i jk det S = Sei · (Se j × Sek ). Since Sei = (S ph e p ⊗ eh )ei = S pi ei and the like for Se j and Sek we obtain i jk det S =  pqr S pi Sq j Sr k . Multiplying by i jk and summing over i, j, k we find that det S = 16 i jk  pqr S pi Sq j Sr k . Otherwise, we let i = 1, j = 2, k = 3 to obtain det S =  pqr S p1 Sq2 Sr 3 = det S. Accordingly, the determinant of the tensor equals the determinant of the corresponding matrix. By using the equality of the determinants, det S = det S, it follows that det ST = det S,

det(S T) = (det S)(det T).

Hence by S−1 S = 1 it follows det S−1 = (det S)−1 . Since (S−1 )i j = (S −1 )i j it follows that S is invertible if and only if det S = 0. A tensor Q is orthogonal if Qu · Qv = u · v for all vectors u, v. In particular if v = u it follows (Qu)2 = u2 or |Qu| = |u|; for any vector u the length of Qu equals that of u. Furthermore, if α is the angle between u and v while β is the angle between Qu and Qv then cos β =

u·v Qu · Qv = = cos α; |Qu||Qv| |u||v|

the angle is left invariant upon the action of an orthogonal tensor. Equivalently, orthogonal tensors Q are characterized by the condition

Appendix A : Notes on Vectors and Tensors

961

QT Q = QQT = 1. To prove the equivalence we first assume Qu · Qv = u · v for all vectors u, v. Let v = u so that u · u = Qu · Qu = u · QT Qu whence u · (QT Q − 1)u = 0 for all u. Let T = QT Q − 1 and observe T ∈ Sym. We need showing that if T ∈ Sym satisfies u · Tu = 0 for any vector u then T = 0. To this end we consider the identity (u + v) · T(u + v) − u · Tu − v · Tv = 2u · Tv for any vectors u, v and T ∈ Sym. The left-hand side is zero. Hence u · Tv = 0 for any vectors u, v. This implies T = 0. As a consequence QT Q = 1. This in turn implies that det Q = ±1, Q is invertible and QT = Q−1 . Conversely, if QT Q = 1 then Qu · Qv = u · QT Qv = u · v, which completes the proof. An orthogonal tensor Q is said to be a rotation if det Q = 1. If QT Q = 1 then Q ∈ Orth; if further det Q = 1 then Q ∈ Orth+ . A scalar ω is an eigenvalue of a tensor S if there is a nonzero vector v such that Sv = ωv. If S is symmetric then there is an orthonormal basis {ˆei } of eigenvectors of S such that 3 ωi eˆ i ⊗ eˆ i , S = i=1 ωi being the eigenvalue of S associated with the eigenvector eˆ i . If S has two distinct eigenvalues, ω1 = ω2 = ω3 then ω2 eˆ 2 ⊗ eˆ 2 + ω3 eˆ 3 ⊗ eˆ 3 = ω2 (ˆe2 ⊗ eˆ 2 + eˆ 3 ⊗ eˆ 3 ). Since eˆ 2 ⊗ eˆ 2 + eˆ 3 ⊗ eˆ 3 = 1 − eˆ 1 ⊗ eˆ 1 then S = ω1 eˆ 1 ⊗ eˆ 1 + ω2 (1 − eˆ 1 ⊗ eˆ 1 ). If ω1 = ω2 = ω3 = ω then

S = ω1;

962

Appendix A : Notes on Vectors and Tensors

in this case each vector is an eigenvector. A tensor S is positive-definite if it is symmetric and u · Su > 0 for all nonzero vectors u. If S is positive-definite then letting eˆ i be the i-th basis eigenvector we have 0 < eˆ i · Sˆei = ωi and hence each eigenvalue ωi is positive. Conversely, if ω1 , ω2 , ω3 are positive then u = u i eˆ i results in u · Su = u j eˆ j · Su i eˆ i =



i, j u i u j ωi δi j

=



2 i ωi u i .

Accordingly S is positive-definite if and only if the eigenvalues {ωi } are positive. Given two orthonormal bases {e j }, {ˆei } we can express eˆ i relative to the basis {e j } in the form Ri j = eˆ i · e j . eˆ i = Ri j e j , Since δi h = eˆ i · eˆ h = Ri j Rhk e j · ek = Ri j Rh j = (R R T )i h then R R T = 1l. Hence R T = R −1 and R T R = 1l. Accordingly R is orthogonal and, subject to det R = 1, a rotation. Geometrically, R describes the (passive) rotation from e j to eˆ i . If R is the rotation matrix from {e j } to {ˆei } so that eˆ i = Ri j e j then the matrices S, Sˆ of S relative to the bases {e j }, {ˆei } satisfy Si j = ei · Se j = Ri p eˆ p · SR jq eˆ q = Ri p R jq Sˆ pq . Hence

S = R Sˆ R T .

If {ˆei } is the basis of eigenvectors then det S = det S = det Sˆ = ω1 ω2 ω3 ,

tr S = tr S = tr Sˆ = ω1 + ω2 + ω3 .

As a consequence, if S is positive-definite then ωi > 0, i = 1, 2, 3, and hence det S > 0,

tr S > 0.

Appendix A : Notes on Vectors and Tensors

963

For any positive-definite tensor S there is a unique positive-definite tensor U such that U2 = S; √ √ the symbol S is then used for √ U and S is referred to as the square root of S. To prove the existence of S we observe that if S is positive-definite then S=

 i

ωi eˆ i ⊗ eˆ i ,

Let U=

 √ i

ωi > 0.

ωi eˆ i ⊗ eˆ i .

Now, for any vector v U2 v =

Accordingly,

 √

 √ √ √ ωi ω j eˆ i ⊗ eˆ i eˆ j ⊗ eˆ j v = i, j ωi ω j eˆ i ⊗ eˆ i eˆ j vˆ j  √ √   = i, j ωi ω j eˆ i δi j vˆ j = i ωi eˆ i vˆi = i ωi eˆ i ⊗ eˆ i v.

i, j

 √ i, j

ωi

 √ ω j eˆ i ⊗ eˆ i eˆ j ⊗ eˆ j = i ωi eˆ i ⊗ eˆ i

and U2 = S.

A.1.1

Eigenvalues and Eigenvectors

Let ω be an eigenvalue of S and e be the corresponding eigenvector. Then (S − ω1)e = 0 and S − ω1 is not invertible whence det(S − ω1) = 0. The corresponding matrix form, det(S − ω1l) = 0 results in the characteristic equation ω 3 − a1 ω 2 + a2 ω − a3 = 0, where a1 = tr S, a3 = det S, while a2 = S22 S33 − S32 S23 + S11 S22 − S21 S12 + S11 S33 − S13 S31 is the sum of the principal minors of S.

(A.2)

964

Appendix A : Notes on Vectors and Tensors

To get a simpler representation of a2 we observe that 2 2 2 + S22 + S33 + 2S12 S21 + 2S13 S31 + 2S23 S32 , tr S 2 = Si j S ji = S11 2 2 2 + S22 + S33 + 2S11 S22 + 2S11 S33 + 2S22 S33 . (tr S)2 = S11

Hence (tr S)2 − tr S 2 = 2(S11 S22 − S12 S21 )+2(S11 S33 − S13 S31 ) + 2(S22 S33 − S23 S32 ) = 2a2 .

Hence we can write a1 = tr S,

a2 = 21 [(tr S)2 − tr S 2 ],

a3 = det S.

Upon an orthogonal transformation, with matrix Q, the matrix S is transformed to Sˆ such that Q T Q = Q Q T = 1l. Sˆ = Q S Q T , It follows that

tr Sˆ = Q i j S jk Q ik = δ jk S jk = tr S, tr Sˆ 2 = tr Q S Q T Q S Q T = tr Q S 2 Q T = tr S 2 , det Sˆ = det Q S Q T = (det Q)2 det S = det S.

Accordingly, a1 , a2 , a3 are invariant under the group of orthogonal transformations. That is why the symbols I1 , I2 , I3 are currently used instead of a1 , a2 , a3 and I1 , I2 , I3 are called principal invariants of the matrix (and of the tensor). Let S be a symmetric tensor and let e be an eigenvector of S with ω the corresponding eigenvalue. Hence Se = ωe and S2 e = S(Se) = ωSe = ω 2 e,

S3 e = ω 3 e.

Consequently (S3 − I1 S2 + I2 S − I3 1)e = (ω 3 − I1 ω 2 + I2 ω − I3 )e = 0. Since S is symmetric there is an orthonormal basis {ˆei } of eigenvectors of S and hence, for any vector u = uˆ i eˆ i , (S3 − I1 S2 + I2 S − I3 1)u = uˆ i (S3 − I1 S2 + I2 S − I3 1)ˆei =



3 i uˆ i (ωi

− I1 ωi2 + I2 ωi − I3 )ˆei .

Appendix A : Notes on Vectors and Tensors

965

Since each eigenvalue ωi satisfies the characteristic equation (A.2) then (S3 − I1 S2 + I2 S − I3 1)u = 0 for any vector u. This implies that S3 − I1 S2 + I2 S − I3 1 = 0.

(A.3)

By (A.3) we can state Theorem A.1 (Cayley–Hamilton) Every tensor satisfies its own characteristic equation. Equation (A.3) is naturally referred to as the Cayley–Hamilton equation for S; it is the tensor analogue of the Cayley–Hamilton equation for matrices [183] which, though, need not require the symmetry of the matrix. Incidentally, for symmetric tensors S the principal invariants are given by the eigenvalues in the form I 1 = ω1 + ω2 + ω3 ,

I 2 = ω1 ω2 + ω2 ω3 + ω1 ω3 ,

I 3 = ω1 ω2 ω3 .

In connection with a tensor S, define the quantities Ji , i = 1, 2, 3, as Ji = det S i . They are invariants in that det[ Sˆ i ] = det[(Q S Q T )i ] = det[Q S i Q T ] = det S i . Indeed, they are named the main invariants of S [388]. Further invariants are defined as I 1 = tr S,

I2 = tr S 2 ,

I3 = tr S 3 .

The invariance of tr S i follows by observing that tr Sˆ i = tr (Q S Q T )i = tr (Q S i Q T ) = tr [(Q T Q)S i ] = tr S i . The eigenvalues ω of an orthogonal tensor are subject to |ω| = 1. If the orthogonal tensor is a rotation then one of the eigenvalues equals 1. To prove these assertions we consider Q ∈ Orth and let Qv = ωv,

v ∈ C3 .

Take the complex conjugate, denoted by ∗, and then the transpose to obtain

966

Appendix A : Notes on Vectors and Tensors

Qv∗ = ω ∗ v∗ , Hence we find

(v∗ )T QT = ω ∗ (v∗ )T .

(v∗ )T QT Qv = ωω ∗ (v∗ )T · v.

Since Q ∈ Orth then QT Q = 1 then it follows (v∗ )T · v(1 − ωω ∗ ) = 0. As a consequence, v = 0 implies ωω ∗ = 1, and hence |ω| = 1. The characteristic equation det(Q − ω1) = −ω 3 + I1 ω 2 − I2 ω + det Q = 0 is satisfied by at least one real value, say ω1 . The remaining roots ω2 , ω3 are complex conjugate, ω3 = ω2∗ . Now, det Q = ω1 ω2 ω2∗ = ω1 and hence ω1 = ±1. If det Q = 1 then ω1 = 1. As a consequence, for any rotation Q there is a vector v, associated with ω1 , such that Qv = v; the vector v, which is invariant under Q, defines the rotation axis. Let v be a vector in R3 and n a unit vector along an axis of rotation. To determine the effect of a rotation on a vector we observe that any vector v can be decomposed as v := (v · n)n, v⊥ = −n × (n × v), v = v + v⊥ , v and v⊥ being the parallel and the orthogonal parts of v relative to n. Upon a rotation of v about n by an angle θ, the vector v is left unchanged. Instead, if i1 , i2 are unit vectors such that {i1 , i2 , n} is an orthonormal basis then vˆ1 = cos θv1 − sin θv2 ,

vˆ2 = − sin θv1 + cos θv2 ,

and hence vˆ ⊥ = cos θv⊥ + sin θn × v⊥ . As a consequence, upon the rotation, v → vˆ such that vˆ = cos θ v + sin θ n × v + (1 − cos θ)(n · v)n.

(A.4)

Appendix A : Notes on Vectors and Tensors

967

Equation (A.4) is usually referred to as Rodrigues rotation formula. The transformation v → vˆ given by (A.4) can be viewed as the effect of the rotation Q = n ⊗ n + cos θ(1 − n ⊗ n) + sin θ(i2 ⊗ i1 − i1 ⊗ i2 ). Now, I1 = tr Q = 1 + 2 cos θ,

I2 = 21 [(tr Q)2 −tr Q2 ] = 1+2 cos θ,

I3 = det Q = 1.

Since I1 = I2 = tr Q then the Cayley–Hamilton theorem allows us to write QT = Q−1 = Q2 − (tr Q)Q + (tr Q)1.

A.1.2

Length and Induced Norm

The cross product u × v, the multiplication by a scalar αv, the product by a tensor Sv, and the dyadic product u ⊗ v are linear operators of V into V ; an operator A is linear if A(αu + βv) = αAu + βAv for all u, v ∈ V and all real α, β. A linear operator A is bounded if there is a constant M such that for all v ∈ V we have |Av| ≤ M|v|. The least such M is called the norm of A, induced by the length of vectors. Thus A := sup |Av|. |v|=1

Hence |Av| ≤ A |v|. An induced operator norm is sub-multiplicative in that TS ≤ T S . This follows at once by |TSv| ≤ T S |v|,

TS = sup |TSv|. |v|=1

The cross product u × v can be viewed as the operator u × applied to v. Since |u × v| ≤ |u| |v|, and equality holds if v is perpendicular to u, then u × = |u|.

968

Appendix A : Notes on Vectors and Tensors

The norm of the dyadic product is found by observing that |(u ⊗ w)v| = |u w · v| = |u| |w · v| ≤ |u| |w| |v| while equality holds if v = ±w. Hence it follows |u ⊗ w| = |u| |w|. To determine the norm of a tensor S we observe Sv = Si j v j ei ,

|Sv| = [



i (Si j v j )

]

2 1/2

.

Now, Si j v j can be viewed as the inner product (in R3 ) of the vectors (Si1 , Si2 , Si3 ) and (v1 , v2 , v3 ). The Cauchy–Schwarz inequality, as well as the inner product in V , gives   |Si j v j | ≤ ( j Si2j )1/2 ( h vh2 )1/2 and hence |Sv| = [



i (Si j v j )



Since |v| = (

]

2 1/2

2 1/2 h vh )

≤[

  2  2 1/2   = ( i, j Si2j )1/2 ( h vh2 )1/2 . i( j Si j )( h vh )]

then it follows S ≤ (



2 1/2 . i, j Si j )

(A.5)

 The same result emerges by observing that i, j Si2j can be viewed as the inner product in R9 and applying the Cauchy–Schwarz inequality. Yet, if S is symmetric then we represent v in the basis of eigenvectors, v = vˆ j eˆ j , to obtain |Sv|2 = (Sv) · (Sv) = Svˆ j eˆ j · Svˆ h eˆ h =



ˆj i, j ω j ωh vˆ j vˆ h e

· eˆ h =



2 2 j ω j vˆ j

≤ ωmax



2 j vˆ j ,

where ωmax = max j |ω j |. Moreover equality holds if v is in the direction of the eigenvector associated with the largest eigenvalue. Hence, for symmetric tensors, S = ωmax ;

(A.6)

the norm (A.6) is called the spectral norm. Accordingly, in general it is the condition (A.5) that holds true and hence we set S = (



2 1/2 ; i, j Si j )

this norm is usually called the Frobenius norm. There are several fourth-order tensors constructed from second-order ones. If A, B ∈ Lin the tensor products A  B and A  B are fourth-order tensors defined by (A  B)C := A C B,

Appendix A : Notes on Vectors and Tensors

969

(A  B)C := A C BT , for every C ∈ Lin. In indicial notation, (A  B)i j hk = Ai h Bk j ,

(A  B)i j hk = Ai h B jk .

In the box product  the two tensors are ascribed the first and second pair of indices. Consistently we find (A  B)(C  D) = (AC)  (BD), (A  B)[u ⊗ v] = (Au) ⊗ (Bv); in indicial notation, [(A  B)(C  D)]i j pq = (A  B)i j hk (C  D)hkpq = Ai h B jk C hk Dkq = (Ai h C hp )(B jk Dkq ) = (AC)i p (BD) jq = [(AC)  (BD)]i j pq . If A ∈ Lin then we let

(1  1)A = 1A1 = A.

Hence I := 1  1 is the identity operator in Lin. The transpose (A  B)T of (A  B) is defined by (A  B)T = (AT  BT ). Hence, for orthogonal tensors Q1 , Q2 it follows (Q1  Q2 )T (Q1  Q2 ) = (Q1T  Q2T )(Q1  Q2 ) = (Q1T Q1 )  (Q2T  Q2 ) = 1  1 = I.

For a fourth-order tensor C the eigenvalue problem involves finding a pair (λ, A) of the eigenvalue λ and the (second-order) eigentensor A such that CA = λA,

Ci j hk Ahk = λAi j .

Let n be the rotation vector of Q ∈ Orth+ , Qn = n, and let (λ, m), (λ∗ , m∗ ) denote the remaining eigenvalue-eigenvector pairs. We can then prove the following statement. Let Q ∈ Orth+ . The eigenvalue-eigenvector pairs of Q = Q  Q are (1, 1), (1, Q), (1, Q2 ), (λ, n ⊗ m), (λ, m ⊗ n), (λ∗ , n ⊗ m∗ ), (λ∗ , m∗ ⊗ n), (λ2 , m ⊗ m), (λ∗2 , m∗ ⊗ m∗ ).

970

Appendix A : Notes on Vectors and Tensors

The proof is given by a direct check. For instance, Q1 = Q1QT = QQT = 1,

QQ = QQQT = Q,

QQ2 = QQ2 QT = QQQQT = QQ = Q2 , Qn ⊗ m = (Qn) ⊗ (Qm) = n ⊗ λm = λn ⊗ m, Qm∗ ⊗ m∗ = (Qm∗ ) ⊗ (Qm∗ ) = λ∗ m∗ ⊗ λ∗ m∗ = λ∗2 m∗ ⊗ m∗ . The literature shows also a dyadic product between second-order tensors which is the strict analogue of dyadic product between vectors. If A, B, C ∈ Lin then A ⊗ B is defined by3 (A ⊗ B)C = (B · C)A, or [(A ⊗ B)C]i j = (A ⊗ B)i j hk C hk = Ai j Bhk C hk .

A.1.3

Representation Formulae for Vectors and Tensors

Let e be a unit vector. The tensors e⊗e

and

1−e⊗e

determine projections of any vector u onto e, say u , and onto the plane perpendicular to e, say u⊥ . Hence u = u + u⊥ = (e ⊗ e)u + (1 − e ⊗ e)u. If u = u · e is fixed whereas u⊥ is undetermined then we can write u = (u · e)e + (1 − e ⊗ e)w

(A.7)

for any vector w in that [(1 − e ⊗ e)w] · e = w · e − (e · e)e · w = 0 and hence (1 − e ⊗ e)w is perpendicular to e for any vector w. The representation (A.7) has a strict analogue for tensors. Let I be the unit fourthorder tensor and Y a second-order tensor then This product is named fourth-order trace projection tensor [229]. The notation A ⊗ B is nonunique; the symbol A  B is also used [234].

3

Appendix A : Notes on Vectors and Tensors

971

[(I − N ⊗ N)Y] · N = 0; in words, the projection onto N of the orthogonal (or transverse) part of Y with respect to N is identically zero. Hence, Y = Y + Y⊥ ,

Y := (Y · N)N, Y⊥ := (I − N ⊗ N)Y,

and Y⊥ · N = 0. This representation is unique. Now suppose Z is a second-order tensor such that Z is known while Z⊥ is undetermined. Then we observe that [(I − N ⊗ N)G] · N = 0, with any second-order tensor G and then (I − N ⊗ N)G is a possible representation of Z⊥ . Hence to emphasize the unknown property of Z⊥ we represent Z in the form Z = (Z · N)N + (I − N ⊗ N)G

(A.8)

for any (second-order) tensor G. As we can see in some contexts, thermodynamic requirements may result in the knowledge of the projections u or Z relative to appropriate vectors e or tensors N. In those cases the representation formulae (A.7) or (A.8) apply.

A.2

Isotropic Tensors

By definition, isotropic tensors are characterized by the property that their components take the same values in all Cartesian coordinate systems. Let Q = 1 be the rotation matrix between two coordinate systems. The components of a tensor of order n satisfy Ai1 ···in = Q i1 j1 · · · Q in jn A j1 ··· jn The tensor is said to be isotropic if it is invariant, Ai1 ···in = Ai1 ···in , under any rotation Q. Owing to invariance, all scalars are isotropic. A vector, say w, is isotropic if wi = Q i j w j = wi . This is true only if Q = 1 and hence there are no isotropic vectors. Among the admissible rotations are those associated with time-dependent differentiable matrices Q(t) such that Q(0) = 1 and hence

972

Appendix A : Notes on Vectors and Tensors

˙ Q(t) = 1 + Q(0)t + o(t). The orthogonality of the function Q(t), namely Q(t)QT (t) = 1, for all t ≥ 0, implies that T T ˙ T (t) = −[Q(t)Q ˙ ˙ (t) = −Q(t)Q (t)]T Q(t)Q T ˙ ˙ whence Q(t)Q (t) ∈ Skw for all t ≥ 0. Since Q(0) = QT (0) = 1 then Q(0) ∈ Skw. Consequently there is a vector, say w, such that

Q˙ pq =  piq wi ;

wi = − 21 i pq Q˙ pq ,

˙ stands for Q(0). ˙ henceforth Q Require that the second-order tensor M be isotropic, Mi j = Q i p Q jq M pq = Mi j . Substitution of Q results in 0 = ( Q˙ i p δ jq + Q˙ jq δi p )M pq = wr (ir p M pj +  jrq Miq ). ˙ ∈ Skw, and hence that of w, implies The arbitrariness of Q ir p M pj +  jrq Miq = 0, ∀i, j, r = 1, 2, 3. Multiply by ir k to obtain 0 = (δrr δ pk − δr p δr k )M pj + (δi j δkq − δiq δk j )Miq = 2Mk j + M jk − Mqq δ jk . Hence 2Mk j + M jk − Mqq δ jk = 0,

2M jk + Mk j − Mqq δ jk = 0,

the second relation being obtained by interchanging j and k. Now, taking the difference it follows M jk − Mk j = 0. Consequently we have M jk = 13 Mqq δ jk and hence M jk = λ δ jk . For a third-order tensor N, isotropy means Ni jk = Nijk = Q i p Q jq Q kr N pqr .

Appendix A : Notes on Vectors and Tensors

973

Consequently Ni jk = (δi p + Q˙ i p t + o(t))(δ jq + Q˙ jq t + o(t))(δkr + Q˙ kr t + o(t))N pqr whence it follows

Q˙ i p N pjk + Q˙ jq Niqk + Q˙ kr Ni jr = 0.

Substitution of Q˙ i p = iq p wq and the like for Q˙ jq and Q˙ kr results in isp N pjk ws +  jsq Niqk ws + ksr Ni jr ws = 0 and, by the arbitrariness of w, si p N pjk + s jq Niqk + skr Ni jr = 0. Upon multiplying by si h , s j h , skh it follows 2Nh jk + N j hk + Nk j h = Nqqk δh j + Nq jq δhk , 2Ni hk + Nhik + Nikh = Nqqk δhi + Niqq δhk , 2Ni j h + Nh ji + Ni h j = Nk jk δhi + Niqq δh j . Multiply these equations by δ jk , δik , and δi j , respectively. We find that Nhqq , Nqhq , and Nqqh vanish for any h = 1, 2, 3. Hence, by relabelling the indices, it follows that 2Nhik + Ni hk + Nki h = 0, 2Ni hk + Nhik + Nikh = 0, 2Ni hk + Nkhi + Nikh = 0. The second and third equations imply Nhik = Nkhi , the matrix N is invariant upon a cyclic permutation. Hence the second equation results in Ni hk = −Nikh . Accordingly, N is invariant under a cyclic permutation, changes sign when two indices are interchanged, and hence is zero if two indices are equal. This means that N is a multiple of the Levi-Civita (or permutation) symbol, Ni hk = μ i hk .

974

Appendix A : Notes on Vectors and Tensors

For a fourth-order tensor A, the isotropy condition is expressed by Ai j hk = Ai j hk = Q i p Q jq Q hr Q ks A pqr s . ˙ + o(t)) results in Substitution of Q = 1 + Qt Q˙ i p A pj hk + Q˙ j p Ai phk + Q˙ hp Ai j pk + Q˙ kp Ai j hp = 0, whence mi p A pj hk + m j p Ai phk + mhp Ai j pk + mkp Ai j hp = 0. Multiply this equation by miq , m jq , mhq , mkq to obtain 2 Ai j hk + A ji hk + Ah jik + Ak j hi = A pphk δi j + A pj pk δi h + A pj hp δik , 2 Ai j hk + A ji hk + Aih jk + Aikh j = A pphk δ ji + Ai ppk δ j h + Ai php δ jk , 2 Ai j hk + Ai jkh + Ah jik + Aih jk = A pj pk δhi + Ai ppk δh j + Ai j pp δhk , 2 Ai j hk + Ak j hi + Aikh j + Ai jkh = A pj hp δki + Ai php δk j + Ai j pp δkh , where q has been relabelled as i, j, h, k, respectively. Since A is an isotropic fourthorder tensor then any quantities with a pair of repeated indices, like A pphk , are the components of a second-order isotropic tensor. Accordingly we let A pphk = μ12 δhk , A pj pk = μ13 δ jk , A pj hp = μ14 δ j h , Ai ppk = μ23 δik , Ai php = μ24 δi h , Ai j pp = μ34 δi j . Upon substitution, subtraction of the third and fourth equations from the summation of the first and second equation result in A ji hk − Ai jkh = μ12 δhk δi j − μ34 δhk δi j . Letting i = j, h = k we find

μ12 = μ34 =: α.

Consequently, by subtracting the fourth equation from the first one we obtain A ji hk + Ah jik − Aikh j − Ai jkh = (μ13 − μ24 )δ jk δi h . Letting j = k = i = h we find μ13 = μ24 =: β.

Appendix A : Notes on Vectors and Tensors

975

We now subtract the second equation from the first one and use the condition μ13 = μ24 . It follows Ah jik + Ak j hi − Aih jk − Aikh j = (μ14 − μ23 )δ j h δik . Letting j = k = i = h we find μ14 = μ23 =: γ. Moreover we have A ji hk = Ai jkh ,

Ah jik = Aikh j .

(A.9)

In view of (A.9), the four conditions on A simplify to a single significant equation 2 Ai j hk + A ji hk + Ah jik + Ak j hi = λδhk δi j + μδ jk δi h + νδ j h δik .

(A.10)

Now let j hk → hk j and j hk → k j h to obtain 2 Ai hk j + Ahik j + Akhi j + A j hki = λδk j δi h + μδh j δik + νδhk δi j ,

(A.11)

2 Aik j h + Aki j h + A jkih + Ahk ji = λδ j h δik + μδkh δi j + νδk j δi h .

(A.12)

The sum of Eqs. (A.10)–(A.12) results in 2(Ai j hk + Ai hk j + Aik j h ) + 3(Ai jkh + Aikh j + Ahik j ) = (λ + μ + ν)(δhk δi j + δk j δi h + δ j h δik ).

(A.13)

The right-hand side is invariant under the interchange h ↔ k (as well as under i ↔ j). The requirement that so be the left-hand side and use of (A.9), namely Aki h j = Aik j h , Aih jk = Ahik j , yield 2(Ai j hk + Ai hk j + Aik j h ) + 3(Ai jkh + Aikh j + Ahik j ) = 2(Ai jkh + Aikh j + Ahik j ) + 3(Ai j hk + Ai hk j + Aik j h ). Hence letting X = Ai j hk + Ai hk j + Aik j h ,

Y = Ai jkh + Aikh j + Ahik j ,

we have 2X + 3Y = 3X + 2Y whence X = Y . Consequently (A.13) can be written in the form 5(Ai j hk + Ai hk j + Aik j h ) = (α + β + γ)(δhk δi j + δk j δi h + δ j h δik )

976

Appendix A : Notes on Vectors and Tensors

Upon substitution of Ai j hk + Ai hk j + Aik j h in (A.10) we find Ai j hk = λδi j δhk + μδi h δ jk + νδik δ j h

(A.14)

where λ=

1 (4α 10

− β − γ), μ =

1 (4β 10

− γ − α), ν =

1 (4γ 10

− α − β).

Equation (A.14) is the general representation of isotropic fourth-order tensors. Irrespective of the value of the parameters λ, μ, ν it follows that the major symmetry Ai j hk = Ahki j holds. The minor symmetries Ai j hk = A ji hk ,

Ai j hk = Ai jkh

hold if and only if μ = ν (and β = γ) whence Ai j hk = λδi j δhk + μ(δi h δ jk + δik δ j h ).

A.3

(A.15)

Differentiation

A scalar, vector, or tensor function w(t), t ∈ D ⊆ R, has a limit as t → t0 if ∀ > 0 ∃δ > 0 : 0 < |t − t0 | < δ =⇒ |w(t) − w0 | < , the symbol | · | denoting the absolute value of scalars or the norm of vectors; | · | is replaced by · if w is a tensor. We then write |w(t) − w0 | → 0 as t → t0 ,

lim w(t) = w0 .

t→t0

If the definition holds with w0 replaced by w(t0 ) we say that the function is continuous at t0 . If the scalar α, the vectors u, v, and the tensor S are continuous, at t, then so are αu, u · v, u × v, u ⊗ v, Sv. For instance, about u · v, u(t + h) · v(t + h) − u(t) · v(t) = [u(t + h) − u(t)] · v(t + h) + u(t) · [v(t + h) − v(t)], |u(t + h) · v(t + h) − u(t) · v(t)| ≤ |[u(t + h) − u(t)] · v(t + h)| + |u(t) · [v(t + h) − v(t)]| ≤ |u(t + h) − u(t)| |v(t + h)| + |u(t)| |v(t + h) − v(t)|.

Appendix A : Notes on Vectors and Tensors

977

Since |u(t + h) − u(t)| → 0, |v(t + h) − v(t)| → 0 while |v(t + h)| → |v(t)| then the continuity of u · v follows. Likewise, S(t + h)v(t + h) − S(t)v(t) = [S(t + h) − S(t)]v(t + h) + S(t)[v(t + h) − v(t)], |S(t + h)v(t + h) − S(t)v(t)| ≤ S(t + h) − S(t) |v(t + h)| + S(t) |v(t + h) − v(t)|.

Since S(t + h) − S(t) → 0 and |v(t + h) − v(t)| → 0 as h → 0 the continuity of Sv follows. ˙ The derivative w(t) is defined by the well-known limit of the difference quotient. ˙ and If w is the function, the derivative is denoted by w w(t + h) − w(t) or h→0 h

˙ w(t) = lim

  w(t + h) − w(t) ˙  = 0. lim  − w(t) h→0 h

Hence the derivative of a scalar, vector, or tensor is a scalar, vector, or tensor. Also, if w is a point function, the difference w(t + h) − w(t) is a vector and hence the derivative is a vector. Let u, v be differentiable vectors, α a differentiable scalar and S, T differentiable tensors. Following are some differentiation rules, u ˙· v = u˙ · v + u · v˙ ,

˙ v = u˙ × v + u × v˙ , u×

˙ v = u˙ ⊗ v + u ⊗ v˙ , u⊗

α˙v = α˙ v + α v˙ ,

˙ α˙S = α˙ S + α S,

S˙v = S˙ v + S v˙ ,

˙ T˙S = T˙ S + T S.

For definiteness we consider the difference quotient of Sv, at t, and find that, upon obvious rearrangements,   S(t + h) − S(t)   S(t + h)v(t + h) − S(t)v(t)  ˙ ˙  |v(t)| − (S(t)v(t) + S(t)˙v(t)) ≤  − S(t) h h  v(t + h) − v(t)   ˙ + S(t) |v(t + h) − v(t)| + S(t) − v˙ (t) h

Each term on the right-hand side approaches zero as h → 0, which leads us to the sought result. For constant vectors u, v and constant tensors S, T we have ˙ β v = αu ˙ αu + ˙ + βv,

˙ βT = αS ˙ αS + ˙ + βT.

978

Appendix A : Notes on Vectors and Tensors

Accordingly, relative to the fixed orthonormal basis {ei }, the derivative of vectors and tensors is given by the derivatives of the components, ˙ u(t) = u˙ i ei ,

˙ S(t) = S˙i j ei ⊗ e j .

In applications we often need the derivative of a determinant. Since det S = det S, we observe that for any matrix S, det S is a function of the entries Si j . If the entries Si j depend on t then det˙ S = ∂ Si j det S S˙i j . Now, ∂ Si j det S equals the cofactor of Si j and then ∂ Si j det S = det S S −1 ji .

(A.16)

As a consequence, ˙ −1 ˙ det˙ S = det S S −1 ji Si j = det S tr ( SS ). In tensor notation,

˙ −1 ). det˙ S = det S tr (SS

(A.17)

By the relation S−1 S = 1 for an invertible tensor S, the derivative results in ˙ S−1 S + S−1 S˙ = 0 whence

˙ S−1 = −S−1 S˙ S−1 .

An orthogonal tensor Q satisfies QT Q = QQT = 1. Upon differentiation it follows ˙ T. ˙ T = −QQ QQ Now, evaluating the transpose we have ˙ T. ˙ T )T = QQ (QQ ˙ T )T , whence ˙ T = −(QQ Hence it follows that QQ ˙ T ∈ Skw. QQ Functions of vectors and tensors Let U, W be finite-dimensional vector or tensor spaces and let g : U ⊇ D → W; if g is a function of the position vector x then U = E. The function g is differentiable

Appendix A : Notes on Vectors and Tensors

979

at y ∈ D if the difference g(y + h) − g(y) equals a linear function of h plus a o(|h|) term.4 Precisely, g is differentiable at y if there exists a linear operator G(y) : U → W such that g(y + u) = g(y) + G(y)h + o(|h|), or, equivalently, |g(y + h) − g(y) − G(y)h| → 0 as |h| → 0, where | · | has to be replaced by the norm · if tensors are involved. The linear term G(y)h is called strong (or Fréchet) differential of g at y and G(y) is called the strong (or Fréchet) derivative of g at y. If it exists, the linear operator G is unique. For, if two operators G1 and G2 exist then G1 (y)h + o(|h|) = G2 (y)h + o(|h|) implies G1 (y)h − G2 (y)h = o(|h|), which holds only if G1 (y) = G2 (y). For scalar-valued function, W = R, the definition of differentiability becomes α(y + h) = α(y) + a(y) · h + o(|h|). The limit dg(y, h) = lim

t→0

g(y + th) − g(y) , t

if it exists, is said to be the weak (or Gâteaux) differential of g at y in the direction of h. If dg(y, h) is linear in h then Dg(y) such that dg(y, h) = Dg(y)h is called the weak (or Gâteaux) derivative of g at y. Now, replace u by tu so that the strong differentiability gives g(y + th) = g(y) + tG(y)h + o(t). Hence it follows Dg(y) = G(y); strong differentiability implies weak differentiability and, moreover, the Fréchet derivative G(y) is equal to the Gâteaux derivative Dg(y). A Fréchet differentiable function is also continuous. Instead a Gâteaux differentiable function need not be Fréchet differentiable, neither need the function be continuous. Henceforth differentiability is meant as Fréchet differentiability. 4

The little-oh o(t) means a function such that o(t)/t → 0 as t → 0.

980

Appendix A : Notes on Vectors and Tensors

If U = V is the space of position vectors then the derivatives a and G are said to be the gradients5 of α and g, α(x + h) − α(x) = ∇α(x) · h + o(|h|), g(x + h) − g(x) = [∇g(x)]h + o(|h|). These definitions of gradients imply standard definitions in terms of partial derivatives. To show this, observe that α(x + hei ) − α(x) h→0 h

∂xi α(x) = lim and the like for g. Since

α(x + h) − α(x) − ∇α(x) · h →0 |h| as |h| → 0 then letting h = hei , so that |h| = |h| and multiplying by sgn h we find α(x + hei ) − α(x) − h∇α(x) · ei →0 h as h → 0. Hence α(x + hei ) − α(x) = ∂xi α(x). h→0 h

∇α(x) · ei = lim Likewise we obtain

[∇g(x)]ei = lim

h→0

g(x + hei ) − g(x) = ∂xi g(x). h

Accordingly, the component forms of ∇α and ∇g provide the conventional definitions, [∇g] ji = ∂xi g j . [∇α]i = ∂xi α, Based on the result for the gradient, of scalar and vector functions, we define the divergence and curl of a vector v and the divergence of a tensor T as follows ∇ · v = ∂xi vi ,

(∇ × v)i = i jk ∂x j vk ,

(∇ · T)i = ∂x j Ti j .

Sometimes, e.g. in [216], the gradient is denoted by grad, grad α and grad g, while ∇α and ∇g denote the gradients with respect to the position X in the reference configuration. Here, instead, we use ∇ and ∇ R for the gradients with respect to the spatial position x and the reference position X.

5

Appendix A : Notes on Vectors and Tensors

981

If a vector function, say v, is the gradient of a scalar function, v = ∇α, then6 ∇ · v = ∇ · ∇α =: α,

α =



2 i ∂xi α,

 being called the Laplacian. It often happens that we need to evaluate the curl of the curl. Now, [∇ × (∇ × v)]i = i jk ∂x j (∇ × v)k = i jk kpq ∂x j ∂x p vq  = (δi p δ jq − δiq δ j p )∂x j ∂x p vq = ∂xi ∂x j v j − j ∂x2j vi and hence ∇ × (∇ × v) = ∇(∇ · v) − v. If α is a scalar function of a tensor then α is said to be differentiable, at S, if α(S + H) − α(S) − (S) · H = o( H ). Let H = H ei ⊗ e j . Then α(S + H ei ⊗ e j ) − α(S) − H (S) · ei ⊗ e j = o(H ). Now,  · ei ⊗ e j =  pq e p ⊗ eq · ei ⊗ e j . Since e p ⊗ eq · ei ⊗ e j = tr (eq ⊗ e p )(ei ⊗ e j ) = δi p tr eq ⊗ e j = δi p δ jq . As a consequence,  · ei ⊗ e j = i j and, upon dividing by H , α(S + H ei ⊗ e j ) − α(S) − i j (S) → 0 H as H → 0. Accordingly, we can write  = ∂S α,

[∂S α]i j = ∂ Si j α.

Hence for differentiable functions α of vectors and tensors we have α(S + H) − α(S) = ∂S α · H + o( H ), 6

α(v + h) − α(v) = ∂v α · h + o(|h|).

The notation ∂xn stands for the n-th order derivative with respect to x.

982

Appendix A : Notes on Vectors and Tensors

We now look for the derivative of composite functions. For definiteness, let α(v(t)) the composite function. To determine the derivative with respect to t we observe α(v + h) − α(v) = ∂v α · h + o(|h|),

v(t + τ ) − v(t) = v˙ τ + o(τ ).

Letting h = v(t + τ ) − v(t) we have α(v(t + τ )) − v(t) = ∂v α · v˙ τ + o(τ ). Dividing by τ and taking the limit as τ → 0 we obtain ˙ α(v(t)) = ∂v α · v˙ . The analogue holds for the dependence on scalars and tensors and for vector- and tensor-valued functions. About functions of several variables, for definiteness look at α(u(t), v(t)). Since α(u + h, v + k) − α(u, v) = α(u + h, v + k) − α(u + h, v) + α(u + h, v)− α(u, v) then α(u + h, v + k) − α(u, v) = ∂v α(u + h, v) · k + ∂u α(u, v) · h + o(|k|) + o(|h|) and k = v(t + τ ) − v(t) = v˙ τ + o(τ ),

h = u(t + τ ) − u(t) = u˙ τ + o(τ ).

If also ∂v α(u + h, v) is continuous then we find that ˙ v) = ∂ α · u˙ + ∂ α · v˙ . α(u, u v

A.4

Integration

A line or curve C in space is a continuous mapping such that R  I → E , I being an interval. The mapping may be given by parametric equations, x = xˆ (ξ),

ξ ∈ I,

ξ being a parameter. Let I = [a, b]. The curve is simple if, for any two values ξ1 , ξ2 ∈ [a, b], it happens that xˆ (ξ1 ) = xˆ (ξ2 ). The curve is closed if xˆ (a) = xˆ (b). Assume xˆ (ξ) is differentiable and let xˆ  denote the derivative. The length of the curve C is given by b

L = ∫ |ˆx |(ξ)dξ. a

Appendix A : Notes on Vectors and Tensors

983

The vector x (ξ0 ) is (parallel to the) tangent to the curve at xˆ (ξ0 ). A curve is regular in [a, b] if |ˆx | > 0 in [a, b]. The length s(η) of the arc, as ξ ∈ [a, η], is then given by η

s(η) = ∫ |ˆx |(ξ)dξ. a

It then follows that s is differentiable and s  (η) = |ˆx |(η). If f is a continuous function along C we define the line integral of f on C as b

∫ f (ˆx(ξ)|ˆx (ξ)|dξ.

(A.18)

a

Since s(ξ) is invertible, by a change of variable we have b

sb

a

sa

∫ f (ˆx(ξ)|ˆx (ξ)|dξ = ∫ f˜(s)ds, where f˜(s) = f (ˆx(ξ(s))), sa = s(a), sb = s(b). The parametric representation xˆ (ξ) induces a positive sense on the curve. The symbol C is then meant as the curve in the selected positive sense. As a shorthand, the symbol ∫ f dl C

is used to mean the line integral (A.18). It may happen that the function f is the inner product between a vector field w and the unit tangent vector t = x /|x |, i.e. f = w · t. It follows that b

sb

a

sa

∫(w · t)(ξ)|x |(ξ)dξ = ∫ f˜(s)ds.  In the literature, the symbol C w · dx is also used to denote the integral of w · t along C. This quantity is defined by b

sb

a

sa

∫ w · dx := ∫ w(ˆx(ξ)) · xˆ  (ξ)dξ = ∫(w · t)(s)ds. C

Let D be an open connected set of the plane. A regular surface in E is the set of points x given by a mapping φ : D → E , of class C 1 , such that φ(u, v) is invertible for each (u, v) ∈ D and the rank of the jacobian matrix

984

Appendix A : Notes on Vectors and Tensors

⎤ ⎡ ∂ φ ∂ φ ∂(φ1 , φ2 , φ3 ) ⎣ u 1 v 1 ⎦ = ∂u φ2 ∂v φ2 ∂(u, v) ∂u φ3 ∂v φ3 equals 2. This means that the vectors ∂u φ, ∂v φ are not parallel to each other and hence ∂u φ × ∂v φ = 0. Let S be a regular surface in E determined by a function φ : D → E . If f : S → R then the integral of f over the surface S is defined by the double integral ∫ f da := ∫ f (φ(u, v))|∂u φ × ∂v φ|(u, v)du dv. S

D

Let n(u, v) be the unit normal to S (in the chosen direction). If w : S → V is a continuous vector field on S then the flux of n through s in the direction n is the integral ∫ w · n da. S

Hence, by definition, ∫ w · n da = ∫[w · ∂u φ × ∂v φ](u, v) du dv. S

D

 Given a region  ⊆ E we denote by  f (x)dv the volume (triple) integral of f in . If f is a divergence we have the following result. Theorem A.2 (Gauss, divergence) Let  be a bounded region with boundary ∂. Given a vector field w on  if n is the unit outward normal to ∂ then ∫ w · n da = ∫ ∇ · w dv 

∂

Two interesting consequences follow via appropriate choices of w. Let w = TT c, c being a constant vector field. Then c · ∫ Tn da = ∫ c · Tn da = ∫ (TT c) · n da = ∫ ∇ · (TT c)dv = c · ∫ ∇ · T dv, ∂

∂



∂

where (∇ · T)i := ∂x j Ti j . The arbitrariness of c allows us to conclude that ∫ T n da = ∫ ∇ · T dv.

∂





Appendix A : Notes on Vectors and Tensors

985

As a further consequence, letting T = α1 we find ∫ α n da = ∫ ∇α dv, 

∂

whose statement is often denoted as Green-Ostrogradskij theorem or gradient theorem. Moreover, letting α = 1 it follows ∫ n da = 0,

∂

the integral of the unit normal over a closed surface is zero. By applying the divergence theorem we can derive further identities, ∫ n · ∇ × w da = 0,

∂

∫ n × w da = ∫ ∇ × w dv, 

∂

∫ w ⊗ n da = ∫ ∇w dv.

∂



To show these identities we first observe that ∫ n · ∇ × w da = ∫ ∇ · (∇ × w)dv. 

∂

Since ∇ · (∇ × w) = i jk ∂xi ∂x j wk = 0 then the first identity follows. Now, since (n × w)i = i jk n j wk then ∫ (n × w)i da = ∫ ∂x j i jk wk dv = ∫(∇ × w)i dv

∂





and the second identity follows. Third, let c be a constant vector and consider (n ⊗ w)c = (c · w)n. Since ∇(c · w) = c · (∇w) we obtain ∫ (n ⊗ w)T da c = c ∫ ∇w dv = ∫(∇w)T dv c,

∂





whence the conclusion. The boundary ∂ S of a surface S is an oriented curve C; the positive sense of C = ∂ S is the one for which the region S remains on the left side as seen from the direction of the normal n. Theorem A.3 (Stokes) Let S be a surface in E so that the boundary ∂ S of S is a closed curve C oriented in the positive sense. Given a C 1 vector field on S then

986

Appendix A : Notes on Vectors and Tensors

∫ w · dx = ∫ n · ∇ × w da.

∂S

S

Here too appropriate choices of w lead to two further analogue theorems. Let w = Tc, c being a constant vector. By Stokes’ theorem b

∫ n · ∇ × (Tc)da = ∫ (Tc) · dx = c · ∫(Tx )(ξ)dξ. ∂S

S

a

Moreover, ∫ n · ∇ × (Tc)da = c p ∫ n i i jk ∂x j Tkp da = c · ∫(∇ × TT )n da S

S

S

where (∇ × TT ) pi := (∇ × T)i p := i jk ∂x j Tkp . By the arbitrariness of c, with this notation we can write ∫ T dx = ∫(∇ × TT )n da.

∂S

S

If now T = α1 we have (∇ × TT ) pi = i jk ∂x j αδkp =  pi j ∂x j α,

c p (∇ × TT ) pi n i = c · (n × ∇α).

Hence it follows that ∫ α dx = ∫(n × ∇α) da.

∂S

A.5

S

Harmonic Waves and Complex-Valued Functions

By appealing to Fourier analysis it is often claimed that no generality is lost by considering fields f(x, t) which vary sinusoidally in time as, e.g., f(x, t) = f0 (x) exp(−iωt) or, in the form of plane wave, f(x, t) = f0 exp[i(k · x − ωt)]. It is then asserted that the real part, or rather the real and the imaginary parts, of such expressions is to be taken to obtain physical quantities (e.g. [230], Sect. 7.1; [1], Sect. 1.6.2). The dependence on exp[i(k · x − ωt)], possibly with k ∈ C, is standard and denotes a plane wave propagating in the direction k with phase speed ω/|k|.

Appendix A : Notes on Vectors and Tensors

987

The dependence on x and t via the exponential is remarkably convenient in calculations. Mathematically though we need to check that the solution to the pertinent equations is not affected by the use of complex-valued functions. Let f(x, t) be a complex-valued vector function of x and t. We can then write f(x, t) = u(x, t) + iv(x, t), where u and v are real-valued vector functions. Since ∂x f(x, t) = ∂x u(x, t) + i∂x v(x, t),

∂t f(x, t) = ∂t u(x, t) + i∂t v(x, t),

then it follows that ∂x f(x, t) = ∂x f(x, t),

∂t f(x, t) = ∂t f(x, t).

Linear equations characterized by operators of the form E(∂x , ∂t ) := ai j ∂x j + b ∂t ,

ai j , b ∈ R,

satisfy {ai j ∂x j f + b ∂t f} = ai j ∂x j  f + b ∂t  f and the like for the imaginary part. Accordingly, E(∂x , ∂t )f(x, t) = 0

=⇒

E(∂x , ∂t ) f(x, t) = 0,

E(∂x , ∂t ) f(x, t) = 0.

This allows us to look for real-valued solutions  f(x, t), or  f(x, t), via the check that E(∂x , ∂t )f(x, t) = 0. Consider the equation ∞

P(t) = G0 E(t) + ∫ G(s)E(t − s)ds + A∂t E(t),

(A.19)

0

where G0 , G(s), A are second-order real-valued tensors. If E is taken in the form E(t) = E0 exp(−iωt), E0 being a complex-valued vector, then we expect that P(t) = P0 exp(−iωt),

P0 = χ(ω)E0 ,

χ(ω) being a complex-valued tensor. Let ∞

Gc (ω) := ∫ G(u) cos(ωu)du, 0



Gs (ω) := ∫ G(u) sin(ωu)du. 0

988

Appendix A : Notes on Vectors and Tensors

Upon substitution of E(t) = E0 exp(−iωt) in (A.19) it follows that χ(ω) = G0 + Gc (ω) + iGs (ω) + iωA. Instead, substitution of E(t) = E1 cos(ωt) + E2 sin(ωt) = E0 exp(−iωt),

E0 = E1 + iE2 ,

in (A.19) results in P(t) = (G0 + Gc (ω))[E1 cos(ωt) + E2 sin(ωt)] + (Gs (ω) + A)[E1 sin(ωt) − E2 cos(ωt)].

The same result follows by computing [χ(ω)(E1 + iE2 ) exp(−iωt)]. Accordingly, with linear operators of the form (A.19) the real value of P(t) coincides with the real part of χ(ω)(E1 + iE2 ) exp(−iωt). This does not mean that P = 0. Indeed, P(t) = (G0 + Gc (ω))(E2 cos ωt − E1 sin ωt) + (Gs (ω) + A)(E1 cos(ωt) + E2 sin(ωt)).

The recourse to complex-valued exponentials is customary in the investigation of plane wave propagation. An inhomogeneous plane wave is a function of the form w(x, t) = w0 exp[−k2 · x + i(k1 · x − ωt)]

(A.20)

where w0 is a complex vector. Let w0 = w1 e1 + iw2 e2 , e1 and e2 being perpendicular to k1 . We have w = exp(−k2 · x)[(w1 e1 cos(k1 · x − ωt) − w2 e2 sin(k1 · x − ωt)], w = exp(−k2 · x)[(w1 e1 sin(k1 · x − ωt) + w2 e2 cos(k1 · x − ωt)]. At an inhomogeneous plane wave the surfaces of constant amplitude are the planes perpendicular to k1 , the surfaces of constant phase are the planes perpendicular to k2 . At a fixed point in space we have w ∝ w1 e1 cos(ωt − ξ) + w2 e2 sin(ωt − ξ),

ξ = k1 · x.

If w1 w2 > 0 then, as time increases, the vector w rotates in time by sweeping around an ellipse, at the angular frequency ω, in the counter-clockwise (or righthanded) sense. Indeed, |w1 | and |w2 | are the semi-major and semi-minor axes of the ellipse. If instead w1 w2 < 0 then w sweeps around the same ellipse in the clockwise

Appendix A : Notes on Vectors and Tensors

989

(or left-handed) sense. The same conclusions follow if we look at w. Apparently if |w1 | = |w2 | then the ellipse becomes a circle. In that case the wave is said to be left (or right) circularly polarized7 if w1 w2 > 0 (or w1 w2 < 0). If instead w0 is real, say w0 = w1 e1 + w2 e2 , then w ∝ (w1 e1 + w2 e2 ) cos(ωt − ξ),

w ∝ (w1 e1 + w2 e2 ) sin(ωt − ξ).

The vectors w and w oscillate along the line making the angle α such that tan α = w2 /w1 . Then we say that the wave is linearly polarized. Consequently we can look for wave solutions in the complex form (A.20). Moreover, we conclude that the wave is linearly polarized if w0 is real and is elliptically polarized if w0 = w1 e1 + iw2 e2 ; in the counter-clockwise sense if w1 w2 > 0, in the clockwise sense if w1 w2 < 0. It is worth examining the geometric aspects of solutions of the form w = a1 e1 cos ωt + a2 e2 cos(ωt + δ), δ being the phase difference between the two oscillations. By a direct check we find that

w2 2

w1 2 w1 w2 −2 cos δ + = sin2 δ. a1 a1 a2 a2 By means of the variables X, Y related to w1 , w2 by w1 = X cos χ − Y sin χ,

w2 = X sin χ + Y cos χ

we obtain ([237], Sect. 6.4)

X/a)2 + Y/b)2 = 1,

where a 2 = 2(a1 a2 sin δ)2 /(a12 + a22 − α), b2 = 2(a1 a2 sin δ)2 /(a12 + a22 + α), α = (a14 + a24 + 2a12 a22 cos 2δ)1/2 . Except when δ = 0, π the composition of the two oscillations sweeps around an ellipse whose semi-major axis is rotated in the (e1 , e2 ) plane of the angle χ, tan 2χ = (2a1 a2 cos δ)/(a12 − a22 ). As δ = 0, π the vector w(t) describes a segment rotated of the angle χ = ± arctan a2 /a1 . If the argument of the two functions w1 , w2 is no longer the same then the advantage of the complex exponential is lost.

7

In the terminology of modern physics, left (right) circularly polarized waves are said to have positive (negative) helicity.

990

A.6

Appendix A : Notes on Vectors and Tensors

Fourier Transform

In dealing with materials with memory it is often convenient to make use of the Fourier transform. If f ∈ L 1 (R) the Fourier transform f F is defined as ∞

f F (ω) = ∫ exp(−iωt) f (t)dt. −∞

By definition, ∞



−∞

−∞

f F (ω) = ∫ cos(ωt) f (t)dt − i ∫ sin(ωt) f (t)dt. The real and imaginary parts are usually called full-range cosine and sine transforms of f . In applications we more frequently deal with ∞



f c (ω) := ∫ cos(ωt) f (t)dt,

f s (ω) := ∫ sin(ωt) f (t)dt,

0

0

called half-range cosine and sine transforms of f . The Fourier transform can be inverted in that, given f F we determine f . Precisely, if f ∈ L 1 (R) is piecewise continuous and has finite one-sided derivatives at possible discontinuities then ∞ 1 ∫ f F (ω) exp(iωt)dω. f (t) = 2π −∞

The following result8 is advantageous in passing from the time domain to the frequency domain. Theorem A.4 (Parseval–Plancherel) If f, g ∈ L 2 (R) then ∞

∫ | f (t)|2 (t)dt =

−∞



∫ f (t)g(t)dt =

−∞

8

See, e.g. [21].

1 2π

1 2π



∫ | f F (ω)|2 (ω)dω,

−∞ ∞

∫ f F (ω)g F (ω)dω.

−∞

References

1. 2. 3. 4. 5. 6. 7.

8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19.

J.D. Achenbach, Wave Propagation in Elastic Solids (North-Holland, Amsterdam, 1975) E.C. Aifantis, The physics of plastic deformation. Int. J. Plast. 3, 211–247 (1987) H. Alfvén, Existence of electromagnetic-hydrodynamic waves. Nature 150, 405–406 (1942) M.B. Allen, Numerical modelling of multiphase porous media. Adv. Water Resour. 8, 162–187 (1985) J.F. Allen, A.D. Misener, Flow phenomena in liquid helium II. Nature 142, 643–644 (1938) S.M. Allen, J.W. Cahn, A macroscopic theory for antiphase boundary motion and its application to antiphase domain coarsening. Acta Metall. 27, 1085–1095 (1979) A.I. Alsabery, M.A. Ismael, A.J. Chamkha, I. Hashim, Mixed convection of Al2 O3 -water nanofluid in a double lid-driven square cavity with a solid inner insert using Buongiorno’s two-phase model. Int. J. Heat Mass Transf. 119, 939–961 (2018) G. Amendola, M. Fabrizio, J.M. Golden, A linear memory model, in Thermodynamics of Materials with Memory (Springer, Boston, MA, 2012) H.W. Alt, I. Pawlow, A mathematical model of dynamics of non-isothermal phase separation. Physica D 59, 389–416 (1992) P.M. Anderson, Considerations on the flow of superfluid helium. Rev. Modern Phys. 38, 298–310 (1966) D.M. Anderson, G.B. McFadden, A.A. Wheeler, Diffuse-interface methods in fluid mechanics. Ann. Rev. Fluid Mech. 30, 139–165 (1998) E.C. Andrade, Viscosity of liquids. Nature 125, 309–310 (1930) D. Andrienko, Introduction to liquid crystals. J. Mol. Liq. 267, 520–541 (2018) M. Arcangioli, A. Farina, L. Fusi, G. Saccomandi, Constitutive equations with pressuredependent rheological parameters for describing ice creep. J. Glaciol. 65, 557–564 (2019) R. Aris, Vectors, Tensors, and the Basic Equations of Fluid Mechanics (Prentice Hall, Englewood Cliffs, NJ, 1962) M.F. Ashby, D.R.H. Jones, Engineering Materials 2 (Elsevier, Amsterdam, 2006). ((ch. 3)) A. Askar, P. Lee, A. Cakmak, Lattice-dynamics approach to the theory of elastic dielectrics with polarization gradient. Phys. Rev. B 1, 3525–3537 (1970) G. Astarita, G. Marrucci, Principles of Non-Newtonian Fluid Mechanics (McGraw Hill, New York, 1974) T. Atanackovic, I. Müller, A new form of the coherency energy in pseudoelasticity. Meccanica 30, 467–474 (1995)

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0

991

992

References

20. M. Avalle, G. Belingardi, R. Montanini, Characterization of polymeric structural foams under compressive impact loading by means of energy absorption diagram. Int. J. Impact Eng. 25, 455–472 (2001) 21. L. Baggett, W. Fulks, Fourier Analysis (Anjou Press, Boulder, 1979) 22. S. Bair, The temperature and pressure dependence of viscosity and volume for two reference liquids. Lubr. Sci. 28, 81–95 (2016) 23. F. Bampi, A. Morro, Gravity waves in water of variable depth. Nuovo Cimento 1C, 377–388 (1978) 24. F. Bampi, A. Morro, Korteweg-de Vries equation and nonlinear waves. Lett. Nuovo Cimento 26, 61–63 (1979) 25. F. Bampi, A. Morro, Water wave theories and variational principles. Nuovo Cimento 2C, 352–366 (1979) 26. F. Bampi, A. Morro, Objectivity and objective time derivatives in continuum physics. Found. Phys. 10, 905–920 (1980) 27. F. Bampi, A. Morro, The inverse problem of the calculus of variations applied to continuum physics. J. Math. Phys. 23, 2312–2321 (1982) 28. J. Bardeen, L. Cooper, J.R. Schreiffer, Theory of superconductivity. Phys. Rev. 108, 1175– 1189 (1957) 29. G.I. Barenblatt, I.P. Zheltov, I.N. Kochina, Basic concepts in the theory of seepage of homogeneous liquids in fissured rocks. J. Appl. Math. Mech. 24, 1286–1303 (1960) 30. H.A. Barnes, The yield stress - a review or ‘παντ α ρει- everything flows. J. Non-Newtonian Fluid Mech. 81, 133–178 (1999) 31. R. Becker, G. Heller, F. Sauter, Uber die Stromverteilung in einer supraleitenden Kugel. Zeit. Phys. 85, 772–787 (1933) 32. J.G. Bednorz, K.A. Müller, Possible high Tc superconductivity in the Ba-La-CuO system. Z. Phys. B 64, 189–193 (1986) 33. N. Bellomo, M. Lo Schiavo, Lecture Notes on Mathematical Theory of Generalized Boltzmann Models (World Scientific, Singapore, 2000) 34. I.L. Bekarevich, I.M. Khalatnikov, Phenomenological derivation of the equation of vortex motion in He II. Sov. Phys. JETP 13, 643–646 (1961) 35. E. Beltrami, Sull’interpretazione meccanica delle formole di Maxwell. Memorie Accad. Sci. Bologna 7, 1–38 (1886) 36. B. Bernstein, E.A. Kearsley, L.J. Zapas, A study of stress relaxations with finite strain. Trans. Soc. Rheol. 7, 391–410 (1963) 37. A. Berti, C. Giorgi, E. Vuk, Free energies in one-dimensional models of magnetic transitions with hysteresis. Nuovo Cimento B 125, 371–394 (2010) 38. A. Berti, C. Giorgi, E. Vuk, Hysteresis and temperature-induced transitions in ferromagnetic materials. Appl. Math. Model. 39, 820–837 (2015) 39. A. Berti, C. Giorgi, E. Vuk, Free energies and pseudo-elastic transitions for shape-memory alloys. Discrete Cont. Dyn. Syst. Sr. S 6, 293–316 (2013) 40. A. Berti, C. Giorgi, Derivation of the Landau-Lifshitz-Bloch equation from continuum thermodynamics. Physica B 500, 142–153 (2016) 41. V. Berti, M. Fabrizio, D. Grandi, Hysteresis and phase transitions for one-dimensional and three-dimensional models in shape memory alloys. J. Math. Phys. 51, 062901 (2010) 42. A. Bertram, Elasticity and Plasticity of Large Deformations: An Introduction (Springer, Berlin, 2008) 43. Z. Bi, R.F. Sekerka, Phase-field model of solidification of a binary alloy. Physica A 261, 95–106 (1998) 44. E.C. Bingham, An investigation of the laws of plastic flow. Bulle. Bur. Stand. 13(2), 309–353 (1916) 45. M.A. Biot, Mechanics of Incremental Deformation: Theory of Elasticity and Viscoelasticity of Initially Stressed Solids and Fluids, Including Thermodynamic Foundations and Applications to Finite Strain (Wiley, New York, 1965)

References

993

46. R.B. Bird, J.M. Wiest, Constitutive equations for polymeric liquids. Annu. Rev. Fluid Mech. 27, 169–193 (1995) 47. P. Biscari, G. Napoli, S. Turzi, Bulk and surface biaxiality in nematic liquid crystals. Phys. Rev. E 74, 031708 (2006) 48. A.C. Biswas, R.K. Pathria, Critical velocity in superfluid helium. J. Low Temp. Phys. 28, 151–156 (1977) 49. D.R. Bland, The Theory of Linear Viscoelasticity (Pergamon Press, Oxford, 1960) 50. F. Bloch, Nuclear induction. Phys. Rev. 70, 460–474 (1946) 51. D.B. Bogy, P.M. Naghdi, On heat conduction and wave propagation in rigid solids. J. Math. Phys. 11, 917–923 (1970) 52. A.A. Bokov, Z.-G. Ye, Recent progress in relaxor ferroelectrics with perovskite structure. J. Mater. Sci. 41, 31–52 (2006) 53. P.C. Bollada, P.K. Jimack, A.M. Mullis, Multiphase field modelling of alloy solidification. Comp. Mat. Sci. 171, 109085 (2020) 54. L. Boltzmann, Zur theorie der elastichen nachwirkung. Sitzgsber. Akad. Wiss. Wien 70, 275– 300 (1874) 55. L. Boltzmann, Zur Theorie der elastischen Nachwirkung. Wied. Ann. 5, 430–432 (1878) 56. A. Bonfanti, J.L. Kaplan, G. Charras, A. Kabla, Fractional viscoelastic models for power-law materials. Soft Matter 16, 6002–6020 (2020) 57. R. Borghesani, A. Morro, Thermodynamics and isotropy in thermal and electric conduction. Meccanica 4, 63–69 (1974) 58. R. Borghesani, A. Morro, Thermodynamic restrictions on thermoelectric, thermomagnetic and galvanomagnetic coefficients. Meccanica 4, 157–161 (1974) 59. J. Boussinesq, Théorie Analitique de la Chaleur (Gauthier-Villars, Paris, 1903) 60. R.M. Bowen, Theory of mixtures, in Continuum Physics III, ed. by A.C. Eringen (Academic Press, New York, 1976) 61. R.M. Bowen, Compressible porous media models by use of the theory of mixtures. Int. J. Eng. Sci. 20, 697–735 (1982) 62. R.M. Bowen, P.J. Chen, On the effects of diffusion and chemical reactions on the growth and decay of acceleration waves. Arch. Rational Mech. Anal. 48, 319–351 (1972) 63. R.M. Bowen, J.C. Wiese, Diffusion in mixtures of elastic materials. Int. J. Eng. Sci. 7, 689–722 (1969) 64. A.F. Bower, Applied Mechanics of Solids (CRC Press, Boca Raton, 2009) 65. P.W. Bridgman, The Nature of Thermodynamics (Harvard University Press, Cambridge, MA, 1961) 66. H.C. Brinkman, The viscosity of concentrated suspensions and solutions. J. Chem. Phys. 20, 571–581 (1952) 67. M. Brokate, J. Sprekels, Hysteresis and Phase Transitions (Springer, Berlin, 1996) 68. J.P. Broomfield, Corrosion of Steel in Concrete; Understanding, Investigation, and Repair (E. & F.N. Spon, London, 1997) 69. W.F. Brown Jr., Magnetoelastic Interactions, Springer Tracts in Natural Philosophy, vol. 9 (Springer, New York, 1966) 70. L. Brun, Ondes de choc finies dans les solides elastiques, in Mechanical Waves in Solids, ed. by J. Mandel, L. Brun (Springer, CISM-Udine, 1975) 71. J. Buongiorno, Convective transport in nanofluids. Trans. ASME 128, 240–250 (2006) 72. J.M. Burgers, Mechanical considerations - Model systems - Phenomenological theories of relaxation and viscosity, in First Report on Viscosity and Plasticity, ed. by J.M. Burgers (Nordemann Publishing, New York, 1939), ch. 1 73. J.W. Cahn, J.E. Hilliard, Free energy of a nonuniform system I. Interfacial energy. J. Chem. Phys. 28, 258–267 (1958) 74. J.C. Cahn, On spinodal decomposition. Acta Metall. 9, 795–801 (1961) 75. J.C. Cahn, J.E. Hilliard, Spinodal decomposition: a reprise. Acta Metall. 19, 151–161 (1971) 76. W. Cao, Constructing Landau-Ginzburg-Devonshire type models for ferroelectric systems based on symmetry. Ferroelectrics 375, 28–39 (2008)

994

References

77. W. Cao, L.E. Cross, Theory of tetragonal twin structures in ferroelectric perovskites with a first-order phase transition. Phys. Rev. B 44, 5–12 (1991) 78. M. Caputo, F. Mainardi, Linear models of dissipation in anelastic solids. La Rivista del Nuovo Cimento 1, 161–198 (1971) 79. S. Carillo, C. Giorgi, Non-classical memory kernels in linear viscoelasticity, in Viscoelastic and Viscoplastic Materials. ed. by F. Mohamed (Croatia, El-Amin (INTECH, 2016) 80. D.E. Carlson, Linear Thermoelasticity, in Encyclopedia of Physics, ed. by C. Truesdell, vol. VIa/2 (Springer, Berlin, 1972) 81. M.C. Calderer, Mathematical problems in liquid crystal flows. Math. Meth. Appl. Sci. 17, 171–188 (1994) 82. J.-M. Cardona, S. Forest, R. Sievert, Towards a theory of second grade thermoelasticity. Extracta Math. 14, 127–140 (1999) 83. C. Cattaneo, Sulla conduzione del calore. Atti Sem. Mat. Fis. Univ. Modena 3, 3–21 (1948) 84. G. Caviglia, A. Morro, Inhomogenous Waves in Solids and Fluids (World Scientific, Singapore, 1992) 85. G. Caviglia, A. Morro, B. Straughan, Thermoelasticity at cryogenic temperatures. Int. J. Non-Lin. Mech. 27, 251–263 (1992) 86. S.-K. Chan, Steady-state kinetics of diffusionless first-order phase transformations. J. Chem. Phys. 67, 5755–5762 (1977) 87. S. Chapman, T.G. Cowling, The Mathematical Theory of Non-Uniform Gases (Cambridge University Press, 1970), p. 363 88. P.J. Chen, Growth and decay of waves in solids, in Encyclopedia of Physics, vol. VIa/3, ed. by C. Truesdell (Springer, Berlin, 1973) 89. J. Ciambella, A. Paolone, S. Vidoli, Memory decay rates of viscoelastic solids: not too slow, but not too fast either. Rheol. Acta 50, 661–674 (2011) 90. M. Ciarletta, B. Straughan, V. Tibullo, Explosive instabilities for a generalized second grade fluid. J. Math. Anal. Appl. 432, 945–953 (2015) 91. H. Cohen, C.-C. Wang, On the response and symmetry of elastic materials with internal constraints. Arch. Rational Mech. Anal. 99, 1–36 (1987) 92. B.D. Coleman, W. Noll, The thermodynamics of elastic materials with heat conduction and viscosity. Arch. Rational Mech. Anal. 13, 167–178 (1963) 93. B.D. Coleman, Thermodynamics of materials with memory. Arch. Rational Mech. Anal. 17, 1–46 (1964) 94. B.D. Coleman, V.J. Mizel, Existence of caloric equations of state in thermodynamics. J. Chem. Phys. 40, 1116–1125 (1964) 95. B.D. Coleman, On thermodynamics, strain impulses, and viscoelasticity. Arch. Rational Mech. Anal. 17, 230–254 (1964) 96. B.D. Coleman, E.H. Dill, On the thermodynamics of electromagnetic fields in materials with memory. Arch. Rational Mech. Anal. 41, 132–162 (1971) 97. B.D. Coleman, E.H. Dill, Thermodynamics restrictions on the constitutive equations of electromagnetic theory. ZAMP 22, 691–702 (1971) 98. B.D. Coleman, M.E. Gurtin, Thermodynamics with internal state variables. J. Chem. Phys. 47, 597–613 (1967) 99. B.D. Coleman, V.J. Mizel, Norms and semi-groups in the theory of fading memory. Arch. Rational Mech. Anal. 23, 87–123 (1967) 100. B.D. Coleman, V.J. Mizel, On the general theory of fading memory. Arch. Rational Mech. Anal. 29, 18–31 (1968) 101. B.D. Coleman, W. Noll, Foundations of linear viscoelasticity. Rev. Mod. Phys. 33, 239–249 (1961) 102. B.D. Coleman, M.L. Hodgdon, On a class of constitutive relations for ferromagnetic hysteresis. Arch. Rational Mech. Anal. 99, 375–396 (1987) 103. B.D. Coleman, M.L. Hodgdon, A constitutive relation for rate-independent hysteresis in ferromagnetically soft materials. Int. J. Eng. Sci. 24, 897–919 (1986)

References

995

104. M. Conti, V. Danese, C. Giorgi, V. Pata, A model of viscoelasticity with time-dependent memory kernels. Am. J. Math. 140, 349–389 (2018) 105. M. Corcione, M. Cianfrini, A. Quintino, Two-phase mixture modeling of natural convection of nanofluids with temperature-dependent properties. Int. J. Therm. Sci. 71, 182–195 (2013) 106. E. Cosserat, F. Cosserat, Théorie des Corps Déformables (Hermann, Paris, 1909) 107. B.A. Cotter, R.S. Rivlin, Tensors associated with time-dependent stress. Quart. Appl. Math. 13, 177–182 (1955) 108. S.C. Cowin, Bone poroelasticity. J. Biomech. 32, 217–238 (1999) 109. N.F. Cramer, The Physics of Alfvén Waves (Wiley, New York, 2001) 110. F.H. Crawford, Heat, Thermodynamics, and Statistical Physics (Rupert Hart-Davis, London, 1963) 111. C.M. Dafermos, Asymptotic stability in viscoelasticity. Arch. Rational Mech. Anal. 37, 297– 308 (1970) 112. J.S. Dahler, L.E. Scriven, Theory of structured continua I. General consideration of angular momentum and polarization. Proc. R. Soc. Lond. A 275, 504–527 (1963) 113. W.A. Day, Restrictions on relaxation functions in linear viscoelasticity. Quart. J. Mech. Appl. Math. 24, 487–497 (1971) 114. W.A. Day, The Thermodynamics of Simple Materials with Fading Memory (Springer, Berlin, 1972) 115. P.G. de Gennes, An analogy between superconductivity and smectics A. Solid State Commun. 10, 753–756 (1972) 116. P.G. de Gennes, J. Prost, The Physics of Liquid Crystals (Clarendon Press, Oxford, 1998) 117. S.R. de Groot, P. Mazur, Non-Equilibrium Thermodynamics (Dover, New York, 1984) 118. A. De Simone, P. Podio Guidugli, On the continuum theory of deformable ferromagnetic solids. Arch. Rational Mech. Anal. 136, 201–233 (1996) 119. E.A. de Souza Neto, D. Peric, D.R.J. Owen, Computational Methods for Plasticity: Theory and Applications (Wiley, New York, 2008); Appendix A 120. P. Debye, Einige Bemerkungen zur Magnetisierung bei tiefer Temperatur. Annalen der Physik 81, 1154 (1926) 121. J.A. Dean, N.A. Lange, Lange’s Handbook of Chemistry, 15th edn. (McGraw-Hill, 1999), p. 1436 122. G. Del Piero, L. Deseri, On the analytic expression of the free energy in linear viscoelasticity. J. Elast. 43, 247–278 (1996) 123. G. Del Piero, L. Deseri, On the concepts of state and free energy in linear viscoelasticity. Arch. Rational Mech. Anal. 138, 1–35 (1997) 124. F. Dell’Oro, V. Pata, On the Moore-Gibson-Thompson equation and its relation to linear viscoelasticity. Appl. Math. Optim. 76, 641–655 (2017) 125. K.G. Denbigh, The Principles of Chemical Equilibrium (Cambridge University Press, Cambridge, 1981) 126. Q. Deng, M. Kammoun, A. Erturk, P. Sharma, Nanoscale flexoelectric energy harvesting. Int. J. Solids Struct. 51, 3218–3225 (2014) 127. W. Desch, R. Grimmer, Singular relaxation moduli and smoothing in three-dimensional viscoelasticity. Trans. Am. Math. Soc. 314, 381–404 (1989) 128. L. Deseri, M. Fabrizio, J.M. Golden, The concept of a minimal state in viscoelasticity: new free energies and applications to PDEs. Arch. Rational Mech. Anal. 181, 43–96 (2006) 129. L. Deseri, G. Gentili, J.M. Golden, An explicit formula for the minimum free energy in linear viscoelasticity. J. Elast. 54, 141–185 (1999) 130. L. Deseri, M. Zingales, A mechanical picture of fractional-order Darcy equation. Commun. Nonlinear Sci. Numer. Simulat. 20, 940–949 (2015) 131. M. Destrade, G. Saccomandi, M. Vianello, Proper formulation of viscous dissipation for nonlinear solids. J. Acoust. Soc. Am. 133, 1255–1259 (2013) 132. A.F. Devonshire, Theory of barium titanate. Phil. Mag. 40, 1040–1063 (1949) 133. A.F. Devonshire, Theory of ferroelectrics. Adv. Phys. 3, 85–130 (1954)

996

References

134. R.J. Donnelly, The two-fluid theory and second sound in liquid helium. Phys. Today 62(10), 34–39 (2009) 135. A. Dorfmann, R.W. Ogden, Nonlinear magnetoelastic deformations. Q. J. Mech. Appl. Math. 57, 599–622 (2004) 136. A. Dorfmann, R.W. Ogden, Nonlinear electroelasticity. Acta Mech. 174, 167–183 (2005) 137. L. Dorfmann, R.W. Ogden, Nonlinear electroelasticity: material properties, continuum theory and applications. Proc. Roy. Soc. A 473, 1–18 (2017) 138. D.S. Drumheller, A. Bedford, A thermodynamic theory for reacting immiscible mixtures. Arch. Rational Mech. Anal. 73, 257–284 (1980) 139. J.E. Dunn, R.L. Fosdick, Thermodynamics, stability, and boundedness of fluids of complexity 2 and fluids of second grade. Arch. Rational Mech. Anal. 56, 191–252 (1974) 140. J.E. Dunn, K.R. Rajagopal, Fluids of differential type: critical review and thermodynamic analysis. Int. J. Eng. Sci. 33, 689–729 (1995) 141. J.E. Dunn, J. Serrin, On the thermodynamics of interstitial working. Arch. Rational Mech. Anal. 88, 96–133 (1985) 142. A. Einstein, W.J. de Haas, Experimental proof of the existence of Ampère’s molecular currents. Koninklijke Akademie van Wetenschappen te Amsterdam, Proceedings 18, 696–711 (1915) 143. A. Engel, R. Friedrichs, On the electromagnetic force on a polarizable body. Am. J. Phys. 70, 428–435 (2002) 144. J.L. Ericksen, Hydrostatic theory of liquid crystals. Arch. Rational Mech. Anal. 9, 371–378 (1962) 145. J.L. Ericksen, Liquid crystals with a variable degree of orientation. Arch. Rational Mech. Anal. 113, 97–120 (1991) 146. A.C. Eringen, Mechanics of Continua (Wiley, New York, 1967) 147. A.C. Eringen, A unified continuum theory of electrodynamics of liquid crystals. Int. J. Eng. Sci. 35, 1137–1157 (1997) 148. A. Eringen, Microcontinuum Field Theories: I (Foundations and Solids (Springer, New York, 1999) 149. A.C. Eringen, Micropolar mixture theory of porous media. J. Appl. Phys. 94, 4184–4190 (2003) 150. A.C. Eringen, C.B. Kafadar, Polar field theories, in Continuum Physics IV, ed. by A.C. Eringen (Academic, New York, 1976) 151. A.C. Eringen, G.A. Maugin, Electrodynamics of Continua I (Foundations and Solid Media (Springer, Berlin, 1990) 152. J.D. Eshelby, The determination of the elastic field of an ellipsoidal inclusion, and related problems. Phil. Trans. R. Soc. Lond. A 241, 376–396 (1957) 153. J.D. Eshelby, The elastic energy-momentum tensor. J. Elast. 5, 321–335 (1975) 154. M. Fabrizio, C. Giorgi, A. Morro, A thermodynamic approach to ferromagnetism and phase transitions. Int. J. Eng. Sci. 47, 821–839 (2009) 155. M. Fabrizio, C. Giorgi, A. Morro, Modeling of heat conduction via fractional derivatives. Heat Mass Transf. 53, 2785–2797 (2017) 156. M. Fabrizio, A. Morro, Viscoelastic relaxation functions compatible with thermodynamics. J. Elast. 19, 63–75 (1988) 157. M. Fabrizio, A. Morro, Mathematical Problems in Linear Viscoelasticity (SIAM, Philadelphia, 1992) 158. M. Fabrizio, C. Giorgi, A. Morro, Internal dissipation, relaxation property, and free energy in materials with fading memory. J. Elast. 40, 107–122 (1995) 159. M. Fabrizio, C. Giorgi, A. Morro, Solidification and separation in saline water. Discrete Contin. Dyn. Syst. Ser. S 9, 139–155 (2016) 160. M. Fabrizio, C. Giorgi, V. Pata, A new approach to equations with memory. Arch. Rational Mech. Anal. 198, 189–232 (2010) 161. M. Fabrizio, J.M. Golden, Maximum and minimum free energies for a linear viscoelastic material. Q. Appl. Math. 60, 341–381 (2002)

References

997

162. M. Fabrizio, A. Morro, Electromagnetism of Continuous Media (Oxford Science Publications, Oxford, 2003) 163. M. Fabrizio, A. Morro, Thermodynamics and second sound in a two-fluid model of helium II; revisited. J. Non-Equilib. Thermodyn. 208, 69–84 (2003) 164. R.M. Fano, L.J. Chu, R.B. Adler, Electromagnetic Fields, Energy, and Forces (Wiley, New york, 1960) 165. S.H. Faria, G.M. Kremer, A constitutive theory for ferrofluids. Cont. Mech. Thermodyn. 11, 297–306 (1999) 166. A. Favre, Statistical equations of turbulent gases, in Problems of Hydrodynamics and Continuum Mechanics (SIAM, Philadelphia, 1969), pp.231–266 167. R.P. Feynman, R.B. Leighton, M. Sands, The Feynman Lectures on Physics (Addison, Reading Massachusetts, 1966) 168. E. Fermi, Thermodynamics (Dover, New York, 1956) 169. A. Fick, Über diffusion. Annalen der Physik 170, 59–86 (1855) 170. R.L. Fosdick, K.R. Rajagopal, Thermodynamics and stability of fluids of third grade. Proc. R. Soc. Lond. A 339, 351–377 (1980) 171. F. Franchi, B. Lazzari, R. Nibbi, B. Straughan, Uniqueness and decay in local thermal nonequilibrium double porosity thermoelasticity. Math. Meth. Appl. Sci. 41, 6763–6771 (2018) 172. V. Francois-Lavet, F. Henrotte, L. Stainier, L. Noels, An energy-based variational model of ferromagnetic hysteresis for finite element computations. J. Comput. Appl. Math. 246, 243– 250 (2013) 173. F.C. Frank, On the theory of liquid crystals. Discuss. Faraday Soc. 25, 19–28 (1958) 174. M. Frémond, Adhérence des solides. J. Méc. Théor. Appl. 6, 383–407 (1987) 175. M. Frémond, B. Nedjar, Damage, gradient of damage and principle of virtual power. Int. J. Solids Struct. 33, 1083–1103 (1996) 176. M. Frémond, Phase Change in Mechanics (Springer, Berlin, 2012); ch. 4 177. E. Fried, M.E. Gurtin, Continuum theory of thermally induced phase transitions based on an order parameter. Physica D 68, 326–343 (1993) 178. E. Fried, M.E. Gurtin, Dynamic solid-solid phase transitions with phase characterized by an order parameter. Physica D 72, 287–308 (1994) 179. E. Fried, M.E. Gurtin, Tractions, balances, and boundary conditions for nonsimple materials with application to liquid flow at small-length scales. Arch. Rational Mech. Anal. 182, 513– 554 (2006) 180. Y.C. Fung, Biodynamics (Circulation (Springer, New York, 1984) 181. R.E. Gaidos, R. Darby, Numerical simulation and change in type in the developing flow of a nonlinear elastic fluid. J. Non-Newton. Fluid Mech. 29, 59–79 (1988) 182. F.R. Gantmacher, The Theory of Matrices, vol. 1 (Chelsea, New York, 1959) 183. F.R. Gantmacher, The Theory of Matrices, vol. 2 (Chelsea, New York, 1974) 184. D.A. Garanin, Fokker-Planck and Landau-Lifshitz-Bloch equations for classical ferromagnets. Phys. Rev. B 55, 3050–3057 (1997) 185. S. Gatti, C. Giorgi, V. Pata, Navier-Stokes limit of Jeffreys type flows. Physica D 203, 55–79 (2005) 186. G. Gentili, Regularity and stability for a viscoelastic material with a singular memory kernel. J. Elast. 37, 139–156 (1995) 187. G. Gentili, C. Giorgi, A new model for rate-independent hysteresis in permanent magnets. Int. J. Eng. Sci. 39, 1057–1090 (2001) 188. P. Germain, Mécanique des Milieux Continus (Masson, Paris, 1962) 189. W.F. Giauque, A thermodynamic treatment of certain magnetic effects. A proposed method of producing temperatures considerably below 1o absolute. J. Am. Chem. Soc. 49, 1864–1870 (1927) 190. T.L. Gilbert, A Lagrangian formulation of the gyromagnetic equation of the magnetization field. Phys. Rev. B 100, 1243 (1955) [Abstract only; full report, Armor Research Foundation Project No. A059, Supplementary Report, May 1, 1956] (unpublished)

998

References

191. T.L. Gilbert, A phenomenological theory of damping in ferromagnetic materials. IEEE Trans. Magn. 40, 3443–3449 (2004) 192. V.L. Ginzburg, Superconductivity: the day before yesterday, yesterday, today, and tomorrow. J. Supercond. 13, 665–677 (2000) 193. V.L. Ginzburg, L.D. Landau, On the theory of superconductivity, Zh. Eksp. Teor. Fiz. 20, 1064–1082 (1950); English translation in Collected papers of L.D. Landau, ed. D. ter Haar (Pergamon, Oxford, 1965), pp. 546–568 194. C. Giorgi, Phase-field models for transition phenomena in materials with hysteresis. Discrete Cont. Dyn. Syst. Ser. S 8, 693–722 (2015) 195. C. Giorgi, A. Morro, Extremum principles for viscoelastic fluids. Int. J. Eng. Sci. 29, 807–817 (1991) 196. C. Giorgi, A. Morro, Viscoelastic solids with unbounded relaxation function. Contin. Mech. Thermodyn. 4, 151–165 (1992) 197. C. Giorgi, J.E. Muñoz Rivera, V. Pata, Global attractors for a semilinear hyperbolic equation in viscoelasticity. J. Math. Anal. Appl. 260, 83–99 (2001) 198. J.M. Golden, Free energies in the frequency domain: the scalar case. Q. Appl. Math. 58, 127–150 (2000) 199. C.J. Gorter, H. Casimir, On supraconductivity I. Physica 1, 306–320 (1934) 200. P.-A. Gourdain, The impact of the Hall term on tokamak plasmas, arXiv:1703.00987 physics.plasm.-ph 201. D. Graffi, Sui problemi dell’ereditarietà lineare. Nuovo Cimento A 5, 53–71 (1928) 202. D. Graffi, Sull’espressione dell’energia libera nei materiali viscoelastici lineari. Ann. Mat. Pura Appl. 98, 273–279 (1974) 203. D. Graffi, Sull’espressione analitica di alcune grandezze termodinamiche nei materiali con memoria. Rend. Sem. Mat. Univ. Padova 68, 17–29 (1982) 204. D. Graffi, On the fading memory. Appl. Anal. 15, 295–311 (1983) 205. A.E. Green, P.M. Naghdi, A dynamical theory of interacting continua. Int. J. Eng. Sci. 3, 231–263 (1965) 206. A.E. Green, P.M. Naghdi, A general theory of an elastic-plastic continuum. Arch. Rational Mech. Anal. 18, 251–281 (1965) 207. A.E. Green, P.M. Naghdi, A derivation of equations for wave propagation in water of variable depth. J. Fluid Mech. 78, 237–246 (1976) 208. A.E. Green, P.M. Naghdi, On thermodynamics and the nature of the second law. Proc. R. Soc. Lond. A 357, 253–270 (1977) 209. A.E. Green, P.M. Naghdi, A re-examination of the basic postulates of thermomechanics. Proc. R. Soc. Lond. A 432, 171–194 (1991) 210. A.E. Green, P.M. Naghdi, Thermoelasticity without energy dissipation. J. Elast. 31, 189–208 (1993) 211. G. Grioli, Mathematical Theory of Elastic Equilibrium (Springer, Berlin, 1962) 212. E.P. Gross, Hydrodynamics of a superfluid condensate. J. Math. Phys. 4, 195–207 (1963) 213. A.E. Green, P.M. Naghdi, J.A. Trapp, Thermodynamics of a continuum with internal constraints. Int. J. Eng. Sci. 8, 891–902 (1970) 214. M.E. Gurtin, Generalized Ginzburg-Landau and Cahn-Hilliard equations based on a microforce balance. Physica D 92, 178–192 (1996) 215. M.E. Gurtin, E. Fried, L. Anand, The Mechanics and Thermodynamics of Continua (Cambridge University Press, 2010) 216. M.E. Gurtin, A.C. Pipkin, A general theory of heat conduction with finite wave speeds. Arch. Rational Mech. Anal. 31, 113–126 (1968) 217. M.E. Gurtin, P. Podio Guidugli, The thermodynamics of constrained materials. Arch. Rational Mech. Anal. 51, 192–208 (1973) 218. M.E. Gurtin, K. Spear, On the relationship between the logarithmic strain rate and the stretching tensor. Int. J. Solids Struct. 19, 437–444 (1983) 219. D. Gutierrez-Lemini, Engineering Viscoelasticity (Springer, New York, 2014)

References

999

220. H.E. Hall, W.F. Vinen, The rotation of liquid helium II. The theory of mutual friction in uniformly rotating helium II. Proc. Roy. Soc. A 238, 204–214 (1956) 221. S. Hassanpour, G.R. Heppler, Micropolar elasticity theory: a survey of linear isotropic equations, representative notations, and experimental investigations. Math. Mech. Solids 22, 224– 242 (2017) 222. T. Hayat, Y. Wang, K. Hutter, Flow of a fourth grade fluid. Math. Mod. Meth. Appl. Sci. 12, 797–811 (2002) 223. S. Helgason, Differential Geometry, Lie Groups, and Symmetric Spaces (Academic Press, New york, 1978) 224. H. Hsiao, O.T. Bruhns, A.T.M. Meyers, Strain rates and material spins. J. Elast. 52, 1–41 (1998) 225. R. Klages, G. Radons, I.M. Sokolov, Anomalous Transport: Foundations and Applications (Wiley, Weinheim, 2008) 226. V.L. Idenbom, E.B. Loginov, M.A. Osipov, The flexoelectric effect and the structure of crystals. Sov. Phys. Crystallogr. 26, 656–658 (1981) 227. F. Irgens, Continuum Mechanics (Springer, New York, 2008) 228. M. Itskov, Tensor Algebra and Tensor Analysis for Engineers with applications to Continuum Mechanics (Springer, Berlin, 2009) 229. J.D. Jackson, Classical Electrodynamics (Wiley, New York, 1975) 230. B. Jayawardhana, R. Ouyang, V. Andrieu, Dissipativity of general Duhem hysteresis models, in Proceedings of the 50th IEEE Conference on Decision and Control and European Control Conference (CDC-ECC) (New York, 2011), pp. 3234–3239 231. J.T. Jenkins, A theory of magnetic fluids. Arch. Rational Mech. Anal. 46, 42–60 (1972) 232. X. Jiang, W. Huang, S. Zhang, Flexoelectric nano-generator: materials, structures and devices. Nano Energy 2, 1079–1092 (2013) 233. C.S. Jog, A concise proof of the representation theorem for fourth-order isotropic tensors. J. Elast. 85, 119–124 (2006) 234. C.C. Johnson, Field and Wave Electrodynamics (McGraw-Hill, New York, 1965) 235. M. Johnson, D. Segalman, A model for viscoelastic fluid behavior which allows non-affine deformations. J. Non-Newton. Fluid Mech. 2, 255–270 (1977) 236. D.S. Jones, Acoustic and Electromagnetic Waves (Clarendon, Oxford, 1986) 237. S.H.S. Joodat, K.B. Nakshatrala, R. Ballarini, Modeling flow in porous media with double porosity/permeability: a stabilized mixed formulation, error analysis, and numerical solutions. Comp. Methods Appl. Mech. Eng. 337, 632–676 (2018) 238. P.M. Jordan, Growth and decay of acoustic acceleration waves in Darcy-type porous media. Proc. R. Soc. A 461, 2749–2766 (2005) 239. P.M. Jordan, Second-sound phenomena in inviscid, thermally relaxing gases. Discrete Contin. Dyn. Syst. Ser. B 19, 2189–2205 (2014) 240. P.M. Jordan, Second-sound propagation in rigid, nonlinear conductors. Mech. Res. Commun. 68, 52–79 (2015) 241. D.D. Joseph, L. Preziosi, Heat waves. Rev. Mod. Phys. 61, 41–73 (1989) 242. B.D. Josephson, Possible new effects in superconducting tunneling. Phys. Lett. 1, 251–253 (1962) 243. D.D. Joseph, Fluid Dynamics of Viscoelastic Liquids (Springer, New York, 1990) 244. D. Jou, J. Casas-Vázquez, G. Lebon, Extended Irreversible Thermodynamics (Springer, Berlin, 2010) 245. M. Kachanov, Continuum model of medium with cracks. J. Eng. Mech. 106, 1039–1051 (1980) 246. B. Kaltenbacher, I. Lasiecka, M.K. Pospieszalska, Well-posedness and exponential decay of the energy in the nonlinear Jordan-Moore-Gibson-Thompson equation arising in high intensity ultrasound. Math. Models Methods Appl. Sci. 22, 1250035 (2012) 247. M. Kamlah, Ferroelectric and ferroelastic piezoceramics - modeling of electromechanical hysteresis phenomena. Cont. Mech. Thermodyn. 13, 219–268 (2001)

1000

References

248. K. Kannan, K.R. Rajagopal, A model for the flow of rock glaciers. Int. J. Non-Lin. Mech. 48, 59–64 (2013) 249. P. Kapitza, Viscosity of liquid helium below the λ-point. Nature 141, 74–76 (1938) 250. J.I. Kaplan, M.L. Glasser, Effective magnetic field in metals. Phys. Rev. 170, 649–650 (1968) 251. A. Karma, Phase-field formulation for quantitative modeling of alloy solidification. Phys. Rev. Lett. 87, 115701 (2001) 252. G. Karniadakis, A. Beskok, N. Aluru, Microflows and Nanoflows (Fundamentals and Simulation (Springer, Berlin, 2005) 253. K.H. Khanafer, K. Vafai, M. Lightstone, Buoyancy-driven heat transfer enhancement in a two-dimensional enclosure utilizing nanofluids. Int. J. Heat Mass Transf. 46, 3639–3653 (2003) 254. A. Kholmetskii, O. Missevitch, T. Yarman, Electric/magnetic dipole in an electromagnetic field: force, torque and energy. Eur. Phys. J. Plus 129, 215 (2014) 255. C. Kittel, Introduction to Solid State Physics (Wiley, New York, 1996) 256. R. Kohlrausch, Nachtrag fiber die Elastische Nachwirlmng beim Cocon- und Glasfaden, und die Hygroskopische Eigenschaft des Ersteren. Pogg. Ann. Phys. 72(11), 393–425 (1847) 257. D.J. Korteweg, Sur la forme que prennent les équations du mouvement des fluides si l’on tient compte des forces capillaires causées par des variations de densité considérables mais continues et sur la théorie de la capillarité dans l’hypothèse d’une variation continue de la densité. Arch. Neerl. Sci. Exactes Ser. II(6), 1–24 (1901) 258. N.E. Kotchine, Sur la théorie des ondes de choc dans un fluide. Rend. Circ. Mat. Palermo 50, 305–344 (1926) 259. A. Kovetz, Electromagnetic Theory (Oxford University Press, 2000) 260. N.A. Krall, A.W. Trivelpiece, Principles of Plasma Physics (McGraw-Hill, New York, 1973). ((ch. 3)) 261. E. Kröner, Allgemeine kontinuumstheorie der versetzungen und eigenspannungen. Arch. Rational Mech. Anal. 4, 273–334 (1960) 262. B. Kutschan, K. Morawetz, S. Gemming, Modeling and morphogenesis of brine channels in sea ice. Phys. Rev. E 81, 036106 (2010) 263. R. Lakes, K.W. Wojciechowski, Negative compressibility, negative Poisson’s ratio, and stability. Physica Status Solidi (b) 245, 545–551 (2008) 264. L.D. Landau, Zur Theorie der Phasenumwandlungen. Phys. Z. Sowjet. 11, 26–35 (1937) 265. L.D. Landau, Unbranched model of intermediate state. Phys. Z. Sowjet. 11, 129–138 (1937) 266. L.D. Landau, The theory of superfluidity of helium II. J. Phys. USSR 5, 71–73 (1941) 267. L.D. Landau, The theory of superfluidity of helium II. Phys. Rev. 60, 356–358 (1941) 268. L.D. Landau, I.M. Khalatnikov, On the theory of superconductivity, in Collected Papers of L.D. Landau, ed. by D. Ter Haar (Pergamon, Oxford, 1965) 269. L.D. Landau, E.M. Lifshitz, On the theory of dispersion of magnetic permeability in ferromagnetic bodies. Phys. Z. Sowjetunion 8, 135–169 (1935) 270. L.D. Landau, E.M. Lifshitz, Statistical Physics (Pergamon Press, Oxford, 1959) 271. L.D. Landau, E.M. Lifshitz, Theory of Elasticity (Pergamon, New York, 1986) 272. L.D. Landau, E.M. Lifshitz, Fluid Mechanics (Pergamon Press, Oxford, 1987) 273. L.D. Landau, E.M. Lifshitz, L.P. Pitaevskii, Electrodynamics of Continuous Media (Pergamon, Oxford, 1984) 274. A. Lakhtakia, Beltrami Fields in Chiral Media (World Scientific, Singapore, 1994) 275. M.J. Leitman, G.M. Fisher, The linear theory of viscoelasticity, in Encyclopedia of Physics, vol. VIa/3. ed. by C. Truesdell (Springer, Berlin, 1973), pp.1–123 276. F.M. Leslie, Continuum theory for nematic liquid crystals. Cont. Mech. Thermodyn. 4, 167– 175 (1992) 277. D.R. Lide (ed.), CRC Handbook of Chemistry and Physics, 86th edn. (CRC Press, Boca Raton, FL, 2005), pp. 8–116 278. I-S. Liu, Method of Lagrange multipliers for exploitation of the entropy principle. Arch. Rational Mech. Anal. 46, 131–148 (1972)

References

1001

279. Q. Liu, G. Subhash, A phenomenological constitutive model for foams under large deformation. Polym. Eng. Sci. 44, 463–473 (2004) 280. F. London, H. London, Supraleitung und diamagnetismus. Physica 2, 341–354 (1935) 281. F. London, On the Bose-Einstein condensation. Phys. Rev. 54, 947–954 (1938) 282. F. London, Superfluids (Dover, New York, 1961) 283. K. Lonngren, A.C. Scott, Solitons in Action (Wiley, New york, 1978) 284. W.-Y. Lu, Compression and shear response of 3D printed foam pads, in Mechanics of Additive and Advanced Manufacturing, vol. 8, ed. by S. Kramer, J.L. Jordan, H. Jin, J. Carrol, A.M. Beese (Springer, New York 2019) 285. C. Ludwig, Diffusion Zwischen Ungleich Erwarmten Orten Gleich Zusammengestzter Losungen, Sitz. Ber Acad. Wiss. Wien Math-Naturw. Kl. 20, 539 (1856) 286. A.T. McBride, B.D. Reddy, P. Steinmann, Dissipation-consistent modelling and classification of extended plasticity formulations. J. Mech. Phys. Solids 119, 118–139 (2018) 287. M. Mc Carthy, Wave propagation in generalized thermoelasticity. Int. J. Eng. Sci. 10, 593–602 (1972) 288. M. Mc Carthy, Singular surfaces and waves, in Continuum Physics II, ed. by A.C. Eringen (Wiley, New York, 1975) 289. R.M. Mc Meeking, C.M. Landis, S.M.A. Jimenez, A principle of virtual work for combined electrostatic and mechanical loading of materials. Int. J. Non-Lin. Mech. 42, 831–838 (2007) 290. J.W. Macki, P. Nistri, P. Zecca, Mathematical models for hysteresis. SIAM Rev. 35, 94–123 (1993) 291. F. Mainardi, Fractional Calculus and Waves in Linear Viscoelasticity (Imperial College Press, London, 2010) 292. F. Mainardi, G. Spada, Creep, relaxation and viscosity properties for basic fractional models in rheology. Eur. Phys. J. Spec. Top. 193, 133–160 (2011) 293. J. Málek, K.R. Rajagopal, K. Tuma, Derivation of the variants of the Burgers model using a thermodynamic approach and appealing to the concept of evolving natural configurations. Fluids 3(4), 1–18 (2018) 294. J. Mandel, Plasticité Classique et Viscoplasticité, CISM Courses and Lectures no. 97 (Springer, Berlin, 1972) 295. J.J. Marigo, Modeling of brittle and fatigue damage for elastic material by growth of microvoids. Eng. Fract. Mech. 21, 861–874 (1985) 296. M. Massoudi, A. Vaidya, On some generalizations of the second grade fluid model. Nonlin. Anal. Real World Appl. 9, 1169–1183 (2008) 297. G.A. Maugin, The principle of virtual power: application to coupled fields. Acta Mech. 35, 1–70 (1980) 298. G.A. Maugin, Continuum Mechanics of Electromagnetic Solids (North-Holland, Amsterdam, 1988) 299. G.A. Maugin, Internal variables and dissipative structures. J. Non-Equilib. Thermodyn. 15, 173–192 (1990) 300. G.A. Maugin, On the thermomechanics of continuous media with diffusion and/or weak nonlocality. Arch. Appl. Mech. 75, 723–738 (2006) 301. G.A. Maugin, The principle of virtual power: from eliminating metaphysical forces to providing an efficient modelling tool. Continuum Mech. Thermodyn. 25, 127–146 (2013) 302. G.A. Maugin, W. Muschik, Thermodynamics with internal variables. Part I. General concepts. J. Non-Equilib. Thermodyn. 19, 217–249 (1994) 303. G.A. Maugin, J. Pouget, Electroacoustic equations for one-domain ferroelectric bodies. J. Acoust. Soc. Am. 68, 575–587 (1980) 304. J.C. Maxwell, Constitution of bodies, Sci. Papers, College Arts Sci. Univ. Tokyo 2, 616–624 (1877) 305. J.C. Maxwell, A Treatise on Electricity and Magnetism (Oxford University Press, 1904), reprint by (Dover, New York, 1954) 306. J.C. Maxwell, On the dynamical theory of gases, in The Scientific Papers of J.C. Maxwell, vol. 2, pp. 26–78 (1965)

1002

References

307. H. Mehrer, Diffusion in Solids (Springer, Berlin, 2007) 308. W. Meissner, R. Ochsenfeld, Ein neuer effect bei eintritt der supraleitfähigkeit. Naturwissenshaften 21, 787–788 (1933) 309. M. Mihâilescu-Suliciu, I. Suliciu, Energy for hypoelastic constitutive equations. Arch. Rational Mech. Anal. 70, 168–179 (1979) 310. R.D. Mindlin, Polarization gradient in elastic dielectrics. Int. J. Solids Struct. 4, 637–642 (1968) 311. R.D. Mindlin, Elasticity, piezoelectricity and crystal lattice dynamics. J. Elast. 2, 217–282 (1972) 312. R.D. Mindlin, H.F. Tiersten, Effects of couple-stresses in linear elasticity. Arch. Rational Mech. Anal. 11, 415–448 (1962) 313. C.W. Monroe, J. Newman, An introduction to the Onsager reciprocal relations. Chem. Eng. Educ. 41, 233–238 (2007) 314. M. Mooney, A theory of large elastic deformation. J. Appl. Phys. 11, 582–591 (1940) 315. M.J. Moran, H.N. Shapiro, Fundamentals of Engineering Thermodynamics (Wiley, Chichester, 2006) 316. J.J. Moreau, Sur les lois de frottement, de viscosité, et de plasticité. C.R. Acad. Sci. 271, 608–611 (1970) 317. A. Morozov, S. Spagnolie, Introduction to complex fluids, in Complex Fluids in Biological Systems. Experiment, Theory, and Computation, ed. by S. Spagnolie (Springer, 2015) 318. A. Morro, Temperature waves in materials with memory. Meccanica 12, 73–77 (1977) 319. A. Morro, Interaction of acoustic waves with shock waves in elastic solids. Zeit. Angew. Math. Phys. 822–827 (1978) 320. A. Morro, Interaction of waves with shocks in magneto-fluid-dynamics. Acta Mech. 197–213 (1980) 321. A. Morro, Jump relations and discontinuity waves in conductors with memory. Math. Comp. Model. 43, 138–149 (2005) 322. A. Morro, A phase-field approach to non-isothermal transitions. Math. Comp. Model. 48, 621–633 (2008) 323. A. Morro, Balance and constitutive equations for diffusion in mixtures of fluids. Meccanica 49, 2109–2123 (2014) 324. A. Morro, Vector spaces and matrix algebra: vector and tensor algebra, in Encyclopedia of Thermal Stresses, vol. 11, ed. by R. Hetnarski (Springer, Berlin, 2014) 325. A. Morro, A mixture model of poroelastic materials. Eur. J. Mech. A/Solids 53, 121–130 (2015) 326. A. Morro, Diffusion in mixtures of reacting thermoelastic solids. J. Elas. 123, 59–84 (2016) 327. A. Morro, Modelling of viscoelastic materials and creep behaviour. Meccanica 52, 3015–3021 (2017) 328. A. Morro, M. Romeo, Thermodynamic derivation of Saha’s equation for a multitemperature plasma. J. Plasma Phys. 39, 41–51 (1988) 329. A. Morro, T. Ruggeri, Second sound and internal energy in solids. Int. J. Non-Lin. Mech. 23, 27–36 (1987) 330. A. Morro, T. Ruggeri, Non-equilibrium properties of solids obtained from second-sound measurements. J. Phys. C: Solid State Phys. 21, 1743–1752 (1988) 331. A. Morro, M. Vianello, Interstitial energy flux and stress-power for second-gradient elasticity. Math. Mech. Solids 21, 403–412 (2016) 332. A. Morro, Thermodynamic restrictions in linear viscoelasticity. Materials 15, 2706 (2022) 333. I. Müller, A thermodynamic theory of mixtures of fluids. Arch. Rational Mech. Anal. 28, 1–39 (1968) 334. I. Müller, The coldness, a universal function in thermoelastic bodies. Arch. Rational Mech. Anal. 41, 319–332 (1971) 335. I. Müller, Thermodynamics of mixtures of fluids. J. Mécanique 14, 267–303 (1975) 336. I. Müller, Thermodynamics (Pitman, London, 1985) 337. I. Müller, T. Ruggeri, Extended Thermodynamics (Springer, New York, 1993)

References

1003

338. I. Müller, Thermodynamics of mixtures and phase field theory. Int. J. Solids Struct. 38, 1105– 1113 (2001) 339. I. Müller, S. Seelecke, Thermodynamic aspects of shape memory alloys. Math. Comp. Model. 34, 1307–1355 (2001) 340. A. Nagy, W.L. Ko, U.S. Lindholm, Mechanical behavior of foamed materials under dynamic compression. J. Cell. Plast. 10, 127–134 (1974) 341. J.L. Neuringer, R.E. Rosensweig, Ferrohydrodynamics. Phys. Fluids 7, 1927–1937 (1964) 342. T.D. Nguyen, S. Mao, Y.-W. Yeh, P.K. Purohit, M.C. McAlpine, Nanoscale flexoelectricity. Adv. Mater. 25, 946–974 (2013) 343. D.R. Nicholson, Introduction to Plasma Theory (Wiley, New York, 1983). ((ch. 8)) 344. W. Noll, A new mathematical theory of simple materials. Arch. Rational Mech. Anal. 48, 1–50 (2017) 345. A. Oberbeck, Uber die Wärmeleitung der Flüssigkeiten bei Berücksichtigung der Strömunggeninfolge von Temperaturdifferenzen. Ann. Phys. Chem. 7, 271–292 (1879) 346. R.W. Ogden, Non-Linear Elastic Deformations (Dover, New York, 1998) 347. J. Oldroyd, On the formulation of rheological equations of state. Proc. Roy. Soc. Lond. Ser. A 200, 523–541 (1950) 348. X. Oliver, C. Agelet de Saracibar, Continuum Mechanics for Engineers (Theory and Problems (UPC, Barcelona, 2017) 349. L. Onsager, Reciprocal relations in irreversible processes. Phys. Rev. 37, 405–426 (1931) 350. W.C. Oseen, The theory of liquid crystals. Trans. Faraday Soc. 29, 883–899 (1933) 351. M. Otténio, M. Destrade, R.W. Ogden, Incremental magnetoelasticdeformations, with application to surface instability. J. Elast. 90, 19–42 (2008) 352. T.C. Papanastasiou, Flows of materials with yields. J. Rheology. 31, 384–404 (1987) 353. Y.-S. Pao, K. Hutter, Electrodynamics for moving elastic solids and viscous fluids. Proc. IEEE 63, 1011–1021 (1975) 354. J.R.G. Parkinson, D.F. Martin, A.J. Wells, R.F. Katz, Modelling binary alloy solidification with adaptive mesh refinement. J. Comput. Phys. X5, 100043 (2020) 355. S.L. Passman, J.W. Nunziato, A theory of multiphase mixtures, in Rational Thermodynamics, ed. by C. Truesdell (Springer, New York, 1984) 356. P. Penfield, H.A. Haus, Electrodynamics of Moving Media (MIT Press, Cambridge Mass, 1967) 357. O. Penrose, P.C. Fife, Thermodynamically consistent models of phase-field type for the kinetics of phase transitions. Physica D 43, 44–62 (1990) 358. V.P. Peshkov, Second sound in helium II. Sov. Phys. JETP 11, 580–584 (1960); J. Phys. USSR 8, 381 (1944) 359. T.-H. Pham, J.J. Kim, S.-E. Kim, Estimating constitutive equation of structural steel using indentation. Int. J. Mech. Sci. 90, 151–161 (2015) 360. A.C. Pipkin, Small finite deformations of viscoelastic solids. Rev. Mod. Phys. 38, 1034–1041 (1964) 361. A.C. Pipkin, Constraints in linearly elastic materials. J. Elast. 6, 179–193 (1986) 362. A.B. Pippard, L. Bragg, An experimental and theoretical study of relations between magnetic fields and current in superconductors. Proc. Roy. Soc. Lond. A 216, 547–568 (1953) 363. L. Pitaevskii, Vortex lines in an imperfect Bose gas. J. Exp. Theor. Phys. 13, 451–454 (1961) 364. P. Podio Guidugli, On dissipation mechanisms in micromagnetics. Eur. Phys. J. B 19, 417–424 (2001) 365. P. Podio Guidugli, A primer in elasticity. J. Elast. 58, 1–103 (2000) 366. P. Podio-Guidugli, Continuum Thermodynamics (Springer, Cham, 2019) 367. P. Podio Guidugli, M. Vianello, Constraint manifolds for isotropic solids. Arch. Rational Mech. Anal. 105, 105–121 (1989) 368. E.M. Purcell, Electricity and Magnetism (McGraw-Hill, New York, 1965) 369. R. Quintanilla, Moore-Gibson-Thompson thermoelasticity. Math. Mech. Solids 24, 4020– 4031 (2019)

1004

References

370. K.R. Rajagopal, A.R. Srinivasa, On the thermomechanics of shape memory wires. Z. angew. Math. Phys. 50, 459–496 (1999) 371. K.R. Rajagopal, M. Ruzicka, Mathematical modelling of electrorheological materials. Cont. Mech. Thermodyn. 13, 59–78 (2001) 372. K.R. Rajagopal, On implicit constitutive theories for fluids. J. Fluid Mech. 550, 243–249 (2006) 373. K.R. Rajagopal, G. Saccomandi, L. Vergori, On the Oberbeck-Boussinesq approximation for fluids with pressure dependent viscosities. Nonlinear Anal. Real World Appl. 10, 1139–1150 (2009) 374. K.R. Rajagopal, G. Saccomandi, L. Vergori, Flow of fluids with pressure- and shear-dependent viscosity down an inclined plane. J. Fluid Mech. 706, 173–189 (2012) 375. M.A. Rahman, M.Z. Saghir, Thermodiffusion or Soret effect: historical review. Int. J. Heat Mass Transf. 73, 693–705 (2014) 376. J.R. Reitz, F.J. Milford, R.W. Christy, Foundations of Electromagnetic Theory (AddisonWesley, Reading Massachusetts, 1993) 377. L.A. Richards, Capillary conduction of liquids in porous media. Physics 1, 318–333 (1931) 378. R.S. Rivlin, Large elastic deformations isotropic materials IV. Further developments of the general theory. Phil. Trans. R. Soc. A 241, 379–397 (1948) 379. R.S. Rivlin, J.L. Ericksen, Stress-deformation relation for isotropic materials. J. Rational Mech. Anal. 4, 323–425 (1955) 380. A. Romano, Proof that the formulations of the electrodynamics of moving bodies are equivalent. Arch. Rational Mech. Anal. 68, 283–298 (1978) 381. D. Said Schicchi, A. Caggiano, M. Hunkel, E.A.B. Koenders, Thermodynamically consistent multiscale formulation of a thermo-mechanical problem with phase transformations. Cont. Mech. Thermodyn. 31, 273–299 (2019) 382. G.A. Samara, The relaxational properties of compositionally disordered SBO3 perovskites. J. Phys.: Condens. Matter 15, R367–R411 (2003) 383. M. Sarjala, E.T. Seppälä, M.J. Alava, Dynamic hysteresis in ferroelectrics with quenched randomness. Physica B 40, 418–421 (2008) 384. P. Saxena, M. Hossain, P. Steinmann, A theory of finite deformation magneto-viscoelasticity. Int. J. Solids Struct. 50, 3886–3897 (2013) 385. P. Saxena, D.K. Vu, P. Steinmann, On rate-dependent dissipation effects in electro-elasticity. Int. J. Non-Lin. Mech. 62, 1–11 (2014) 386. J.W. Schmelzer, E.D. Zanotto, V.M. Fokin, Pressure dependence of viscosity. J. Chem. Phys. 122, 074511 (2005) 387. J. Schröder, P. Neff, Poly-, Quasi- and Rank-One Convexity in Applied Mechanics (Springer, Berlin, 2010) 388. J.F. Scott, Electrocaloric materials. Annu. Rev. Mater. Res. 41, 229–240 (2011) 389. R.F. Sekerka, Similarity solutions for a binary diffusion couple with diffusivity and density dependent on composition. Prog. Mat. Sci. 49, 511–536 (2004) 390. A. Sellitto, V.A. Cimmelli, D. Jou, Mesoscopic Theories of Heat Transport in Nanosystems (Springer, Berlin, 2016) 391. A. Serra-Aguila, J.M. Puigoriol-Forcada, G. Reyes et al., Viscoelastic models revisited: characteristics and interconversion formulas for generalized Kelvin-Voigt and Maxwell models. Acta Mech. Sin. 35, 1191–1209 (2019) 392. J. Serrin, Mathematical principles of classical fluid mechanics, in Encyclopedia of Physics, vol. VIII/1. ed. by S. Flügge (Springer, Berlin, 1959), pp.125–263 393. N.D. Sharma, R. Maranganti, P. Sharma, On the possibility of piezoelectric nanocomposites without using piezoelectric materials. J. Mech. Phys. Solids 55, 2328–2350 (2007) 394. M.I. Shliomis, Effective viscosity of magnetic suspensions. Soviet Phys. JETP 34, 1291–1994 (1972) 395. W. Shockley, R.P. James, “Try Simplest Cases” Discovery of “Hidden Momentum” Forces on “Magnetic Currents”. Phys. Rev. Lett. 18, 876–879 (1967) 396. B.V. Somov, Plasma Astrophysics (Springer, Berlin, 2007). ((ch. 11))

References

1005

397. A.J.C.B. de St. Venant, Établissement élémentaire des formules et équations générales de la theorie de l’élasticité des corpes solides. Appendix in: Résumé des Leçons données à l’École des Ponts et Chaussées sur l’Application de la Mécanique; De la Résistance des Corps Solides, par C.L.M.H. Navier (Paris, 1864) 398. J. Stefan, Über das Gleichgewicht und Bewegung, insbesondere die Diffusion von Gemishen. Sitzungsberichte der Kaiserlichen Akademie der Wissenschaften Wien 63, 63–124 (1871) 399. J.J. Stoker, Water Waves (Wiley, New York, 1958) 400. J.A. Stratton, Electromagnetic Theory (McGraw-Hill, New York, 1962) 401. B. Straughan, Heat Waves (Springer, Berlin, 2011) 402. B. Straughan, Stability and uniqueness in double porosity elasticity. Int. J. Eng. Sci. 65, 1–8 (2013) 403. Y. Su, C.M. Landis, Continuum thermodynamics of ferroelectric domain evolution: theory, finite element implementation, and application to domain wall pinning. J. Mech. Phys. Solids 55, 280–305 (2007) 404. E.S. Suhubi, Thermoelastic Solids, in Continuum Physics, vol. II, ed. by A.C. Eringen (Academic, New York, 1975) 405. M. Svanadze, Plane waves and boundary value problems in the theory of elasticity for solids with double porosity. Acta Appl. Math. 122, 461–471 (2012) 406. H. Szambolics, J.-Ch. Toussaint, A. Marty, I.M. Miron, L.D. Buda-Prejbeanu, Domain wall motion in ferromagnetic systems with perpendicular magnetization. J. Magn. Magn. Mater. 321, 1912–1918 (2009) 407. A.K. Tagantsev, A.S. Yurkov, Flexoelectric effect in finite samples. J. Appl. Phys. 112, 044103 (2012) 408. C.-T. Tai, On the presentation of Maxwell’s theory. Proc. IEEE 60, 936–945 (1972) 409. R. Taylor et al., Small particles, big impacts: a review of the diverse applications of nanofluids. J. Appl. Phys. 113, 011301 (2013) 410. J.J. Thalakkottor, K. Mohseni, Unified slip boundary condition for fluid flows. Phys. Rev. E 94, 023113 (2016) 411. A. Thiaville, Y. Nakatani, J. Miltat, Y. Suzuki, Micromagnetic understanding of current-driven domain wall motion in patterned nanowires. Europhys. Lett. 69, 990–996 (2005) 412. N.L. Thomas, A.H. Windle, A theory of case II diffusion. Polymer 23, 529–542 (1982) 413. H.F. Tiersten, On the nonlinear equations of thermo-electroelasticity. Int. J. Eng. Sci. 9, 587– 604 (1971) 414. D. Tilley, J. Tilley, Superfluidity and Superconductivity (IOP Publishing, Bristol, 1990) 415. A.M. Tishin, Y.I. Spichkin, The Magnetocaloric Effect and Its Applications (Series in Condensed Matter Physics) (Publishing Ltd., Institute of Physics, 2003) 416. L. Tisza, Sur la supraconductibilité thermique de l’hélium II liquide et la statistique de BoseEinstein. C. R. Acad. Sci. 207, 1035–1037 (1938) 417. L. Tisza, La viscosité de l’hélium liquide et la statistique de Bose-Einstein. C. R. Acad. Sci. 207, 1186–1189 (1938) 418. J.C. Tolédano, P. Tolédano, The Landau Theory of Phase Transitions (World Scientific, Singapore, 1987) 419. R.A. Toupin, The elastic dielectric. J. Rational Mech. Anal. 5, 849–915 (1956) 420. R.A. Toupin, Elastic materials with couple-stresses. Arch. Rational Mech. Anal. 11, 385–414 (1962) 421. R.A. Toupin, Theory of elasticity with couple-stresses. Arch. Rational Mech. Anal. 17, 85–112 (1964) 422. L.R.G. Treloar, The Physics of Rubber Elasticity (Clarendon Press, Oxford, 1958) 423. C. Truesdell, The simplest rate theory of pure elasticity. Comm. Pure Appl. Math. 8, 123–132 (1955) 424. C. Truesdell, Introduction à la Mécanique Rationelle des Milieux Continus (Masson, Paris, 1974) 425. C. Truesdell, A First Course in Rational Continuum Mechanics (Academic, NewYork, 1977) 426. C. Truesdell, Rational Thermodynamics (Springer, New York, 1984). ((ch. 5))

1006

References

427. C. Truesdell, W. Noll, The non-linear field theory of mechanics, in Encyclopedia of Physics, vol. III/3, ed. by S. Flügge (Springer, Berlin, 1965) 428. C. Truesdell, R.A. Toupin, The classical field theories, in Encyclopedia of Physics, vol. III/1, ed. by S. Flügge (Springer, Berlin, 1960) 429. L. Vaidman, Torque and force on a magnetic dipole. Am. J. Phys. 58, 978–983 (1990) 430. M.P. Vernotte, Les paradoxes de la théorie continue de l’équation de la chaleur. Comptes Rendus de l’Académie des Sciences 246, 3154–3155 (1958) 431. W.F. Vinen, J.J. Niemela, Quantum turbulence. J. Low Temp. Phys. 128, 167–231 (2002) 432. A. Visintin, Differential Models of Hysteresis (Springer, Berlin, 1994) 433. A. Visintin, Models of Phase Transition (Birkäuser, Boston, 1996); ch. 4 434. A. Visintin, Mathematical models of hysteresis, in The Science of Hysteresis, ed. by G. Bertotti, I. Mayergoyz (Elsevier, 2006), pp. 1–123 435. V. Volterra, Leçons sur les Fonctions des Lignes (Gauthier-Villars, Paris, 1913) 436. V. Volterra, Sur la théorie mathématique des phénomènes héréditaires. J. Math. Pures Appl. 7, 249–298 (1928) 437. V. Volterra, Energia nei fenomeni ereditari. Acta Pontificia Acad. Scient. 4, 115–128 (1940) 438. J. Vörös, Modeling and identification of hysteresis using special forms of the ColemanHodgdon model. J. Electr. Eng. 60, 100–105 (2009) 439. R.T. Wang, W.T. Wagner, R.J. Donnelly, Precision second-sound velocity measurements in helium II. J. Low Temp. Phys. 68, 409–417 (1987) 440. E. Warburg, Magnetische Untersuchungen. Annalen der Physik 13, 141–164 (1881) 441. J.G. Webster (ed.), The Measurement, Instrumentation, and Sensors Handbook (CRC Press (in cooperation with IEEE Press), Boca Raton, FL, 1999). 0849383471 442. J. Weingarten, Sulle superfici di discontinuità nella teoria dell’elasticità dei corpi solidi. Atti Accad. Naz. Lincei Cl. Sci. Fis. Mat. Natur. Rend. 10, 57–60 (1901) 443. A. Wheeler, W. Boettinger, G. McFadden, Phase-field model for isothermal phase transitions in binary alloys. Phys. Rev. A d5, 7424–7440 (1992) 444. J.L. White, A.B. Metzner, Development of constitutive equations for polymeric melts and solutions. J. Appl. Polym. Sci. 7, 1867–1889 (1963) 445. E. Wiechert, Gesetze der Elastichen Nachwirkung für constante Temperatur. Annalen der Physik 286(335–348), 546–570 (1893) 446. L. Xu, S. Zhang, Magnetization dynamics at elevated temperatures. Physica E 45, 72–76 (2012) 447. Y. Yu, P. Perdikaris, G.E. Karniadakis, Fractional modeling of viscoelasticity in 3D cerebral arteries and aneurysms. J. Comp. Phys. 323, 219–242 (2016) 448. M.W. Zemansky, Heat and Thermodynamics (McGraw-Hill, New York, 1968) 449. C. Zener, Elasticity and Anelasticity of Metals (University of Chicago Press, Chicago, 1948)

Index

A Absolute temperature, 103 Acoustic tensor, 221 Active rotation, 24 Almansi tensor, 10 Amperian formulation, 142 Anhysteretic curve, 903 Auxetics, 228 Axial vector, 957 B Ballistic energy, 402 Bernoulli’s law, 115 Biharmonic equation, 234 Bingham fluid, 326 Bingham-Maxwell solid, 327 Bloch equations, 720 Boltzmann function, 408 Boltzmann–Matano method, 593 Bulk modulus, 227 C Cahn–Hilliard equation, 930 Cauchy–Green tensors, 7 Cayley–Hamilton equation, 965 Chemical potential, 278, 574 Christoffel equation, 233 Chu formulation, 141 Circular polarization, 773 Clapeyron’s equation, 920 Clausius-Duhem inequality, 104 Clausius-Planck inequality, 103 Coexistence line, 921 Constitutive equations, 178

Convected time derivative, 48, 49 Convecting sets, 18 Cosserat deformation tensor, 615 Couple stress tensor, 619 Creep compliance, 337 Creeping effect, 851 Creep test, 334

D Dalton’s law, 584 Determinant of a tensor, 959 Deviator, 958 Differential elastic modulus, 821 Differential susceptibility, 875 Diffusion velocity, 549 Displacement current, 128 Displacement derivative, 41, 43

E Einstein–de Haas effect, 154 Elasticity tensor, 223 Electric dipole moment, 125 Electric displacement, 125 Electric susceptibility, 136 Enstrophy-transport equation, 306 Entropy equation, 199, 275 Entropy production, 105 Equilibrium elastic modulus, 408 Exergy, 109 Extra-entropy flux, 104

F Fading memory space, 385

© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG 2023 A. Morro and C. Giorgi, Mathematical Modelling of Continuum Physics, Modeling and Simulation in Science, Engineering and Technology, https://doi.org/10.1007/978-3-031-20814-0

1007

1008 Finger tensor, 9 First law of thermodynamics, 89 First Piola–Kirchhoff stress tensor, 94 Flux, 984 Fountain effect, 17, 851 Fresnel’s equation, 746 G Generalized Stokes theorem, 50 Gibbs equations, 581 Gibbs–Duhem equation, 610 Gibbs-Helmholtz relation, 200 Gilbert equation, 720 Ginzburg–Landau equation, 930 Green-St. Venant strain tensor, 7 Gyration tensor, 613 H Hess–Fairbank effect, 865 Hyperelastic solids, 813 Hypoelastic materials, 214 Hypoelastic solids, 215, 814 Hysteresis in ferroelectrics, 901 in ferromagnetism, 905 in plasticity, 824 in SMA, 946 Hysteretic function, 821 I Infinitesimal strain tensor, 8 Influence function, 384 Inner part, 567 Instantaneous elastic modulus, 408 Interstitial work flux, 523 Interstitial working, 522 Isotropic, 217 J Jeffreys fluid, 329 K Kelvin-Voigt solid, 328 Kinematical condition of compatibility, 42 Kirchhoff stress, 95 Kröner decomposition, 805 L Lamé moduli, 226

Index Landau–Lifshitz equation, 719 Landau-de Gennes model, 643 Langmuir waves, 769 Larmor frequency, 781 Law of action and reaction, 80 Linear theory of elasticity, 225 Liquidus and solidus lines, 932 Liquidus line, 932 Local speed of propagation, 36, 38, 43 Lorentz force, 133 Lorentz trasformation, 130 Loss modulus, 416 Lower convected derivative, 58 Lyotropic LC, 629 M Magnetic induction, 122 Magnetic intensity, 126 Magnetic pressure, 762 Magnetic susceptibility, 126, 902 Magnetization, 126 Magnetostriction, 709 Mandel stress, 808 Mass fraction, 548 Maxwell–Cattaneo equation, 397 Maxwell equations, 128 Maxwell’s relations, 278 Maxwell stress tensor, 156 Maxwell stress tensor in matter, 159 Maxwell-Wiechert fluid, 325 Mechanocaloric effect in liquid helium, 850 Meissner effect, 836 Metallotropic LC, 629 Minkowski formulation, 140 Mixing entropy, 589 Mobility, 665 Mole fraction, 548 N Navier–Stokes equation, 291, 301 Nernst’s principle, 109 Newtonian fluid, 291, 300 O Oldroyd-B fluid, 363 Orientational momentum, 616 Orthogonal tensor, 960 P Passive rotation, 22, 24 Peculiar derivative, 546

Index Phase field, 111 Phase rule, 925 Phonons, 236 Piola stress, 94 Planck inequality, 106 Poisson’s condition, 163 Poisson’s ratio, 228 Polarization vector, 124, 125 Porosity, 660 Poynting vector, 144 Pressure head, 666 Primary wave, 230 Principle of objectivity, 603 of phase separation, 561 Product of tensors, 956 Pyroelectric vector, 690 R Rankine–Hugoniot jump condition, 164 Relaxation function, 408 Relaxation modulus, 337 Relaxation property, 383 Relaxation test, 334 Relaxation time, 325, 329, 332 Retardation time, 328, 333 Rigid deformation, 12 Rivlin–Ericksen tensors, 69 Rivlin model, 244 S Secondary wave, 230 Shear modulus, 227 Solidus line, 932 Speed of displacement, 36, 37 Speed of propagation, 36 Spin, 20 Standard linear solid, 332 Strain tensor, 8 Stretching, 20 Strong singularities, 43

1009 Symmetry group, 216

T Thermodynamic pressure, 288 Thermodynamic process, 104 Thermomechanical effect in liquid helium, 850 Thermophoretic effect, 653 Thermotropic LC, 629 Three parameter model, 332 Total pressure, 288 Transpose of a tensor, 955 Truesdell rate, 59 Twirl tensor, 25 Type I superconductors, 836 Type II superconductors, 837

U Upper convected derivative, 58 Upper convected Maxwell, 364

V Velocity gradient, 19 Villari effect, 709 Voigt notation, 231 Vorticity, 21 Vorticity transport equation, 292, 306

W Water-vapour coexistence, 920 Wave, 37 Weak singularities, 43 Weingarten-Hadamard theorem, 47 Work function, 422

Y Young’s modulus, 228