Linear Delay-Differential Systems with Commensurate Delays: An Algebraic Approach 3540428216, 9783540428213

The book deals with linear time-invariant delay-differential equations with commensurated point delays in a control-theo

116 51 13MB

English Pages 176 [183] Year 2001

Report DMCA / Copyright

DOWNLOAD PDF FILE

Table of contents :
3 The Algebraic Structure of H0
3.1 Divisibility Properties
3.2 Matrices over H0
3.3 Systems over Rings: A Brief Survey
3.4 The Nonfinitely Generated Ideals of H0
3.5 The Ring H as a Convolution Algebra
3.6 Computing the Bezout Identity
4 Behaviors of Delay-Differential Systems
4.1 The Lattice of Behaviors
4.2 Input/Output Systems
4.3 Transfer Classes and Controllable Systems
4.4 Subbehaviors and Interconnections
4.5 Assigning the Characteristic Function
4.6 Biduals of Nonfinitely Generated Ideals
5 First-Order Representations
5.1 Multi-Operator Systems
5.2 The Realization Procedure of Fuhrmann
5.3 First-Order Realizations for Delay-Differential Systems
5.4 Some Minimality Issues
Blank Page
Blank Page
Recommend Papers

Linear Delay-Differential Systems with Commensurate Delays: An Algebraic Approach
 3540428216, 9783540428213

  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Citation preview

Lecture Notes in Mathematics Edited by J.-M. Morel, F. Takens and B. Teissier

Editorial Policy for the publication of monographs 1. Lecture Notes aim to report new developments in all areas of mathematics quickly, informally and at a high level. Monograph manuscripts should be reasonably self-contained and rounded off. Thus they may, and often will, present not only results of the author but also related work by other people. They may be based on specialized lecture courses. Furthermore, the manuscripts should provide sufficient motivation, examples and applications. This clearly distinguishes Lecture Notes from journal articles or technical reports which normally are very concise. Articles intended for a journal but too long to be accepted by most journals, usually do not have this "lecture notes" character. For similar reasons it is unusual for doctoral theses to be accepted for the Lecture Notes series.

2. Manuscripts should be submitted (preferably in duplicate) either to one of the series- editors or to Springer-Verlag, Heidelberg. In general, manuscripts will be_ sent out to 2 external referees for evaluation. If a decision cannot yet be reached on the basis of the first 2 reports, further referees may be contacted: the author will be informed of this. A final decision to publish can be made only on the basis of the complete manuscript, however a refereeing process leading to a, preliminary decision can be based on a pre-final or incomplete manuscript. The strict minimum amount of material that will be considered should include a detailed outline deScribing the planned contents of each cha,pter, a bibliography and several sample chapters. Authors should be aware that incomplete or insufficiently close to final manuscripts almost always result in longer refereeing times and nevertheless unclear referees' recommendations, making further refereeing of a final draft necessary. Authors should also be aware that parallel submission of their manuscript to another publisher while under consideration for LNM will in general lead to immediate rejection. 3. Manuscripts should in general be submitted in English. Final manuscripts should contain at least 100 pages of mathematical text and should include - a table of contents; - an informative introduction, with adequate motivation and perhaps some historical remarks: it should be accessible to a reader not intimately familiar with the topic treated; - a subject index: as a rule this is genuinely helpful for the reader.

Lecture Notes in Mathematics Editors: J.-M . Morel, Cachan F. Takens, Groningen B. Teissier, Paris

1770

Springer Berlin Heidelberg New York Barcelona Hong Kong London Milan Paris Singapore Tokyo

Heide

Gluesing-Luerssen,

Linear

Delay- Differential

Systems with

Delays: An Algebraic Approach Commensurate

4

11,11 4%

Springer

Author Heide

Gluesing-Luerssen

Department of Mathematics University of Oldenburg 26111 Oldenburg, Germany e-mail:

[email protected]

Cataloging-in-Publication Data available Die Deutsche Bibliothek

-

CIP-Einheitsaufnahme

Gltising-Ltierssen, Heide: delay differential systernswith commensurate'delays : an algebraic approach / Heide Gluesing-Lueerssen. Berlin; Heidelberg; New York; Barcelona ; Hong Kong ; London ; Milan ; Paris ; Tokyo : Springer, 2002 (Lecture notes in mathematics ; 1770) Linear

-

ISBN 3-540-42821-6

Mathematics

Subject Classification (2000): 93CO5, 93B25, 93C23, 13B99, 39B72

ISSN 0075-8434 ISBN 3-540-42821-6

Springer-Verlag Berlin Heidelberg New York

subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilm or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are

This work is

liable for prosecution under the German

Springer-Verlag

Berlin

Heidelberg

Copyright'Law.

New York

a

member of BertelsmannSpringer

Science + Business Media GmbH

http://www.springer.de @

Springer-Verlag Berlin Heidelberg 2002

Printed in

Germany

registered names, trademarks, etc. in this publication does not imply, specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.

The

use

even

of general descriptive names,

in the absence of

a

Typesetting: Camera-ready TEX output by the author SPIN:10856623

41/3143/LK

-

543210

-

Printed

on

acid-free paper

Preface

delay-differential equation was coined to comprise all types of differequations in which the unknown function and its derivatives occur with

The term ential

various values of the argument. In these notes we concentrate on (implicit) linear delay-differential equations with constant coefficients and commensurate

point delays. We present

an

investigation of dynamical delay-differential

sys-

tems with respect to their general system-theoretic properties. To this end, an algebraic setting for the equations under consideration is developed. A thorough

purely algebraic study shows that this setting is well-suited for an examination of delay-differential systems from the behavioral point of view in modern systems theory. The central object is a suitably defined operator algebra which turns out to be an elementary divisor domain and thus provides the main tool for handling matrix equations of delay-differential type. The presentation is introductory and mostly self-contained, no prior knowledge of delay-differential equations or (behavioral) systems theory will be assumed. people whom I am pleased to thank for making this grateful to Jan C. Willems for suggesting the topic "delaydifferential systems in the behavioral approach" to me. Agreeing with him, that algebraic methods and the behavioral approach sound like a promising combination for these systems, I started working on the project and had no idea of what I was heading for. Many interesting problems had to be settled (resulting in Chapter 3 of this book) before the behavioral approach could be started. Special thanks go to Wiland Schmale for the numerous fruitful discussions we had in particular at the beginning of the project. They finally brought me on the right track for finding the appropriate algebraic setting. But also later on, he kept discussing the subject with me in a very stimulating fashion. His interest in computer algebra made me think about symbolic computability of the Bezout identity and Section 3.6 owes a lot to his insight on symbolic computation. I wish to thank him for his helpful feedback and criticisms. These notes grew out of my Habilitationsschrift at the University of Oldenburg, Germany. The readers Uwe Helmke, Joachim Rosenthal, Wiland Schmale, and Jan C. Willems deserve special mention for their generous collaboration. I also want, to thank the Springer-Verlag for the pleasant cooperation. Finally, my greatest thanks go There

work

are

a

number of

possible.

I

am

VI

Preface

only for many hours carefully proofreading all making helpful suggestions, but also, and even more, for and so patient, supportive, being encouraging during the time I was occupied with writing the "Schrift". to my

partner, Uwe Nagel,

these pages and

Oldenburg, July

not

various

2001

Heide

Gluesing-Luerssen

Table of Contents

1

Introduction

2

The

Algebraic

3

The

3.1

Algebraic Structure Divisibility Properties

3.2

Matrices

3.3

Systems over Rings: A Brief Survey Nonfinitely Generated Ideals of Ho The Ring H as a Convolution Algebra Computing the Bezout Identity

3.4 3.5

3.6 4

5

................................................

Framework

over

Ho

..................................

of

Wo

,

............................

25 35

.........................

43

.....................

45

......................

51

Delay-Differential Systems

4.1

The Lattice of Behaviors

4.2

Input/Output Systems

4.3

Transfer Classes and Controllable

4.4

Subbehaviors and Interconnections

Assigning

4.6

Biduals of

..........

59

.....................

73

..................................

76

....................................

89

Systems

Nonfinitely Generated

...................

.........................

the Characteristic Function

.......................

Ideals

5.1

Representations Multi-Operator Systems

5.2

The Realization Procedure of Fuhrmann

5.3

First-Order Realizations

5.4

Some

First-Order

.....................

9&

104 115 129

................................

135

...................................

138

.....................

148

...................................

157'

...................................

162

......................................................

169

...........................................................

175

References Index

23

.........................................

..................

4.5

7

......................................

The

Behaviors of

I

Minimality Issues

I Introduction

Delay-differential equations (DDEs, for short) arise when dynamical systems time-lags are being modeled. Such lags might for instance occur if some nonnegligible transportation time is involved in the system or if the system needs

with

a

certain amount of time to

sense

information and react

on

it. The characteristic

feature of

a system with time-lags is that the dynamics at a certain time does not only depend on the instantaneous state of the system but also on some past values. The dependence on the past can take various shapes. The simplest type is

that of

retardation,

delay, describing for instance the generally, might depend on time (or other effects). Modeling such systems leads to differential- difference equations, also called differential equations with a deviating argument, in which the unknown function and its derivatives occur with their respective values at various time instants t--rk. A completely different form of past dependence arises if the process under investigation depends on the full history of the system over a certain time interval. In this case a ma*matical formulation leads to general functional-differential equations, for instance integro-differential equations. In control theory the term distributed delay, as opposed to point delay, has been coined for this type of past dependence. We will consistently use the term delaydifferential'equation for differential equations having any kind of delay involved. a

constant

reaction time of

All the

a

a

so-called point

system. More

the reaction time itself

delay-differential equations described above

fall in the category of

infinite-dimensional systems. The evolution of these systems can be described in a twofold way. On the one hand, the equations can, in certain circumstances, be formulated

abstract differential

equations on an infinite-dimensional space. conditions, which in this case are segover a time interval of appropriate length. This description leads to an pperator-theoretic framework, well suited for the investigation of the qualitativeIbehavior of these systems. For, a treatment of DDEs based on functional analytic methods we refer to the books Hale and Verduyn Lunel [49] and Diekmann et al. [22] for functional-differential equations and to the introductory book Curtain and Zwart [20] on general infinite-dimensional linear systems in control theory. On the other hand, DDEs deal with one-variable functions and can be treated to a certain extent with "analysis on W' and transform techniques. For an investigation, of DDEs in this spirit we refer to the books Bellman and Cooke [3], Driver [23], El'sgol'ts and Norkin [28], and Kolmanovskii and as

The space consists ments of functions

basically of

all initial

H. Gluesing-Luerssen: LNM 1770, pp. 1 - 5, 2002 © Springer-Verlag Berlin Heidelberg 2002

1 Introduction

2

[65]

t4erein. All the monographs mentioned so far aim analyzing qualitative behavior of their respective equations, most of the time with an emphasis on stability theory. Nosov

and the references the

at

Our interest in DDEs is of

systems

different nature. Our

a

goal

is

an

investigation of

governed by DDEs with respect to their general control-theoretic prop-

adopt an approach which goes back to Willems (see nowadays called the behavioral approach to systems theory. In this framework, the key notion for specifying a system is the space -of all possible trajectories of that system. This space, the behavior, can be regarded as the most intrinsic part of the dynamical system. In case the dynamics can be described by a set of equations, it is simply the corresponding solution space. Behavioral theory now introduces all fundamental system properties and constructions in terms of the behavior, that means at the level of the trajectories of the system and independent of a chosen representation. In order to develop a mathematical theory, one must be able to deduce these properties from the equations governing the system, maybe even find characterizations in terms of the equations. For systems governed by linear time-invariant ordinary erties. To this

for instance

end,

we

will

[118, 119])

and is

differential equations this has been worked out in great detail and has led to a successful theory, see, e. g., the book Polderman and Willems [87]. Similarly for multidimensional systems, described by partial differential or discrete-time difference equations, much progress has been made in this direction, see for instance Oberst

troller, the

[84], Wood'et

al.

[123],

and Wood

[122].

The notion of

a con-

important tool of control theory, can also be incorporated in this framework. A controller forms a system itself, thus a family of trajectories, most

and the interconnection of

a

to-be-controlled system with

a

controller

simply

leads to the intersection of the two respective behaviors. The aim of this

monograph is to develop, and then to apply, a theory which dynamical systems described by DDEs can be successfully studied from the behavioral point of view. In order to pursue this goal, it is unavoidable to understand the relationship between behaviors and their -describing equations in full detail. For instance, we will need to know the (algebraic) relation between two sets of equations which share the 'same solution space. Restricting shows that

to

a

reasonable class of systems, this

can

indeed be achieved and leads to

an

al-

gebraic setting, well suited for further investigations. To.be precise, the class of systems we are going to study consists of (implicit) linear DDEs with constant coefficients and commensurate point delays. The solutions being considered are in the space of C'-functions. Formulating all this in algebraic terms, one obtains a setting where a polynomial ring in two operators acts on a module of functions. However, it turns out that in order to answer the problem raised above, this setting will not suffice, but rather has to be enlarged. More specifically, certain distributed delay operators (in other words, integro-differential equations) have to be incorporated in our framework. These distributed delays have a very specific feature; just like point-delay-differential operators they are determined by finitely many data, in fact they correspond to certain rational

1 Introduction

functions in two variables. In order to get an idea of this larger algebraic setting, only a few basic analytic properties of scalar DDEs are needed. Yet, some

algebraic investigations are necessary to see that this provides indeed the appropriate framework. In fact, it subsequently allows one to draw far-reaching consequences, even for systems of DDEs, so that finally the behavioral approach careful

can be initiated.

of

As

algebra which in

a

consequence, the

our,

opinion

is

monographcontains

fairly interesting by

a

considerable part

itself.

delay-differential systems have already been studalgebraic point of view in the seventies, see, e. g., Kamen [61], Morse [79], and Sontag [105]. These papers have initiated the theory of Systems over rings, which developed towards an investigation of dynamical systems where the trajectories evolve in the ring itself. Although this point of view leads away from the actual system, it has been (and still is) fruitful whenever system properties concerning solely the ring of operators are investigated. Furthermore it has led to interesting and difficult purely ring-theoretic problems. Even though our approach is ring-theoretic as well, it is not in the spirit of systems over rings, for simply the trajectories live in a function space., Yet, there exist a few connections between the theory of systems over rings. and our approach; we will therefore present some more detailed aspects of systems over

We want to remark that ied from

an

rings later

in the book.

give a brief overview of the organization of the book. Chapintroducing the class of DDEs under consideration along with the algebraic setting mentioned above. A very specific and simple relation between linear ordinary differential equations and DDEs'suggests to study a ring of operators consisting of point-delay-differential operators as well as certain distributed delays; it will be denoted by H. In Chapter 3 we disregard the interpretation as delay-differential operators and investigate the ring 'H from a purely algebraic point of view. The main result of this chapter will be that the ring'H forms a so-called elementary divisor domain. Roughly speaking, this says that matrices with entries in that ring behave under unimodular transformaWe

now

proceed

to

ter 2 starts with

tions like matrices are

determined

over

Euclidean domains. The fact that all operators in H many data raises the question whether these data

by finitely

is to say, a desired operator) can be determined exactly. We will address problem by discussing symbolic computability of the relevant constructions in that ring. Furthermore, we will present a description of H as a convolution algebra consisting of distributions with compact support. In Chapter 4 we finally turn to systems of DDEs. We'Start with deriving a Galois-correspondence between behaviors on the one side and the modules of annihilating operators on the other. Among other things, this comprises an algebraic characterization of systems of DDEs sharing the same solution space. The correspondence emerges from a combination of the algebraic structure of 'H with the basic analytic properties of scalar DDEs derived in Chapter 2; no further analytic study of

(that

this

1 Introduction

systems of DDEs is needed.* The Galois-correspondence constitutes

an

efficient

machinery for addressing the system-theoretic problems studied in the subsequent sections. Therein, some of the basic concepts of systems theory, defined purely in terms of trajectories, will be characterized by algebraic properties of the associated equations. We will mainly be concerned with the notions of controllability, input/output partitions (including causality) and the investigation of interconnection of systems. The latter touches upon the central concept of theory, feedback control. The algebraic characterizations generalize the

control

well-known results for systems described by linear time-invariant ordinary differential equations. A new version of the finite-spectrum assignment problem,-

well-studied in the

analytic framework of time-delay systems, will be given in the Chapter 5 we study a problem which is known as state-space realization in case of systems of ordinary differential equations. If we cast this concept in the behavioral context for DDEs, the problem amounts to finding system descriptions, which, upon introducing auxiliary variables, form explicit DDEs of first -order (with respect to differentiation) and of retarded type. Hence, among other things, we aim at transforming implicit system descriptions into explicit ones. Explicit first order DDEs of retarded type form the simplest kind of systems within our framework. -Of the various classes of DDEs investigated in the literature, they are the best studied and, with respect to applications, the most important ones. The construction of such a description (if it, exists) takes place in a completely polynomial setting, in other words, no distributed delays arise. Therefore, the methods of this chapter are different from what has been used previously. As a consequence and by-product, the construction even works for a much broader class of systems including for instance certain partial differential equations. A complete characterization, however, of systems allowing such an explicit first order description, will be derived only for algebraic setting.

In the final

DDEs.

A

more

detailed

description of the

contents of each

chapter

is

given

in its

re-

spective introduction. We close the introduction with

some remarks on applications of DDEs. One applications occurred in population dynamics, beginning with the predator-prey models of Volterra in the 1920s. Since population models are in general nonlinear, we will not discuss this area and refer to the books Kuang [66],

of the first

MacDonald

[70],

and Diekmann et al.

of Volterra remained

basically

[22]

and the references

therein.

The work

unnoticed for almost two decades and

only in early forties DDEs got much at "tention when Minorsky [77] began to study ship stabilization and automatic steering. He pointed out that for these systems the existing delays in the feedback mechanism can by no means be neglected. Because of the great interest in control theory during that time and the

At this point the reader familiar with the paper [84] of Oberst will notice the similarity of systems of DDEs to multidimensional systems. We will point

structural

out the similarities and differences between these two

several occasions later

on.

types of systems classes

on

1 Introduction

the decades to follow the work of

rapid development

of the

of

Minorsky led to DDEs; for more

other

applications

and

a

theory period see for instance the preface of Kolmanovskii and Nosov [65] and the list of applications in Driver [23, pp. 239]. It was Myschkis [81] who first introduced a class of functional-differential equations and laid 'the foundations of a general theory of these systems. Monographs and textbooks that appeared ever since include Bellman and Cooke [3], El'sgol'ts and Norkin [281, Hale [481, Driver [23], Kolmanovskii and Nosov [65], Hale and Verduyn Lunel [49], and Diekmann et al. [22]. A nice and brief overview of applications of DDEs in engineering can be found in the book Kolmanovskii and Nosov

details about that

[65],

from which

we

extract

following mixing processes are engineering, with of because natural a delay, examples time-lag arises due systems to the time the process needs to complete its job; see also Ray [89, Sec. 4.5] for an explicit example given in transfer function form. Furthermore, any kind of system where substances, information, or energy (wave propagation in deep space communication) is being transmitted to certain distances, experiences a time-lag due to transportation time. An additional time-lag might arise due to the

list. In chemical

reactors and

standard

(ship stabilization) or for (biological models). A model

the time needed for certain measurements to be taken the system to of a turbojet

sense

information and react

on

it

engine, given by a linear system of five first-order delay equainputs and five to-be-controlled variables can be found in [65, Sec. 1.5]. Moreover a system of fifth-order DDEs of neutral type arises as a linear model of a grinding process in [65, Sec. 1.7]. Finally we would like to mention a linearized model of the Mach number control in a wind tunnel presented in Manitius [75]. The system consists of three explicit equations of first order with a time-delay occurring only in one of the state variables but not in the input

tions with three

channel. In that paper the problem of feedback control for the regulation of the Mach number is studied and various different feedback controllers are derived

by transfer function methods. This problem can be regarded as a special case of the finite-spectrum assignment problem and can therefore also be solved within our algebraic approach developed in Section 4.5. Our procedure leads to one of the feedback controllers (in fact, the simplest and most practical one) derived in

[75].

Algebraic Framework for Delay-Differential Equations 2 The

specific class of delay-differential equations we are some basic, yet important, properties. In this way we hope to make clear that, and how, the algebraic approach we are heading for depends only on a few elementary analytic properties of the equations under consideration. The fact that we can indeed proceed by mainly algebraic arguments results from the structure of the equations under consideration together with'the type of problems we are interested in. To be precise, we will restrict to linear delay-differential equations with constant coefficients and commensurate point-delays on the space C' (R, C). We are not aiming at solving these equaIn this

chapter

we

introduce the

interested in and derive

tions and

expressing the solutions

(appropriate)

-in terms of

initial data. For

purposes it will suffice to know that the solution space of a DDE (without initial conditions), L e. the kernel of the associated delay-differential operator,.

our

is "sufficiently rich". In essence, we need some knowledge about the exponential polynomials in the solution space; hence about the zeros of a suitably defined characteristic function in the complex plane.

Yet,

in order to pursue

by algebraic

means, the

appropriate setting has to be driving goal to handle also systems of DDEs, in other words, matrix equations. In this chapter we will develop the algebraic context for these considerations. Precisely, a ring of delay-differential operators acting on C1 (R, C) will be defined, comprising not only the pointdelay differential operators induced by the above-mentioned equations but also certain distributed delays which arise from a simple comparison of ordinary differential equations and DDEs. It is by no means clear that the so-defined operator ring will be suitable for studying systems of DDEs. That this is indeed the case will turn out only after a thorough algebraic study in Chapter 3. In the present chapter we confine ourselves with introducing that ring and providing some standard results about DDEs necessary for later exposition. In particular, we will show that the delay-differential operators under consideration are surjections on C1 (R, C). found first. The

force in this direction is

our

As the starting point of our investigation, let us consider a homogeneous, linear DDE with constant coefficients and commensurate point delays, that is an

equation of the type

H. Gluesing-Luerssen: LNM 1770, pp. 7 - 21, 2002 © Springer-Verlag Berlin Heidelberg 2002

2 The

Algebraic

Framework N

M

EEpijf( )(t-jh)=O, i=0

tER,

j=0

N,

where

M E No, pij c involved. Hence all

R, and h > 0 is the smallest length of the point delays are integer multiples of the constant h, thus commensurate. For our purposes it suffices to assume the smallest delay to be of unit length, which can easily be achieved by rescaling the time axis. Therefore, from now on we will only be concerned with the case h I and the equation

delays

=

above reads

as

M

N

EEpjjf(')(t-j)=0, i=0

(2.1)

tGR.

j=0

It will be

important for our setting that the equation- is considered on the full time axis R. Moreover, we are not imposing any kind of initial conditions but rather focus on the solution space in C C' (R, C), hence on B The choice C

C' A

:=

ff

E

L

1 (2. 1)

is

satisfiedl.

C)

is algebraically very convenient, for 'C is invariant shift, hence a module over the corresponding ring of delay-differential operators. In a certain way, however, larger classes of functions can be incorporated in the algebraic approach; this will be discussed occasionally throughout the book. =

under differentiation and

Observe that'equations of the type (2.1) cover in particular linear time-invariant ordinary differential equations (ODEs, for short) as well as pure delay equations

(N

=

0). briefly

Equation (2.1). Disregarding intuitively clear what the minimum amount 'of initial data should be in order for (2.1) to single out a unique solution (if any). It is natural to require that f satisfy f (t) fo(t) for t E [0, M], where fo is some prespecified function on the interval [0, M] and M is the largest delay appearing in (2.1). Then finding a solution on the full time axis R amounts to solving the initial value problem in both forward and backward direction. This, is, of course, not always possible. It also fails if one starts with an arbitrary smooth initial condition, i. e. fo C- C' Q0, M], C), and seeks solutions in L. But, if fo is chosen correctly (that is, with correct data at the endpoints of the interval [0, M]), a unique forward and backward &-solution exists; this will be shown in Proposition 2.14. The solvability of this restricted initial value problem for the quite general equation (2.1) rests on Let

us

think about initial conditions for

the requirement that solutions be

smooth,

it should be

=

the fact that

we

differentiability

consider

C'-functions, so that we have a sufficient amount of fo, necessary for solving the equation on

of the initial condition

the whole of R. Remark 2.1 It is crucial for

restrict to DDEs with

mensurate

of noncommensurate

essentially all parts of our work to delays. As it turns out, the occurrence

com-

delays

2 The

Algebraic

Framework

(like e. g. delays of length 1 and V2_ or -7r) leads to serious obstacles preventing an algebraic approach similar to the one to be presented here; see [47, 109, 111, 26]. At this point we only want to remark that in the general case the according operator ring lacks the advantageous algebraic properties which will be derived for our case in the next chapter. These differences will be pointed out in some more detail in later chapters (see 3.1-8, 4.1.15, 4.3.13). Remark 2.2 In the

theory of

DDEs

distinguishes equations of retarded, neutral,

one

and

advanced type. These notions describe whether or not the highest derivative in, say (2.1), occurs with a delayed argument. Precisely, Equation (2.1) is called

retarded if PNO : PNO 0 0 and PNj

4].

0 and PNj 0 for some

0 for

=

0

j

>

j M; it is said to be neutral if 1, 0, and advanced in all other cases, see [28, =

.

.

.

,

This classification is relevant when

solving initial value problems in forward Roughly speaking, it reflects how much differentiability of the initial condition on [0, M] is required for (2.1) being solvable in forward direction; see for instance the results [3, Thms. 6.1, 6.2, and the transformation on p. 192]. Since we are dealing with infinitely differentiable functions and, additionally, requite forward and backward solvability, these notions are not really relevant p.

direction.

for

Let

our

us

purposes.

now

Introducing

rewrite

af (t) and the

p(D, u)f

Equation (2.1)

the forward shift :=

f (t

-

a

1),

in terms of the

of unit

where

f

is a1unction defined

ordinary differential operator 0, where N

p(D, a)

is

a polynomial simply

in the two

D

=

R,

d, Equation (2.1)

dt

reads

as

M

(2.2)

j=0

commuting operators D and B

on

1: 1: pij D'ai i=0

is

corresponding operators.

length

ker p(D,

a)

C

a.

The solution space

(2.3)

L.

For notational reasons, which will become clear in a moment, it will be convenient to have an abstract polynomial ring R[s, z] with algebraically independent

disposal. (The names chosen for the indeterminates Laplace transform s of the differential operator D and the z-transform of the shift-operator in discrete7time systems.) Since the shift U is a bijection on L, it will be advantageous to introduce even the (partially) Laurent polynomial ring elements

s

and

z

at

our

should remind of the

R[s, z Z-1

N i=O

pijSY j=m

Tn, ME Z N E=

No, pij

E R

10

Algebraic

2 The

Associating

Framework

polynomial the delay-differential operator (including possibly backward shifts) we obtain the ring embedding with each Laurent

R[s, z, z-1] (of

)

EndC (,C),

p

p(D, o,)

)

i

(2.4)

polynomial, then the operator p(D, 0') is not the words, the operators D and a are algebraically operator C). R the in over independent ring Endc(,C). Put yet another way, C is a faithful module over the commutative operator ring R[D, a, o-1]. course, if p is

zero

Let

a nonzero

In other

on

exponential functions eA*

look for

us now

like for ODEs

in the solution space

(2.3).

Just

has for A E C

one

(NE

p(D, o,) (e A.)

i=O

M

N

M

E.Y pjjA e- \j )

E pjjDY) (e\')

e

A-

(2-5)

,

j=M

i=O

j=M

p(A, e--\)e"' Hence the

function the

exponential

p(s, e-')

function e,\* is

solution if and

a

only if

A is

a zero

of the

which therefore will be called the characteristic function of

0. Obviously, it delay-differential equation p(D, o,)f as exponential polynomial (or quasi polynomial).

is

=

an

entire

function,

known

Before some

providing

details

some more

on

exponential polynomials,

we

want to fix

notation.

Definition 2.3

(1)

tions

(2)

by H(C) (resp. M(C)) the ring on the full complex plane.

Denote

For

a

zeros

subset S C

H(C)

(3)

define the variety

fl,

S

case

.

.

.

,

JA

:=

fj I

is

C

finite,

=

P

I f (A)

In

case

q*

is

f

E

the set of all

as

we

call the set zeros

func-

common

E

S}.

m

meromorphic

p(s, e_S) 0(s)

V(q*)

'

.

.

E

.

,

fl)

for V (S).

R[s, z, z-']

function given

for

S

G

and

by

C\V(O).

the characteristic variety and its

of q.

and A E C let

ord.\ (f denote the

f

write V (fl,

j=

O(S) entire,

0 for all

EN 0 EM pijs'.zj j= the

M

H(C)

for A e C.

=

EN 0 EM Pijsie-i' j=

elements the characteristic For

V(S)

simply

we

=

0

q*(s)

(4)

E

R(s) [z, z-'], where p R[s]\f 01, denote by q* G M(C)

For q

(resp. meromorphic)

of S, thus

V(S) In

of entire

multiplicity of A '

,

minf k

E

as a zero

No I of

f.

f(k) (A) If

f

=-

76 0} 0,

we

put

ord.\(f)

=

oo

2 The

Algebraic

Framework

11

(1) of the next proposition is standard in the theory of DDEs. Just like ODEs, the multiplicities of the characteristic zeros correspond to exponen tial monomials in the solution space. As a simple consequence we include the fact that delay-differential operators are surjective on the space of exponential polynomials. Part

for

Proposition

(1)

2.4

R[s, z, z-'] \10}.

Let p e

No and A E C denote by ek,A tke,\'. Then

For k

ek,A (t)

,=o

In

particular, ek,X

C ker

H(C) is called operator p(D, o). (2) The operator p(D, o)

p(D, o)

is

ponential polynomials cisely, let a := ord,\(p*) !

monomial

only if ord,\ (p*)

k. The function

>

delay-differential

surjective endomorphism. on the space of exspan(C f ek,A I k E No, A E C}. More pre-

a

B

ao,

if and

the characteristic function of the

E

al+a E

exponential

(k)' (p*)()(A)ek-K,A.

p(D, u)ek,,%

p*

L the

E

:=

0.

Then, for

:

0 such that

C with al+a

all el,,\ E B there exist constants

( 1=0 +a

p(D, a)

E a,, e,,,,\

(2.6)

el,,\.

r.

PROOF:

(1)

Let p

verified in the

=

I:i,j pijs'zi

following

E

R[s, z, z-1].

The asserted

identity

is

easily

way:

di

(p(D, u)ek,.\) (t)

[(t

Pij Tti

_

E P'j Tti dAk (eA(t-j)

10

dk dAk

dk

di

j)k e)(t-j)]

1,3

(E pjjA'e'X(t-j) )

dk dAk

(p*(A)e\t)

1,3

k

E

=

K=o

The rest of

(1)

(k) (p*)( ')(A)ek-r.,A(t)K

is clear.

(2) It suffices to establish (2.6). We proceed by (p*) (a) (A). Then c : - 0 by assumption. For I =' 0 it follows from (1) that p(D, o) (c- 1 ea,,\)

induction

=

For 1 > 0 put al+a

1+a)-

1

a

c

-1

1+a

p(D, o,)(al+ael+a,,\)

=

al+a

E r.=a

.

Then, by

(1 a)

virtue of

eo,,\,

as

1. Put

c

desired.

(1),

+

K

on

el+a-r.,,X

=

el,,x +

1:'bjej,,\ j=o

2 The

12

Algebraic

Framework

some constants bj G C. By induction the functions bjej,,\ have preimages involving solely exponential monomials ei,.\ with i < 1 + a 1. Combining them El suitably with the equation above yields the desired result.

for

-

foregoing

play exactly the correspond to the exponentialmonomials in the solution space. The main difference to OI?Es is that the characteristic function has infinitely many zeros in the complex plane unless it degenerates to a polynomial. Since this property will be of central importance for the algebraic setting (in fact, this will be the only information about the solution spaces of DDEs we are going to need), we include a short proof showing The

same

role

how it in

as

ODEs,

in the

sense

that their

zeros

be deduced from Hadamard's Factorization Theorem. The estimate

can

(1)

part

considerations show that characteristic functions

for

below will be useful in

later section to embed

a

R[s, z, z-']

in

a

Paley-Wiener algebra. Proposition Let p E

(1)

2.5

R[s, z, z-1] .'Then

there exist constants

C,

jp*(S)1: (2)

0 and N G

C(I

+

No such that

ISI)N ealResi

for all

C

S

(C'

variety satisfies

the characteristic

#V(P*)

>

a

< 00 4==> P

=

Zko

for

some

k E Z and

0

E

R[s]\f01.

In the classical paper [88] much more details about the location of the zeros of p* can be found, see also [3, Ch. 13]. As we are not dealing with stability

issues, the above information (2) suffices for

(1) Letting p

PROOF:

=

:5

where C > 0 is

a

Theorem,

defined

Pij

estimate

we can


m. The function q* consequence, said to be the characteristic function of the operator 4.

As is

sum

2

:=

=

Proposition 2.4(l),

we

=

obtain

qf

=

and the desired result follows since

Pg

Em+kg,'Ev=0(v) (P _.,v=O

(p*) W(A)

=

K

0 for

n

K

< 1 +

k.

(A)e,-,.,,\ EJ

Remark 2.13

Notice that

we did not consider any expansions of solutions as infinite series exponential polynomials. Such expansions do exist, see [102] and [3, Ch. 6], the latter for solutions of retarded equations on R+. We will not utilize these facts since the only case, where the full information about the solution space is needed, is that of ODEs, see also (2.10). For the general case it will be sufficient for us to know which exponential monomials are contained in the solution space. Series expansions of the type above are important when dealing with stability of DDEs. We will briefly discuss the issue of stability in Section 4.5, where we will simply quote the relevant results from the literature.

of

We conclude

our

considerations

scalar DDEs with the

on

surjectivity of delay-

differential operators on L. This fact is well-known and can be found in [25, p. 697], where it is stated in a much more general context and proven with rather elaborate methods.

However,

we

shows what kind of initial conditions also

the opportunity to present the method for-solving initial value problems of DDEs.

gives

cedure

us

Earlier in this

be

would like to prove a version which also be imposed for the DDE (2.1). This

can

specified

chapter

we

in order for

briefly

(2.1)

of steps, the

standard pro-

addressed what kind of initial data should

single out a unique solution f. Apart from suggested that f has to be specified on an interval delay occurring in (2.1). For instance, a solution of the pure delay equation of 0 is determined completely by the restriction f fo := f 1 [0, 1). But in order that f be smooth, it is certainly necessary that the initial condition'fo can be extended to a smooth function on [0, 1] having equal (v) derivatives f0(v) (0) f0 (1) of all orders v E No at the endpoints of the interval. In other words, fo and all its derivatives have to satisfy the delay equation for 1. This idea generalizes to arbitrary DDEs and leads to the restriction given t smoothness requirements, of length M, the largest

we

-

=

=

to

=

2 The

18

Algebraic

(2.11) below,

in

Framework says that the initial condition has to be

simply

which

ble with the given

our

advanced equations of we could not find a reference for the result cedure is standard and

one

0 : 0, and

(1)

b]

:=

fo

E

Ej' o pj zi,

=

C' [0,

M]

there exists

f I [0,M]

PROOF:

--

(=- ker

unique function f

a

As

fo.

=

a

4 9 L satisfies f I [k,k+M]

g() (M)

for all

b.

[s],

0

po

:

pm,

=_

0 for

some

k c

(2.11)

No

p(D, o,)f on

g

and

L.

R, then f

-=

0.

b]

defined

M

0

-

g(') (t)

for all

v

E

No

(2.12)

j=o

for all t G

[a

+

M, b]

+

1]

which satisfies

be extended in

can

(2.12)

a

[a

on

-

unique way to a solution 1 + M, b + 1]. (Notice that the

initial condition given in the proposition is included To this

E

v

L such that

E

Epj (D)oif (v)) (t)

(t)

C' [a

a=O and

a or

-

0

1, b

G R

To prove the existence of f , we show: every fo C C' [a, length b a > M which satisfies the condition

(1)

(p(D, u)f(v)) -

0

pj,

consequence, the map 4 is surjective

interval of

on an

endpoints

satisfying

0

f

as

Furthermore,

(p(D, a)f ()) (M) (2)

b], (C)

Thms. 3.1 and 5.21. In f M for f C- C' [a, b]

[3,

well

as

0 1 where p let g E L

7io

G

M > 1.

For every

If

in the book

C' ([a,

2.14

po'

=

presented

course, to one-sided derivatives when taken at the

Proposition Let q

stated below. However, the prosimilarity of the proof given below

as

should notice the

for part (1) with, e. g., those the sequel the notation C' [a,

refers, of

compati-

approach comprises retarded, neutral, and also arbitrary order, and, additionally, requires smoothness,

DDE. As

as an

extreme

case

where

b=:M.) write po (s)

end,

=0

ai sz + Sr and

consider the

inhomogeneous ODE

M

po(D)f (t)

g(t)

=

1: pj (D) ci fo) (t)

-

(2.13)

j=1

for t G

[b, b + 1]

with initial condition

j(v) (b) (If a

r

=

0, then

po

unique solution

1 and c

=

no

(v)

f0

(b)

for

(b)

=

g (b)

=

0,..., r

-

(2.14)

1.

imposed). In any (2.13), (2.14) and j satisfies

initial condition is

C' [b, b +

1]

to

M

M

v

pj (D) ai

-

j=1

fo)

case, there is

r-1

(b)

-

E ai j(') (b) i=O

=

()

f0

(b).

Algebraic

2 The

Framework

19

(") Differentiating (2.13) and using (2.12) shows successively j(') (b) f0 (b) for all v E No. Therefore, the function f, defined by f, (t) for t E [a, b] and fo (t) f, (t) =j (t) for t E (b, b+1] is in C'[a, b+1] and, by construction, satisfies (2.12) on [a + M, b + 1]. In the same manner one can extend f, to a smooth solution =

=

on

[a

-

1, b

+

1];

one

takes the

unique solution of the ODE

M-1

pm(D)f (t)

=

g(t)

E pj(D)fj(t

-

-

j)

[a +

on

M

-

1,

a

+

M]

j=0

initial'data

with

(a

+

M)

fl(') (a)

=

for

v

=

0,

.

.

.

,

deg pm

-

1 and

puts

f2(t):=f(t+M)f6ra-1 0, that are linearly independent over Q. As shown in [47, Thms. 5.4 and 5.9], the algebraic approach leads to thepperator algebra

In

the

approach of

our

with noncommensurate

.

.

.

,

=

-

'H(j):=

f

G

f*(s)

above is due to Exa.

5.13]

ker

R[s,zj,...,zj],

p E

:=

[4].

q E

f (s, e-rls,...' e-"') Note that 'H

'H(j)

reveals that

is not

zout

identity by

kerfil

R[s]\fOl,v

and z'

a

algebraic approach

an

upon these issues in

common

divisor

equations with Chapter 4.

to

along with

a

Be-

matrices

H,

over

see

Exam-

3.2.3.

Example 3.1.9 For computational issues, which

will be addressed in Section

track of the coefficients of the indeterminates

starting with coefficients in Q. S2 Z + S2 1, q (1) Let p =

s

-

=

f (s) simple

=

s

requirement b* -

a* (s) (e-s +

choice

a

-1 G

=

8

over

ap

a

of the indeterminates Let p

=

Bezout

z, q

=

s

equation

+ 1 E I

the sole condition

=

-

a

Bezout

identity

Q(e).

as

_

are z

0

can

1) in

+

S

+

(Z

S2

+

_

82

0. The

identity

1) 82

Q (s) [z] n HO, that is, all the coefficients equation above are in Q.

in the

Q[s, z].

ap +

=

function

=

_

and

a*(-l)

p* (s) 'HO

over

a(z + 82

Ro suffices and leads to the Bezout

1

is

since

bq

H(C) forces a G Ro to be such that the 1) has a zero of multiplicity 2 at s

and b

s

s

+

S2

_(Z +'32

=

HO. Notice that

E

S2.

keep below,

will

we

as

q

Now the

3.6,

in the calculations

z

=

(p*)'(0) : 0. In this case, easily be found by rewriting it b

and

s

Q [s, z]. Then gcd,O (p, q) a Bezout identity s ap

G

=

-

and

(2)

H(C)

of the theorem above will be

addressed in the next section when

ple

E

q*

z1".....zl". The last identity

:=

simple examples. Part (b) considering

some

P*

=

greatest

a

N', 0

E

if -ri 1. A fairly simple example [47, Bezout domain whenever 1 > 1. As a

delays. We will touch

We illustrate the determination of

C

7i(j)

=

consequence, serious obstacles arise for

noncommensurate

4

H(C)

R(s, zl,..., zj) f

zvq wherenow

R[s, zj,..., zi],

p, q G

q

33

Divisibility Properties

3.1

Then p and q

bq one needs e-1. Hence

b*

are

coprime

1-a*e-'

8+1

E:

in

HO and for

H(C), leading

a

to

=

=

e-1Z

1

+

-

e-1z

S+1

desired. In this

case

(8+1) the coefficients of

8

and

z are

It is easy to see that no Bezout equation with coefficients in the field A of algebraic numbers exists. in the field

3 The

34

(3)

Let p

Algebraic Structure q

+1

e

e

identity

s

=

we

+

first let

0 have

=

a

Ro. The elements

G

z

e-\

and A +

of 'HO

=

are

coprime since the equations in C. To obtain

no common zeros

I and b

=

-

(s

bp

aq +

+

1)

s

+

=

a

Bezout

and get

(3.1.5)

e.

This is indeed the first step in the procedure given in the proof of Theo3.1.6(a) and corresponds to the elementary transformation

rem

I-(s The next step of the

R[s]

1)

+

I'll

q

.1

=

procedure would be

[0 where 6 E

01

1

-,

satisfies

+

8

-

e

transformation of the type

(s"+

e

-

8+1

(8+1)

e

1

a

(

' _e

S+1

J(-I)

e,

6(-e)

ee. Instead of

way, which would

require another step thereafter, Equation (3.1.5) implies

[q*(-e),p*(-e)] thus, by coprimeness of p and

with the

given

E

c

1-e

=

e-- e,

p, q, a, and b

As

E R.

a

a s

+ cp +

e

q+

s

+

e

-

-

going this as

follows.

0,

has

r-p*(-e) q*(-e)

'+P s+e

to the Bezout

C

and

identity

--q ',_4 +e

are

in

'HO and

P

e)z + (e ee)s (e ee)(s + 1)(s

-

one

consequence,

altering Equation (3.1.5) leads

proceed

P*(-e) [ q*(-e)

imR

a*(-e)) (b*(-e)) where

e

we

q, it follows

(a*(-e) b*(-e)) Indeed,

(' (b*(-e)) a

e

e

-

+

+

e)

e

2

q +

(e

-

1)z + (,e 1)s + (e ee-) (s + e) -

e -

e

-

Q(e, ee). (3) should demonstrate how (successive)

with coefficients in

examples (2) Bezout identities force to extend step by step the field of coefficients, in this case from Q through Q(e) to Q(e, ee). It seems unknown whether the transcendence degree of (Q(e, ee) is two, which is what one would expect. This is a very specific case of a more The

and

3.2 Matrices

general conjecture of Schanuel

WO

35

theory, which we will conjecture (just to

However, very little is known about this

in 3.6.5.

present

in transcendental number

over

2

give an example, it is only known that at least one of the numbers el or ee is transcendental, gee. [1, p. 119]). Handling of the successive field extensions forms an important (and troublesome) issue in symbolic computations of Bezout identities in 7to. We will turn to these questions in Section 3.6. The results stated

far show

so

algebraic

to their

respect

resemblance of 'H and

striking

a

being presented next, another postponed until Section 3.4.

is

one

H(C)

with

differences, one of them the dimension of the rings and has to be

structure. But there

also

are

Remark 3.1.10 For

commutative domain R with

a

of R if for all

bl,..., bn

exist

[30,

n

345].

p.

unity

says that 1 is in the stable range

one

satisfying R (a,, an+ 1) there blan+l,..., an + bnan+1), see e.g., equivalent to the property that for all

and a,, E R , an+ 1 c R such that R (a, + > 1

.

.

.

=

It is easy to

al,..., an+1 E R

see

satisfying

that this is

R

=

(a,,..., an+1)

=

En+1

.

there exist C2,

.

.

,

Cn+1 c R such

is a unit in R. While this is true for the ring H(C), see [82, that a, + i=2 ciai p. 138], this is not the case for the rings H and 'Ho, as the following example 1 and a2 shows. Let a, z in 'H and a Bezout equation 1 =

=

-

-

Cl

Considering coefficlents In

[82,

p.

(s

=

-

1) (s

-

2)

cial + C2a2

C2a2

E X Then a, and a2 are coprime implies for the coefficients -

a,

the roots of the denominators it

cl

139]

clal

-

a2 can

be

seen

that neither of the

and C2 can be a unit in X it has been proven that for every Bezout domain with 1 in the sta-

ble range unimodular matrices are finite products of elementary matrices. This result applies in particular to the ring H(C) and we arrive at Theorem 3.1.6(b) for IC

=

H(C).

3.2 Matrices

In this section

we

WO

over

turn

our

attention to matrices

over

'Ho. First of all, it is

an

easy consequence of the Bezout property that one can always achieve'left equivalent triangular forms. Rom Theorem 3.1.6(b) we know that this can even be

by elementary

transformations. But

even more can be accomplished. adequate commutative Bezout domain allows diagonal reductions via left and right equivalence for its matrices. In other words, matrices admit a Smith-form, just like matrices with entries in a Euclidean do-

done It is

a

row

classical result that

an

main. This will be dealt with in the first theorem below and

some

consequences

Algebraic Structure

3 'The

36

of Ho

pointed out. Thereafter we present a generalization of the concepts of greatest common divisors and least common multiples for matrices. As our arguments work over arbitrary commutative Bezout domains, the results will be given in that generality. The end of this section is devoted to a summary of the matrix-theoretic results in terms of general module theory. will be

Let

us

start with

triangular and diagonal forms.

Theorem 3.2.1 Let IC be any of the

(a) everymatrix that

(b)

IC is

is, there

rings H

'Ho. Then

or

P (=- 1C'11 is left

equivalent

Gl,,(IC)

exists U G

to

an

upper

such that UP is upper

triangular matrix, triangular,

elementary divisor domain, that is, by definition, every matrix equivalent to a diagonal matrix where each diagonal element divides the next one. Precisely, there exist V E Gl,,(IC) and W c Glm(IC) an

P Cz IC"I is

such that

VPW

diag,,

=

,m

(di,

.

.

_dl where

rk P and

di

elements

are

the

name

PROOF: Part

follows from

(They

(3.2.1)

drj /C\f01 satisfying di 1,, di+1

G

are

is

also

=

1,.

..,

r

-

unique

1. The up to

elementary divisors in [51, 64], explaining this type of diagonal reduction.)

also called

consequence of Theorem

a

see

for i

the invariant factors of P and hence

of rings with

(a) [51],

...

rrxr

=

units of 1C.

0

d2

A

diagonal

r

0 01

d,)

-

L with

,

.

[64,

3.1.6(a).

The statement in

(b)

473],

where is has been proven that adequate divisor domains; recall Theorem 3.1.6(c) for the

p.

are elementary adequateness of Ho. The uniqueness'of the diagonal elements follows, just like for Euclidean domains, from the invariance of the elementary divisors under left and right equivalence, which in turn is a consequence of the Cauchy-Binet theorem (valid over every commutative domain), see e. g. [83, pp. 25] for principal ideal

Bezout domains

domains. It is worth

C1

mentioning

that it is still

tative Bezout domain is

and

[68].

an

an

elementary

open

conjecture whether every commudomain, see [17, p. 492, ex. 7]

divisor

Remark 3.2.2 It is worthwhile over an

noticing that left equivalent triangular forms can be obtained arbitrary commutative Bezout domain R. This can easily be seen as

3.2 Matrices

follows. Let P

(pij)

=

R"' and alpil+. ..+a,,p,,l

G

over

gcd,,.(pjj,

=

-

-

'Ho

37

-,Pni)

=:

d

equation for the first column of P. Then the coefficients a,, , an unimodular form hence in row which, a using again are coprime (a,,..., an), R, the Bezout property, can always be completed to a unimodular matrix A E Gln(R), see [12, pp. 81]. This way one can transform P via left equivalence

be

a

Bezout

to

a

matrix with first column

.

(d, 0,

.

.

,

.

OT.

The rest follows

.

.

by induction. Our implicitly, use of (see the proof of

proof of part (a) above is slightly simpler since we made, the division with remainder as given in Proposition 3.1.2(g)

3.1.6(a)).

Theorem

Example

(1)

3.2.3

Consider the matrix

[

P=

Since the entries of P is

are

82

+ 1

8Z

Z-1

8z

coprime

2

i

-

1

'Ho,

in

H2X2. 0

EE

elementary divisor form of

an

P

given by

[0 P] [1 Z2(83._ S) 1

0

0

=

det

0

Z('5

+

8 S

can

be derived

as

gcd,. (s 2, z

form. Notice that

in

+

=

z

1

-

82

82

Example 3.1.9(l).

S2 + 1

_

matrices, let

In order to obtain also the transformation

riving a triangular equation

1)

_

(Z

_

*

us

begin with

=

s.

de-

The Bezout

1).

_

Hence

1)

-

1

get the left unimodular

we

transformation

82

1

S.

S

z

-

Sz

1

Sz

s+z-1 (

+ 1

2

S - ks

1-1z ,

0

S

(SZ

Z

+

+

1)

+

(SZ2 S(SZ2

j

[0 b_ s a

To obtain

a

diagonal

form notice that

there exist x, y G 'Ho such that 1 and x (I 2/3a)s-1 E Ho yields =

-

[-by 1_ [s b]

=

0

(2)

y

=

[I O's] [01

a

a

0

coprime in 'Ho, hence simple choice y 2/3

a are

+ ya. The

now

-

1

and

s

xs

o -b

-8

P]

0

=

-

det

The matrix

[

M is in

G12(R[s, z])

ring R0, however,

but not in

1 + -Z

S2

8z

2

1

E2(R[s, z]),

-

see

SZ] [16]

[97,

or

p.

it factors into

M=

I Is

1-z

0

11 I [ 0110, S] Is -

0

1 -

1

1

-11

1

1

0

Z-1

i -

676].

Over the

3 The

38

Let

Algebraic

Structure of 'Ho

equivalence p* I ,(c ) q* 4* p 1, q for p, q E H, given (for Proposition 3.1.2(c). Using diagonal forms, this can easily be Ho) generalized to matrices. To this end, we extend the embedding H -+ H(C) to matrices in the obvious entrywise way, thus return to the

us now

the ring

in

,Hpxq

Clearly, (PQ)*

H(C)Pxq,

P*Q*

=

and

(P

+

P

Q)*

=

=

(Pij)

P* +

P*

:=

(P V-).

(3.2.2)

Q*, whenever defined.

Proposition 3.2.4 Let Pi E Hpixq, i 1, 2, be two matrices. There exists F G H(C)P2XP1 such that FP,* P2* if and only if there exists X E 'HP2XP1 such that XPI P2. If P, and P2 have entries in Ho and P, satisfies rk rk then the P1 (S PI, 0) R(s) matrix X can be chosen with entries in Ho, too. =

=

=

=

1

PROOF: The where is

d

A is

if-part is.obvious. As for the other direction, let UPIV as

(3.2. 1)

in

and

U, V

are

partitioned accordingly. Then P2*V*

Q 7,3 -, the

3

xij X

X

E

Defining

[X,O]U

=

entries of

10 01 ,thus Q' ,A* 0

F(U-')*

=

EHP2.XPl.

agonal elements dj applicable again.

G

Ho of A

are

case

of entries in

not divisible

by

z,

'Ho guarantees that the di-

making Proposition

following

conditions

(a) P has a right inverse over H, that is, (b) P* has a right inverse over H(C). (C) rkP*(A) =p for all A E C. (d) P is right equivalent to [Ip, 0]. (e)

P

(f)

The greatest

can

be

Furthermore,

completed

to

common

a

The that

PM

=

are

Ip

unimodular matrix

char-

equivalent:

for

[Qp]

some

E

matrix M E 'Hqxp.

Glq (H)

divisor of the full-size minors of P is

each matrix Q E 7jrxq of rank p

Ae7jr,pisofrankpandP and P

following

3.2.5

matrix P E 7jpXq the

a

3.1.2 (c) 1:1

Another standard consequence of the diagonal reduction is the acterization of right invertibility for matrices over H.

Corollary

0 and

Q*. Proposition 3.1.2(c) yields Qjj xijdj for some x 'HP2 the desired E left factor is given by r, (xij)

The additional rank condition in the

For

'A 0

[Q, Q']

unimodular. Assume P2 V =

1001

=

C

can

be factored

Hpxq is right invertible

,

over

as

a

Q

unit in 'H. =

AP where

H. The matrices A

right resp. left equivalence. corresponding equivalences are true when H is replaced by Ho provided are

one

unique

up to

adds the condition rk

R(s) P(S)

0)

=

p in the

parts

(b)

and

(c).

3.2 Matrices

noticing that the above equivalences (a)

It is worthwhile

accordingly,

lated

valid for matrices

are

over a

any

Ho

(e),

'*

polynomial ring

field), too, see also Theorem 5.1.12 in Chapter Theorem of Quillen/Suslin on projective modules. (K

(c)

4=

over

39

if formu-

K [xi,

.

.

.

,

XMI

5. This is the celebrated

(f)", recalling the [,A, 0] with A G "(c) =:>. (d)", in hence is unit detzA* and a HPIP. Then P* U* H(C), whence, by 0] Remark 3.1.5, A G Glp (H). Thus, P is right equivalent to [Ip, 0]. To establish 1 (d) => (e) ", let PU [Ip, 0] with U G Glq (H). Then Q [0, Iq-pj U- leads to the asserted unimodular matrix. "(e) => (f)" follows from the Laplace expansion PROOF:

"(a)

(b)

=>.

(c)"

=: ,

obvious, and

is

"(c)

is

so

units in 'H from Remark 3.1.5. As for

#

let PU

=

=

"

=

=

of det

IQ] along P

the block

row

Q

For the factorization of

given by P.

use a

The

implication "(e)

diagonal form Q

unimodular matrices U and V. Then A

=

=>

Udiag, X q (di,

Udiagxp(dl,..., dp)

=

(a)"

and P

is trivial.

dp) V with [1p, 01V =

yield the desired result. The uniqueness is straightforward. The additional condition for the ring Ho guarantees that z, which is not in

7-10,

is not

unit

a

divisor of the full-size minors of P.

a common

The second part of this section is devoted to a generalization of the concepts of greatest common divisors and least common multiples from functions in H to matrices

over

'H. We will formulate the results for matrices

commutative Bezout

greatest p.

right

common

nonsingular,

31-36].

domain,

as

this is

exactly

divisor of two matrices,

theory multiple comes

is standard in matrix

A least

left

common

literally given below in Theorem

over

to Bezout domains and

over

one

of them ideal

principal by-product.

as a

arbitrary proof The

over an

what is needed for the

being square and domains, see [71, The result carries

to non-square matrices in the way

even

fairly standard, too, but seems present proof since the precise description will be needed later in Chapter 4, where a Galois correspondence between finitely generated submodules of Hq and solution spaces in fq of systems of DDEs will 3.2.8. This version looks

to be less known. We would like to

a

be established. The

following

notation will be

helpful.

Definition, 3.2.6 Let n, q E N and

(a)

Let

Jn,q

:=

n

< q.

J(PI,

-

-

-,

Pn)

E

Nn

11

< p,
0.

R1 X q B

U,

U4

B

0

for

some

R7n X q be two

Let

Gli+,n(R), partitioned according

1U3 U2] [A [D

e

to

U,

G

R'xl,

D G R'Xq.

(3.2-3)

Then

(a)

D is

a greatest common right divisor of A and B of full row rank r and as such is unique up to left equivalence. We write D gcrd(A, B). Moreover, there exist M G Rrxl, N E Rrxn such that D MA + NB and therefore, =

=

im AT +

(b) Suppose

im.6r rk A

=

=

im.

1,

Lir.

rk B

If r < 1 + m, then M

multiple of A and

:=

=

m.

U3A

B of full

=

row

-U4B rank.

E

R(1+1 -r)xq

F irthermore,

is

a

least

im AT n im

common

BT

=.

left

im MT.

3.2 Matrices

Every

least

multiple of A

left

common

and B in

over

R(1+1-1) xq

Ho

41

of full

row

equivalent to M. We write M lclm(A, B). If r 1 + m, the only common left multiple of A and B is the zero matrix; in particular, im AT n im, BT 0. It will be convenient to define lclm(A, B) Xq as the empty matrix in R' (The image, resp. kernel, of an empty matrix rank is left

=

=

=

.

is the

If rk A

zero

1,

=

subspace,

rk B

=

resp. the full

m, then rk A + rk B

space.) rk gcrd(A, B) =

+ rk lclm(A,

B).

all, recall that a matrix over an arbitrary commutative Bezout equivalent to an upper triangular form, see Remark 3.2.2. This

PROOF: First of

domain is left

guarantees the existence of the matrices U and D. It is not in triangular form; solely the full row rank is important.

(a) Using (3.2.3)

and

letting

[ U]

V2]

ui Q3 U4

we

V1 V3 V4

get the equations UjA + U2B

assertions of

As for

(b),

(a)

consider the

one

gets rk

of U

U3

=

case r




quotient

space M :=r

Rl+'/im [AT, _BTTtogether with the two maps from Rq

factoring through A and B, see [67, p. 59]. Indeed, with the proof above, it is easy to see that the map

into M in the

M,

V

)

I'lv+im['] -

V4

-B

notation

as

Algebraic

3 The

42

Structure of Ho

an embedding of Rl+'-' into M. Moreover, the finitely generated decomposes into its free part and its torsion submodule as follows:

is

M

(By

virtue of the

domains the in

.V)(R'+'-')

=

are

way

same

[67,

p.

free as

E)

v

+ im

[ ]I A

-B

I

a

c

R:

av

c im

module M

[A]j -B

-

'

finitely generated torsion-free modules over Bezout [97, p. 478], the decomposition above can basically be proven for principal ideal domains, for which a proof can be found fact,

that

533].)

Remark 3.2.10 We wish to end

our

matrix-theoretic considerations with

an

interpretation of

the results given above into module-theoretic terms. First of all, the Bezout property of H can simply be expressed as stating that every finitely generated a free 'H-module of rank one. Secondly, the left or right equivalent triangular forms for matrices over H (Theorem 3.2.1 (a)) imply that every finitely generated submodule of a free H-module is free. Indeed, if M is such a finitely generated module, we can assume without loss of generality that M C 'Hr for im Q for some matrix Q E Hr,,. Using a right some r C N and that M equivalent triangular form of Q, one can single out a full column rank matrix im , showing that M is free. Thirdly, the sum of two representation M finitely generated submodules N, and N2 of an H-module M is certainly finitely generated again, hence a free module if M is free. The construction of a greatest common right divisor in Theorem 3.2.8(a) presents a way of how to construct a basis for the sum N, + N2, given generating matrices AT and BT for N, and N2, respectively. More interesting from a module-theoretic point of view is the fact that also the intersection N, n N2 of two finitely generated submodules of a free H-module is finitely generated and free again. A basis for N, n N2 is given by the least common left multiple of generating matrices for N, and N2 (see Theorem 3.2.8(b)). Observe that all the above is true for arbitrary commutative Bezout domains (see Remark 3.2.2). In commutative ring theory the situation above is captured in a more general context by the notion of coherent rings

ideal of H is

=

=

and modules. A module M

over a

commutative

ring R

is called coherent if M

finitely generated and every finitely generated submodule N of M 'is finitely presented, hence there is by definition, an exact sequence F, --+ Fo --+ N - 0 with finitely generated free modules F0 and Fl. A commutative ring R is called coherent if it is a coherent R-module, hence if every finitely, generated ideal of R is finitely presented [38, Sec. 21. Since finitely generated free modules are trivially finitely presented, every commutative Bezout domain is coherent. It is known that if R is coherent, then every finitely generated submodule of a free R-module is finitely presented [38, Thm. 2.3.2]. This generalizes the situation is

for commutative Bezout domains where these modules turn out to be free

as we

have

submodules of

a

generalization, of the greatest left multiple. arrive at

a

even

above. Furthermore, sum and intersection of two coherent coherent module are coherent again [38, Cor. 2.2.4] and we

seen

common

divisor and the least

common

Systems

3.3

Systems

3.3

In this section

over

we

over

Rings: A Brief Survey

43

Rings: A Brief Survey

want to take

short excursion into the

a

area

of systems

over

rings. We present some of the main ideas and discuss the ring Ho with respect to some ring-theoretic properties arising in the context of systems over rings.

theory of systems over rings is a well-established part of systems theory, mainly by the papers [79, 105], in which it has been observed that in various types of systems, like for instance delay-differential systems, the main underlying structure is that of a ring. As a consequence, the properties of such systems can be studied, to a certain extent, in an algebraic setting. This in turn has led to several notions for rings, which, beyond their system-theoretic background, can be studied in purely algebraic terms. The book [12] provides not only an excellent overview of these various concepts, but also introduces a variety of rings to systems theory. Although our algebraic approach to delaydifferential systems is not in the spirit of systems over rings, the book [12] has been our main guide through the area of Bezout domains and elementary divisor The

initiated

domains. In the

sequel

we

want to survey

For the moment it

one

might simply

branch of the

serve as a

(weak)

systems theory. But there is also

a

where

topic.

The

we

will

come

back to this

theory of systems

over

brief introduction into that

connection to Section 4.5 of

rings.

area

our

of

work

starting point for the theory of systems over rings is the description of dynamical system as an equation

a

linear first-order discrete-time

Xk+1

where A

E

R"' and B

C

=

R"'

AXk are

+

Buk, k

matrices

>

(3.3.1)

0,

over some

ring R and

Xk G R'

and Uk E R' are the sequences of the states and inputs, respectively (at this point there is no need to consider an -output equation). Rom a system-theoretic

point of

a lot of natural questions arise. The most basic one is whether possible for a given system (3.3.1) to steer it from one state to any other in finite time by suitable choice of the inputs Uk. This is the well-known notion of reachability and can be expressed solely in ring-theoretic terms. or

view

not it is

(1)

The pair (A,B) is called reachable, if im[B,AB,...'A'-'B] R', see [79, 529]. If R is a domain, the above is equivalent to [AI A, B] being right invertible over the polynomial ring R[A], see [46, Thm. 2.2.3]. =

p.

It is

a

-

purely algebraic

the internal modes Uk

=

can

-

result that for reachable systems (3.3-1) over a field arbitrarily by use of static state feedback

be altered

FXk. This problem of modifying the systems dynamics can equally well rings. In this case it falls apart into two subproblems.

be formulated for

(2) [105,

20]

A pair (A, B) E R1 I I x R1 x ' is called coefficient if for each monic polynomial a E R[A] of degree n there exists p.

assignable, a

feedback

Algebraic Structure

3 The

44

of Ho

loop system given by

matrix F E R"I such that the closed Xk+1

(3), [79,

=

p.

(A + BF)Xk has characteristic polynomial det(Al xn x Rnxm 530], [105, p. 20] A pair (A, B) G Rn

assignable, such that

if forallaj,...'an G R there exists

det(/\I

A

-

-

BF)

Flnj= 1 (,\

=

a

the

A

-

-

equation

BF)

=

is called

a.

pole

feedback matrix F e Rmxn

aj

_

It is easy to see that coefficient assignability is stronger than pole assignability which in turn implies reachability, see [105, p. 21]. Whether or not the converse is true,

depends strongly

on

the

underlying ring

R. This has led to the

following

notions.

67] A ring R is called a CA-ring (resp. PA-ring) if pair (A, B). is coefficient assignable (resp. pole assignable).

(4) [12,

p.

each reachable

a CA-ring. In the general case of systems over rings a particular 1. In this case, simple case arises if there is only one input channel, that is, M reachability of (A, b) simply says that b is a cyclic vector for the matrix A and one straightforwardly verifies that (A, b) is coefficient assignable.

Each field is

=

As

a

the

one can

consequence,

following

show that

ring

a

is

a

CA-ring

if it is

an

FC-ring

in

sense.

(5) [105, p. 21], [12, p. 74] A ring is said to be an FC-ring (feedback cyclization and ring), if for each reachable pair (A, B) there exists a matrix F G Rm a vector v G Rn such that (A + BF, Bv) is'reachable. xn

simple rings it is surprisingly difficult to see if they have one of the properties above. We confine ourselves with reporting the following results and open questions. Even for

(i)

The

polynomial ring R[z]

is

a

PA-ring [105,

p.

23],

but not

a

CA-ring.

For

instance, the pair

(A, B) is

reachable, but

det(/\I and [99].

-

ii)

The it is

(iii)

In

A +

( rzo- [-1 Z20 1] 0

0

does not allow

BF)

A2 + A

=

+

a

(3.3.2)

-

R[Z]2x2 such that R[z][A]; see [29, p. 111]

feedback matrix F (E

(Z2

+

ring C[z] is a CA-ring [11], but FC-ring [100].

z

+

it is

2)/4

an

E

open

question whether

or

not

an

[10,

Thm.

0.1]

polynomial ring K[z] is a CA-ring, taking square roots and even qth such that the qth roots of unity are contained in K.

it is shown

that, if

a

then the field K is closed with respect to roots for every

prime

(iv) Every elementary

q

divisor domain is

a

PA-ring [12,

Thm.

3.13].

ring Ho has any of the properties PA, CA, or'FC. From the general result quoted in (iv) we obtain immediately that HO is a 'PA-ring. Furthermore, it is not hard'to see that HO is not a CA-ring. Indeed, Ho is contained in. R(s) [z], which is not CA according to the result quoted in (iii). But applying the proof in [10] to the ring R(s)[z]

In this context it is interes'ting to

see

whether the

3.4 The

Nonfinitely

Generated Ideals of Ro

provides an example of a pair (A, B) c H2X2 0 over 7io but not coefficient assignable. Hence HO is not an FC-ring either. even

We will

back to

come

a

slightly different

H2X2 0

which is, reachable

not CA

and, consequently,

X

notion of coefficient

Section 4.5. The topic of realization theory for systems addressed in the introduction to Chapter 5.

Nonfinitely Generated

3.4 The

45

over

assignability in rings will be briefly

Ideals of WO

The Bezout property of Ho says that all finitely generated ideals are principal, they are completely described by one generator. In this section we focus

hence our

can

attention to the

be

set -of

fully

zeros

nonzero

nonfinitely generated ideals. As we will see, each such ideal by one "generating" polynomial along with a specified

described

(counting multiplicities).

prime ideals

are

As

a

consequence, it will turn out that all

maximal, in other words, the Krull-dimension of HO

is

one.

The results of this section

are

directly

not

related to

differential equations in the next chapter. However, for a further algebraic study of the ring RO. We restrict to Let tion

us

thering HO.

The results about 71

our

can

investigation of delaythey are interesting

think

we

readily

be deduced.

first rephrase the characterization of prime elements, given in Proposiin ideal-theoretic language. The following is an immediate conse-

3.1.2(d),

quence of the Bezout

property together with Proposition 3.1.2(d) and

Proposition 3.4.1 Let 10} 54 1 C Ho be

a

I is prime

begin

Then

I is maximal I

We

finitely generated ideal.

(i).

=

0)

for

some

irreducible

R[s]\101

or

I

=

(z).

investigation with an important class of nonfinitely generated They can be regarded as "generalized" principal ideals, for the information on such an ideal is contained completely in one (generating) polynomial. These ideals will serve as a sort of building block for all nonfinitely generated ideals. In the sequel a polynomial 0 E R[s] is called monic if 0 54 0 and its leading coefficient is 1. our

ideals in HO.

Definition 3.4.2 Let p E R[s, z). Deline Dp all admissible denominators of p.

R[s] 10

monic and

Furthermore,

let

0

1,0 p}

to be the set of

3 The

46

Algebraic

Structure of Ho

((p)) We call

((p))

:=

the full ideal

jhP

I

0

h E

((p)).

It is clear that

full ideal is indeed

a

((p)),

generate

Proposition

G

Dpj

7io.

C

generated by p and the polynomial

full generator of

to consider full ideals

7io, 0

an

generated by

p is said to be

ideal of Ho. Notice that there is

ho\R[s, z]

q E

q

as

=

P-

E

no

a

need

Ho would fully

too.

3.4.3

R[s, z]\101. b)) is at most countably generated 0. is, Vif I f E ((P))} (2) Let q E R[s, z]\10}. Then

Let p C

(1)

The ideal *

((q)) In

(3)

particular, ((q))

Let

and has empty variety, that

=

=

((p))

9

((p))

if and

b))

p

only if q

q.

V)p for

some

0

E

R(s).

p. Then

z

p E

R[s]\f01

=

Ho

-

R(s)

n

assume

some

E M

((p))(M)

q c

is

finitely

Ho. Then

there is

some

-Ho, which implies

(po- 1),

=

q

h4>

=

E

M C

gen-

h-P- for V)

Ho such

Dp,

and

Remark 3.4.9 It is easy to see that we can confine ourselves without restriction to saturated admissible sets of denominators. Indeed, for all p e R[s, z] and each admissible set M of denominators for p

one

has. the

identity ((p)) (M)

=

((p)) (V)

where

M=f0GDpj0monic,]0EM: 010} is the saturation of M.

Now we can completely describe the ideals in Ho. The presentation of nonfinitely generated ideals given in part (3) below, will,be important later in Section 4.6 where we study the solution spaces of delay-differential equations corresponding to these ideals.

Theorem 3.4.10 Let

101 :/ -

I C

'Ho be

an

ideal and p G

Put M:=

0

G

R[s, z]

Dp

be

a

sandwich-polynomial

of 1.

J:E Ij. E

Then

(1) M=f0EDpj3heHo: gcd,,(h,0)=,1andhP.E1}. (2) M is a saturated admissible set of denominators. (3) I= ((p)) (M). (1) (h, 0) gcd,. PROOF:

The inclusion "C" is trivial. For "D" =

ah +

bo, where

a, b G Ho

are

multiply a Bezout identity I coefficients, by po-1 E Ho.

suitable

Algebraic Structure

3 The

50

(2)

If

c M

and

0

R[s]

E

is

of Ho

monic divisor of

a

0,

say

Op

0,

=

P

then

=

pR

G 1.

Hence M is saturated and there remains to show that M is closed.with respect to taking least common multiples. To this end, let 01, 02 G M and 0 1CM(01 1 02) =

0 1 2

Since R [s] is factorial, we may write Rom the above we know f G I for i

1,

Oi

P

+

01 Since

(3)

gcd(ol

021 0102)

+

=

((p))(M)

The inclusion

7

02

1,

C

=

we

(01

+

where

,

j I

Oi

and

gcd( ,, 2)

=

1

2. Hence P

02)-7--

E I-

0102

obtain from

(1)

that

0

=

0102

E M.

I is immediate from the definition of M. For the

let q G I, hence q E ((p)) since p is a sandwich-polynomial of 1. Then hP- for some 0 Dp and h (E Ho. Using Proposition 3.1.2(h), one can

converse

q

=

assume

gcd,. (h, 0)

1, which, by (1), yields 0

G M.

Thus q c

desired.

f0} 7

.for

3.4.11

I C

some a

PROOF:

as

EJ

Corollary If

((p)) (M)

'Ho

c R

is

[s]

ideal

an

having

sandwich-polynomial

a

in

R[S],

then I

=

(a)

-

By Proposition 3.4.5(b), each other sandwich-polynomial of _T is in R[s], representation in part (3) of Theorem 3.4.10 completes the proof.

too, and the

n

The

following

of Ho

(cf.

theorem

Theorem

provides

3.1.6(c)).

an

alternative argument for the adequateness Ho is a one-dimensional ring, that

It reveals that

is, each ascending chain of prime ideals has maximal length that one-dimensional Bezout domains

are

adequate [12,

p.

1. It is well-known

95].

We would like

mention, that, H(C) Krull-dimension, yet still adequate [12, Thms. 3.17, 3.18].

of entire functions is of infinite

contrast, the ring

in

to

Theorem 3.4.12 Let

f01 :

(a)

If I is

(b)

I is

I C

lio be

an

ideal.

prime ideal that is not irreducible p G R[s, z]\R[s].

PROOF:

a

finitely generated,

then I

=

( p))

for

some

prime if and only if I is maximal.

(a)

Let I be

a

nonfinitely generated prime

otherwise the intersection would contain

an

ideal. Then I n

irreducible element

the consequence (a) C I, contradicting Proposition 3.4.1. Let p be a sandwich-polynomial of 1, hence 19 ((p)). But then even I

R[s] a

10} for R[s] with R[s, z]\R[s] ((p)) is true, =

G

OP -E I and the primeness of I together Dp we have p 10} implies R G I. Again by the primeness of I and by virtue of Proposition 3.4.3(2), p can be chosen as an irreducible polynomial in R[s, z]. since for each

with I n

R[s]

=

=

Ring 'H

3.5 The

(b)

as a

Convolution

Algebra

51

light of Proposition 3.4.1 we are reduced to show that each I ((p)), where p E R[s, zj R[s] is irreducible, is maximal. To this end, let I C J for some ideal J having sandwich-polynomial q E R[s, z]. The case q E R[s] can be handled with Corollary 3.4.11 and Remark 3.4.4. If q 0 R[s], then Proposition 3.4.3(2) applied to ((p)) 9 &)) together with the irreducibility of p yields ((P)) Hence I is

In

=

maximal ideal.

a

n

We close this section with the

following

result

concerning the uniqueness of the proof is lengthy but straight-

representation of nonfinitely generated ideals. Its forward and will be omitted.

Proposition

3.4.13

b)) (M)

Consider the ideal

R[s, z]

which is

9 'Ho where p

primitive

as a

E R

polynomial

in

[s, ;]

for

R [s] and

some

Furthermore,

z.

saturated admissible set of denominators for p. Let monic polynomial such that M n D4, D,, and put a

a

E

R[s]

M C

Dp

is

be the unique

=

0

k 11

Then

is

=

a

((-rip,))

in z, the

gcd(0, a)

JOEMI

and

ki

( -rf))) (k)

and

11

n

D,

=

us

D,p

satis-

unique presentation of the ideal ((p)) (M) in the following sense: ((-r2h)) (k2) where Pi G R [s, z] are primitive as polyn'omials

ki

g

D,pi

are

Then rl

I

saturated admis=

k2-

Ring W

3.5'The

Let

ER[s].

f 1}.

polynomialsri E.R[s] are monic, and satisfying Mi n D.,

=

0 a

sible sets of denominators and

-r=

saturated admissible set of denominators contained in

a

fying ((p)) (M) This provides let

=

recall that the

as a

ring i

r2,

PIP2

1

E R

Convolution Algebra has been introduced in

Chapter

2

as a

ring of delay-

differential operators acting on C' (R, C) The main purpose of this section is now to place this situation in the broader context of convolution operators. .

More

precisely,

will describe H

we

as an

algebra

of distributions with compact 2.9(2) will

support. The delay-differential operators q introduced in Definition turn out to be

form and

a

'the

associated convolution operators.

suitable

(isomorphic to)

can

with

the

are

Laplace see

trans-

that R is

rational expressions in the

J, and have compact support. The structure of these be exhibited in more detail by going through some additional and

explicit calculations. can

Using

it will be easy to

the space of distributions which

Dirac-impulses J(1) 0 distributions

Paley-Wiener Theorem,

In

particular, it will turn out that each such distribution of a piecewise smooth function and a distribution finite support, hence as a polynomial of Dirac-distributions. Algebraically,

be written

as

the

sum

3 The

52

Algebraic Structure of 'Ho

by the decomposition of the functions in H into their strictly polynomial part in a sense to be made precise below. For the algebraic approach to delay-differential equations this description is important

this is reflected

proper and their

because it allows

to abandon the restriction to C'-functions for the solu-

one

tions. Recall that from

view -the space Coo (R,

convenient to

is

turns out

an algebraic point of begin with, simply because it

(cf.

3.5-7)

Remark

that

over

a

module

over

C) is very R[s, z, .Z- 11. it

the proper part of H much

more

gen-

eral function spaces, for example Lj, are modules with respect to convolution, too. We will take this aspect into consideration when discussing input/output

systems in the

chapter.

next

For the main line df

of this section

are

of distributions sheds We

approach, where we restrict to Cl-functions, the results strictly necessary. Yet, we think the description in terms some new light on our investigations.

our

not

begin with fixing

notation. Let D' be the

some

vector-space of complex-

the space D := If E CI(R, C) I supp f is compactl, endowed with the usual inductive limit topology. Here supp, f denotes the sup-

valued distributions

port of

function

a

on

(or distribution) f. Furthermore,

D+'

T E D' -

supp T bounded

let

on

the

left}

and

E)c'

T

D' I supp T

compactl.

D,' with their extension to distributions on S, instead of L as in Chapter 2, is meant to indicate that the space 9 is endowed with the topology of uniform convergence in all S n D+' be the space of functions in E derivatives on all compact sets. Let S+ We will

identify

C' (R,

.6

C).

the distributions in

,

The notation

with support bounded

on

the Dirac-distribution at

the left.

a

Finally, denote by j(k) a

the k-th derivative of

Ei R. Recall that the convolution S*T of distributions

is well-defined and commutative if either both factors

are

in

V+

or

if at least

one

factor is in D. Moreover, convolution is associative if either all three factors are C is an R-algebra in V+ or if at least two of them are in D,. Finally, (D+, without p.

zero

divisors and with J0

as

identity [104,

p.

14,

p.

28/29]

or

[128,

124-129].

setting, differentiation (resp..forward-shift) corresponds to convolution Sy E R[s,z,z-1] and with J(1) j:L 1 ENO (resp. Ji). Precisely, for p 0 i= Pij In this

f

G

.6

we

j=

have

ff

=

XW E Y, Apo j=1

Notice that

R[JO(l), Ji, 6-1]

is

approach

Jj)

*

f

=

WO(l), Ji)

*

f

-

(3.5.1)

subring of D+' and isomorphic to R[s, z, z-1]. already in [61], where it was utilized for a delay-differential systems. a

This observation has been made

transfer function

*

i=O

to

3.5 The

In the

subsequent discussions

we

3 tk G

f

PC'

:

R

Convolution

as a

Algebra

53

will also consider the function space

R, k

E

C liMk-cx) tk

--+

Ring R

f I (tk,tk+ I ]

such that tk


q(J(1), Ji) 0

part.

r

refers to the left-derivative of g. Since p and 0 have real consequently jb(g) are actually real-valued and

the function g and

(3.5.6)

is

a

the coefficient of

decomposes

into

a

in R [JO(l), Ji, 5- ], which q(k), J,)..It vanishes if and only

polynomial

the distribution

J(N-r)

we

1

0

is

nonzero).

As

a

call the

if N
." notice that rk

the existence of m

+p

-

r

a

r Q Q, resulting that the maximality of m yields

nonsingular

free variables

so

r x

=

r-submatrix of

of all, by (2) the formal transfer function

-Q-1P

E

R(s, z)Pxm

C

R(s)((z))Pxm

nonanticipation, dealing with inputs having their support bounded to operator given by the distribution (- Q P) (JO(l), Jj) G (D+)P x acting on ET, see Theorem 3.5. 1. Precisely, for all u FT satisfying u- 0, there exists a unique output y E S+P given exists. For the

left,

it is most convenient to utilize the convolution '

=

by

y

=

(-Q-'P) (J(1), Jj) 0

* u.

If

-Q-1P

has support in [0, oo) and thus yconverse follows from Lemma 3.5.4.

=

0,

C

R(s) jz Pxm,

then

too. Hence B is

(-Q-'P) (k), Jj) 0

nonanticipating.

The 0

Rom the above it is immediate that every behavior can be turned into an i/o-system by suitably reordering the external variables. It turns out that the same

is true

to comment

even on

for

nonanticipation. Before proving that assertion, nonanticipation given above.

the characterization of

we

want

4 Behaviors of

92

Delay-Differential Systems

Remark 4.2.4 For

an

i/07systern

B

=

kerC [P, Q] the formal transfer function

and induces the distribution It therefore T

Since

D+'

(D+)'

S+

*

C

utilized this fact

(D+)P +

E

Q

P exists

see

Theorem 3.5. 1.

* U.

(4.2.2)

rise to the convolution operator

gives :

(- Q -'P) (JO(l), Ji) 0

-

)

(D+')P,

u

i

)

(-Q-'P)(60(1)'6J)

6+, the operator can be restricted to a map .6+1 already in the proof of part (3) above.) In this way,

9+'. (We

T may be associated with as an the regarded input/output (ilo-) operator system B. The of the restriction the to is B of all one-sided graph subspace n .6+m exactly

.67+P

trajectories

in B. The distribution

(-Q_1P)(JO(1), 61) is

E

(D+)Px"'

usually called the impulse response since its columns are the responses to the 6o ei E (D+')', where ei,...' en denote the standard basis

Dirac inputs ui vectors in R'.

According

=

to Theorem

4.2.3(3),

the operator T

(or

rather its

graph

in

9

rn+p

is nonanticipating if and only if -Q-1P C R(s)JzJPx'. As a consequence, each purely differential behavior kerc [P, Q] (that is, [P, Q] C R [s]P x (m+P)) is a nonanticipating i/o-system provided that Q is nonsingular. In this context no requirement like -Q-1P being a proper rational matrix arises. This is simply due to the fact that we allow C'-functions only, so that differentiation .(the polynomial part of a rational matrix) causes no particular difficulties. The situation isidifferent when taking other functions into consideration. In Remark 3.5.7 we discussed the possibility of more general functions spaces. Let us consider the case of (LI,,)+ -functions being fed into the system. Then, in order to avoid impulsive parts in the output, -Q-1P has to be proper in the sense

that map

-Q-1P C R(s),((z))Pxm (see (4.2.2) specializes to

Remark 3.5.7 for the

notation).

Then the

P

+

nonanticipating iff -Q-1P is a power series (rather than merely series) over the ring R(s),. For systems of ODEs this has been described in [120, p. 333]. We will call a system kerL [P, Q] satisfying the condition -Q-1P E R(s)jz 'xm a strongly nonanticipating ilo-s'ystem. At this point a main difference between behaviors defined by DDEs and those given by ODEs arises. The latter ones can always be turned into strongly nonanticipating i/o-systems by suitably reordering the external variables, see also [87, Thm. 3.3.22]. This is not true for delay-differential systems. For instance, for the behavior B given by [p, q] [5 92Z, I 'S3 z] neither q-1p nor p-lq is in

which, again, a

is

Laurent

=

R(s), ((z)). Thus, systems. But defines

a

on

B

_

_

neither way be regarded a strongly nonanticipating i/othe other hand, both quotients are in R(s) Jz , so the behavior B can

nonanticipating i/o-system (over C') either

way.

Input/Output Systems

4.2

following proposition provides

The

some

information how to read off

93

directly

expanding -Q-1P into a series, whether or not [P, Q], the system is (strongly) nonanticipating. The criteria take their best formulation by choosing a normalized form for [P, Q] in the sense that the matrix has no negative powers of z and a constant coefficient (with respect to z) of full row rank. Part (a) below shows that'each behavior admits such a normalized kernel-representation. The criterion for Q-'P being a matrix over R(S) zj is from the matrix

without

then very natural: the constant coefficient of Q has to be nonsingular. The normalization is a,Iso implicitly contained in the assumption of part (c) leading to a

strongly nonanticipating system. Although

we

will not dwell

on

the

case

of

(L,I,) +-trajectories

later on, we would like to include this particular criterion. It will be utilized later to demonstrate that the systems arising in Chapter 5 as well

the controller used for spectrum

as

assignment

in Section 4.5

are

actually

strongly nonanticipating systems.

deg's q for rational functions q G R(s) [z] given in 3. 1. For R(S)[Z]pXq we denote by M(s, 0) the matrix in R(s)PXq obtained 0 into M. We call M normalized if rkR(s)M(SI 0) substituting z R

Recall the definition a

matrix M E

after

=

Proposition

(a)

U e

(b)

4.2.5

For each matrix

L

e.

Let

=

71" (m+P) with rank p there exists a matrix (m+p) and such that U[P, Q] is in Hpx normalized, 0

[P, Q]

Glp(R[s, z, z-1]) rkR(s)(U[PQD(S,0) =P[P, Q] E Hop (m+p)'be a normalized matrix and Q x

Q-1P (c)

E

Let

[P, Q]

4==>

det

Q(s, 0):

0.

=

EL

=

Q-1P

Then

nonsingular. Then

zi with and det Q -A 0. Write det Q j= 0 qj (s) and Moreover, R(s) suppose deg, (det Q) deg, q0. supQ) is maximal among all degrees of the full-size minors of c

deg, (det

[P, Q].

R(s)[zJPxm

(m+p) Hpx 0

G

coefficients qj pose

E

be

G

R(s), jz px'.

by (a) and (b) every system can be i/o-system by reordering the external variables. Notice that

turned into

a

nonanticipating

PROOF: (a) It is enough to establish a denominator free version, i. e., [P, Q] G R[s, z]PI (m+P). Assume rk R(s) [P) Q1 (Si 0) < p. Then there exists a row transformation U E Glp (R [s]) such that the last row of U [P, Q] (s, 0) is identically zero. Hence the matrix

1IP-1 il U[P, Q1 0

0

has entries in R [s, ceed in the

on

as

the other

=:

[p, Qi]

[Pi, Q 1 ] (s, 0) p we are done. Otherwise we can pro[PI, Q1]. This way we can build a procedure which the current matrix [P1, Q1] satisfies rk [Pi, Q1] (s, 0) < p.

If rk

same manner

keeps running But

z].

Z_

=

with

long as hand, the procedure

must

stop after finitely many steps since

94

4 Behaviors of

the full

rank of

row

Delay-Differential Systems

[P, Q] guarantees

that the maximaldegree in z of the fullstrictly decreasing sequence of nonnegative numbers. the desired matrix after finitely many steps, which proves the

size minors constitutes

Thus

obtain

we

a

assertion.

(b)

Notice that both P and

Q are matrices over the ring R(s)IzT and Q is in only if det Q is a unit in R(s) zT, hence iff det Q(s, 0) = k 0. This proves For "=>." observe that P Q Aj (s) zi with coefficients Aj c R(s)Px"n implies P(s, 0) Q(s, O)Ao(s), which together with the normalization rk [P(s, 0), Q(s, 0)] p yields rk Q(s, 0) p. vertible

such if and

as

=

=

=

(c)

Let

us

=

start with the scalar

Q_iP

=

since ao

zi Too _j=0 aj (s)

=

q-lpo 0

The matrix

for

R(s),

c

1. Write P p EjM=0 pj (s) z3 where degs (det Q) reads as deg, qO ! deg, pj Using (b), we have Q-'P c R(s) zj, say G R (s). Now the result follows by induction

case m

pj cz R,(s). Then the assumption and deg, qo ; degs qj for each j. some a

i

and aj

=

=

=

=

on

q-lpj 0

j=

q lqjaj_j

G

R(s),.

consequence of the scalar case along with Cramer's rule. where Indeed, the entry (Q-'P)ij is of the form (det Q)-' det is the matrix obtained by replacing the ith column of Q with the jth column of P. case is a

jj,

Hence

Qjj

is

a

full-size minor of

tions combined with the scalar

[P, Q]

Oij

and the result follows from the assumpEl

case.

Remark 4.2.6 For normalized matrices i

=

1, 2,

reads

the

E

Hpox (m+p),

hence rk

[Pi, Qj] (s, 0)

kernel-representations

p for in Theorem 4.1.5 (a) =

as

ker,c [Pi, Qi] This

[Pi, Qj]

uniqueness result about

can

=

kerc [P2) Q2]

be verified

: XEGlp(Ho): [P2iQ2]=X[P17Q1]-

straightforwardly.

We close this section with

an algebraic characterization of autonomy. It is immediate from the definition that autonomous systems have no free variables. The converse is true as well and follows from the identity kerC R C kerL (det R 1q), -

where R is we

nonsingular, together Proposition 2.14(2). completeness, special case of finite-dimensional systems, which can easily by use of a diagonal form together with the scalar case in Corolwith

also include the

be derived

lary 2.6(a) and Lemma Proposition

4.1.10.

4.2.7

Let R EE -Hpxq be

(a) (b)

For

a

matrix with associated behavior B

B is autonomous if and

B is finite-dimensional

only

(as

if rk R

R-vector

=

kerc R

C Lq

-

Then

q.

space)

if and

nonsingular purely differential operator, nonsingular T E R[S]qxq. some

=

only

i. e., B

if B is the kernel of

kerc

T for

some

Systems

4.3 Transfer Classes and Controllable

4.3 rJ[ransfer Classes and Controllable

In Section 4.1

95

Systems

equality of behaviors

equivalence of we a weaker equivkernel-representations over alence relation on the lattice B, which will be called transfer equivalence. This notion refers to the fact that for i/o-systems each equivalence class is going to consist of the systems with the same formal transfer function. However, the equivalence itself can easily be handled.without use of any input/output partition, which is merely a reordering of the external variables, anyway. In particular, there is no need for giving an interpretation of -Q-1P as an operator. It will be shown that each equivalence class is a sublattice of B with a (unique) least element. This particular element can be characterized algebraically, but also purely in terms of its trajectories. It turns out to be a controllable system meaning that every trajectory of the behavior can be steered into every other within finite time without violating the laws governing the system. Finally, a direct decomposition of behaviors into their controllable part and an autonomous subsystem will be derived. characterized the

we

associated

H. Now

via left

will tum to

Definition 4.3.1

(a)

For B

o(B) (b)

kerc R, where R

=

:=

For systems define

B1

-

HpXq

E

define the output number of 8

by

rkR.

Bi

kerc Ri, where Ri

O(BI)

132

R2

=

=

O(L32)

MR1 for

Z

HpiXq have full

row

rank,

i

=

1, 2,

and a

nonsingular

matrix

M'C-

R (s,

z)PI xP,-.

provides an equivalence relation on the lattice B. We call two B, and B2 transfer equivalent if B, B2. The equivalence class behavior B will be denoted by [B] and is called its transfer class. This

tems

-

sys-

of

a

The output number is well-defined by Theorem 4.1.5(a). It does indeed count the number of output variables of the system, see Theorem 4.2.3(2). Observe that transfer equivalence simply means that the kernel-representations share the same rowspace as R(s, z)-vector spaces. Since R(s, z) is the quotient field of

the operator ring

B1 It is

-

L32

easily

seen

H, transfer equivalence

O(Bl) AR2 that for

=

=

O(B2)

can

just

as

well be

expressed

as

and

BRI for nonsingular

i/o-systems

transfer

matrices

equivalence

A,

B E RPI

is the

same as

IPI

equality

of the formal transfer functions. In the next theorem Nye describe the structure of the transfer classes. Among other things, we obtain thatbehaviors with right' invertible erators.

kernel-representations

are

exactly

the

images of delay-differential

op-

Delay-Differential Systems

4 Behaviors of

96

Theorem 4.3.2 Let B G B have output number o(S) p. Then the transfer class [B] of B is a sublattice of B. It contains a least element Bc and can therefore be written as =

[B] For

(1) (2) (3)

system B'

a

13' B'

G

=

[B]

=

13c,

=

kerc R' for

S' has

f B'

I o( 3')

B

Cz

following

the

full

row

an

right invertible R'

some

L31 132

E

1

[B]

B,

be given

-

rem

3.2.8, rk lclm(Ri, R2)

and

(d),

[B]

obtain

with respect to

(B1

[B] satisfying (2).

rank.

+

L32

Q

(4.3.1)

=

L32)

a

least

To this

Using Corollary

3.2.5

Bi

as

B,

-

for

some

Q

C-

Hqx(q-p)

[R2] Ri

some

rk R1

=

=

Ri

E

rk R2

-HpX q having

and, by

Theo-

R2) p, too. Using Theorem 4.1.5 (c) n B2), which implies the closedness of =

(BI

-

imCQ

be chosen left invertible.

kerc Ri for

=

rk gcrd(Ri,

-HpXq.

E

is B=

can

it follows rk

taking finite

As for the existence of in

Bc 9 13'1.

equivalent:

are

image-representation, that

rank. From

we

and

the least element.

of full column rank. The matrix

PROOF: Let

o(B)

=

and intersections.

sums

element, we first show that there exists a behavior kerc R where R G Hp X q has full row end, let B =

we

may factor R

R

=

as

(4.3.2)

BRc

where B G HPxP is

nonsingular

and

Rc

7jpxq is right invertible.

.

(4-3-3)

Now

Bc is

a

system in

To show the

:=

kerc Rc

G

[B]

[B] satisfying (2). implication "(2) = . (3)", let B' Completing R' to U:=

(see Corollary 3.2.5)

and

U-1 obtains kerc R'

=

V

=

Hence kerc R' C

R'Q

=

0.

kerc R'

=

invertible matrix R' EE 7jpXq.

one

(4.3.4)

=

[R']

partitioning the

U-1UV

imcQ

inverse

[Q', Q] according

imcQ. Indeed, for =

and the

[Q" Q]

G

[B]

for

some

v

to G

(0) W

converse

Q

(4.3.5) as

G

7jqX(q-p)

kerc R' and =

right

unimodular matrix

Glq(H)

E

U/

a

QW

c

w

(4-3-6) Uv

one

has

iM'CQ.

inclusion follows from the

identity

4.3 T ansfer Classes and Controllable

Systems

97

Hqx(q-p) imCQ for some matrix A is nonand invertible left is QA Q Q The matrix observe imc we Q can imCQ. singular. Using Proposition 4.1.4, be completed to a'unimodular matrix, say U-1 as in (4.3.6) and U as in (4.3.5), and the argument above leads again to B' kerc R', where R' is a imcQ matrix. invertible right For the

implication "(3)

=: ,

(2)"

,

of full column rank and factor

let B'

=

=

where

=

=

=

In order to prove "(2) (1)", we first remark that the system Bc defined in (4.3.4) is the unique system in the transfer class [B] with a right invertible kernel-representation. To see this, let MRc NRc, where R'c G Hpxq =

is

verses,

one

Al,

and

right invertible, too,

N G 'HPxP

N-1M, M-1N Rc showing that kerc Rc'

obtains that

are

'HPxP,

E

nonsingular. Using right thus

Rc'

=

(N-'M)Rc

in-

is left

kerc Rc by Theorem 4.1.5 (a). Now kerL Rc in [B]. We know kerC R' be any imc Q for some matrix Q . Let B' already that kerc lic LR for some nonsingular matrices K, L c 'HPxP behavior in [B]. Then KR'

equivalent

to

there remains to establish the

=

minimality of Be

=

=

=

=

0 and LBRc by (4.3.2) and (4.3.3). This yields R'Q latin the element least B'. Hence Bc is the (unique) thus imCQ C kerL R' the well as tice [B]. Together with Theorem 4.1.5 (a) we get (4.3, 1) as implication

and hence KR'

=

=

=

"(1)

=;>

(2)", completing

Obviously, the system kerc I

autonomous =

10}

as

proof

the

systems in Cq form

a

transfer class

having

the trivial

its least element.

system-theoretic significance. It is a controllable system in the sense that it is capable of steering every trajectory into every other trajectory within finite time and without leaving the behavior. Put another way, controllability is the possibility to combine any past of the The least element Bc of

system with we

a

transfer class is of

(far)

future of the system. In order to make this precise notion for combining functions.

any desired

first need

a

Definition 4.3.3 For w, w' E Cq and to G R deline the concatenation of the function wAt,,w' : R --+ (Cq given by

(wAt.w')(t)

W(t) W,(t)

Using concatenations, trajectory steering

(see [87,

Definition 4.3.4 A time-invariant there exists

some

wAocAtoutOw' Note that the

E

Def.

5.2.2]

can

w

and w' at time to

as

for t < to for t > to

be

and the

expressed

as

follows.

interpretation given therein)

subspace B ofCq is called controllable if for all w, w' c B time instant to > 0 and a function c : [0, to) ---+ (Cq such that

B.

requirement

wAocAt,,ot0w' utowf(to)

concatenation is smooth. Since

E =

B implies in particular, that the

w'(0),

the concatenation switches

98

4 Behaviors of

Delay-Differential Systems

exactly from w(O) to w'(0) but allows for some finite time to ! switching smooth and compatible with the laws of the system.

0 to make the

Remark 4.3.5

The definition of

controllability given above appears to be the most intrinsic merely refers to the collection of all trajectories of the system and does not make use of any kind of representation, for instance, a kernelrepresentation or a state space representation. A slightly different version of controllability, yet also based solely on the set of possible trajectories, has been introduced in the algebraic approach to systems theory in [125, p. 153]. In this case, the notion resorts to input/output partitions, which makes the concept of controllability more technical than the definition above. one

possible.

It

Of course, the space Lq is controllable. It is even controllable in arbitrarily short time, that is, for all w, w' EE rq and all to > 0 there exists a function c such that wAocAto, to Wt C Lq.

verify (straightforwardly) that the image U(wAtow') wAtow' under a delay-differential operator U is aconcatenation of U(w) and U(w') and some intermediary piece. Its length is determined by the size of the maximal retardation appearing in the operator U. In the next lemma

of

we

smooth concatenation

a

Lemma 4.3.6 Let w, w' EE Lq and to C- R be such that matrix U E

HpXq 0

Then there exists

be written a

function

PROOF: First of we

a)

proceed

all,

in two

Assume first U

Ej=o Ujzj

as c

[to, to

U(wAtow')

wAtow'

L

U

=

L)

+

E

Lq

Furthermore,

with coefficients

Uj

G

let the

R(s)PXq.

CP such that

--+

U(w)AtcAto+LU(W1)

it is clear that

.

U(wAtow')

C

E

Lp-

LP. As for the concatenation;

steps. G

R[s, Z]pXq

,

hence

Uj

R[S]pXq

G

L

Then

L

E Uj(wAt.w)(t

U(wAtow')(t)

.

-

j)

1: (Uj(w)AtoUj(w'))(t

=

j=o

-

j)

j=o

-_jL

=0

Uj (w') (t

[ELi= OUj (W) (t

-

-

j)

j)

U(w') (t)

=

=

U (W) (t)

if t > to + L if t

to

and the desired result follows.

b)

For the

put V

U( FD) Using

=

=

general 3

V(V),

the

VjO-' Uj R[s, Z]pxq Then

case

ELj=o V-z3

let

=

where V E Lq satisfies

appropriate

Vj G R[S]pXq and Vo-' and for all 70

where

C:

U

Lq

=

R[s]\10}. we

have

O(V) entrywise. Let fowAtow'. initial, conditions at to, one observes that one may find =

fv-

=

Systems

4.3 'h-ansfer Classes and Controllable

;V-

G

vAt,,v' proof yields U(Cv)

Lqsuch that V

part a)

of the

U(W)At0CAto+LU(WI) [to, to + 4 One obtains

=

for

some

w. But then w and 0(v') 0(v) V(D) V(vAt,,v') V(v)At,,CAt,,+LV(VI)

where =

99

=

=

=

=

=

suitable function

c

defined

the interval

on

13

immediately

Corollary

4.3.7

Let B be

a

time-invariant controllable

the space

U(B)

C

LP is

controllable,

subspace of Lq Then for all U .

E

HP'q

too.

Let U PROOF: Since B is time-invariant, it is enough to consider U E 'Hp"'. 0 as in Lemma 4.3.6. We have to show that for all w, w' c B the images

be

U(w')

U(w)

and

have

0,LWI

i7v

can

be concatenated within

U(B). By assumption

on

B

we

B and there exists to > 0 together with a function c such that wAocAt00,to+Lw' E B. Now Lemma 4.3.6 provides some intermediary

:=

G

function cl such that

U(fv-)

=

=

completing Now

we are

proof

the in

U(wAocAt0ato+LW/) U(W)A0C1Ato+L U(Oto+LWf) U(W)A0C1Ato+LU to+LU(WI), =

since

position

a

U(fv-)

E

U(B).

to establish the

following

characterization of control-

lable behaviors. Theorem 4.3.8

kerc R where R G HpXq is a matrix of rank r. Then B is controllable r for all s c C. only if rk R* (s) Bc, where Bc is the least consequence, B is controllable if and only if B

Let B

=

if and

As

a

=

=

element in its transfer class

[B].

Notice that the rank condition does not

depend

on

the choice of the kernel-

representation R.

Sufficiency follows from Corollary 4.3.7 together with the existence of image representations as derived in Theorem 4.3.2. For necessity we first prove the assertion for the case B C L, hence R E X Let w G B be any trajectory. By controllability there exist to > 0 and a function c such that 0 and v wAocAt,,O G kerc R. Using twice Proposition 2.14(2), we obtain v desired shows R and Lemma 2.12 0 'H R 0. c w 1, as Therefore, kerc (cf. 1 PROOF:

=

Remark

3-1.5).

For the

general

and V

are

case use a

diagonal form URV

unimodular matrices and dl,..., d, E

=

diagp

H\101.

xq

(di,

.

.

.

,

d,)

where U

Since kerc R is control-

kerc diagpX q (di, lable, the same is true for the system V-'(kerc R) dr) see Corollary 4.3.7. This implies the controllability of kerc di C f- for each =

.

.

.

,

7

100

i

=

4 Behaviors of

1,

r

and

now

Delay-Differential Systems the rank condition

on

R follows from the first part of the

proof. The second part of the assertion can be deduced from Theorem 4.3.2(2) by using a full row rank kernel-representation and resorting to the rank criterion in

Corollary 3.2.5(c)

for

right invertibility.

El

Remark 4.3.9

Reconsidering

the arguments above

of B is equivalent to the zero.

we see

with

capability of steering

hindsight

each

that

trajectory

controllability

in finite time to

Precisely, B is controllable

V

-

w

G

B 3 to

such that

In the next remark

we

want to relate the

! 0,

[O'.to)

C :

wAocAtOO

E

controllability

__4

(Cq

B.

criterion above to

some

other results in the literature. Remark 4.3.10

(i)

The criterion for Thm.

5.51.

In the

controllability special

representations it has been

case

in Theorem 4.3.8

appeared first

of behaviors

proven

having a polynomial by completely different methods

The result

in

[42,

kernelin

[91].

generalizes the well-known Hautus-criterion for systems of ODEs to delay-differential systems; see [50] for state-space systems and [118, Prop. 4.3] for behavioral controllability of ODEs. For certain time-delay Ax+Bu with matrices A, B over R[z] or even Ho,p, systems of the form b it is also known to characterize spectral controllability [6, 74, 73], a notion referring to the controllability of certain finite-dimensional systems associ=

ated with the

zeros

of

det(sl

-

A* (s)). In

[85,

Thm.

1]

it has been shown

that spectral controllability is identical to null controllability. The latter that for every piecewise continuous initial condition there exists a continuous control u of bounded support in [0, oo) such that the corresponding solution x is of bounded support. means

piecewise

(ii)

easily seen that the constant rank assumption equivalent to the quotient module M :=

It is is

on

Hq/iM'J

R* for

controllability torsion-free. being gr

The connection between the system kerc R and the module M has been explained in Remark 4.1.1. Recall in particular that for R being polynomial, the quotient T is taken as the definition of a R[s, Z]q

/iMR[s,z]ff

delay-differential system in [32, 80]. In [80], controllability, depending on an R[s, z]-algebra A, is defined algebraically as the torsion-freeness of the module A (&R[,,,,l T. Since M H OR[s,z] T, behavioral controllability coincides with the algebraic notion of H-torsion-free controllability in [80]. (iii) For systems of PDEs, or generally for multidimensional systems, the notion of controllability or concatenability does not come as straightforward =

as. for onedimensional

systems

(like

ODEs and

DDEs).

Various notions of

4.3 Transfer Classesl and Controllable

Systems

101

controllability have been suggested in [124] (see also [129, Sec. 1.4]) and characterized algebraically and in structural terms similar to our Theorems 4.3.2 and 4.3.8. Some of the structural characterizations appeared first in [84, PP. 139]; controllability of smooth systems of PDEs has been investigated in detail also in [86].

(iv)

It is

For systems of DDEs with noncomMensurate be summarized in Remark 4.3.13 below.

an

in Cq

delays the existing results will

immediate consequence of Theorem 4.3.8 that two controllable systems transfer equivalent if and only if they are identical. Put another way,

are

the formal transfer function, taken after a suitable input/output partition, determines the (unique) controllable behavior Be in the transfer class [B]. The

ptoof of the not

shows, see (4.3.2), (4.3.3) and (4.3.4), how this controlobtained from a given system kerc R, namely by cancelling

Theorem 4.3.2

lable behavior

can

be

nonsingular left factors (if any) of R (which for change the formal transfer function -Q-'P).

R

=

[P, Q],

of course, does

minimality of Be in the transfer class can be rephrased as follows: a system B if and only if it has no proper subsystem with the same number of free variables.. As we will show next, there is another way to characterize Be. It says that Be is simply the controllable part of B in the sense that it is the The

is controllable

maximal controllable subbehavior contained in B. Recall from Remark 4.1.9

that ker,7-j R is

Proposition

finitely generated for

every matrix R.

4.3.11

Let R E -Hpxq be

a

matrix and put B

=

kerL R. Let Be be the (unique)

con-

trollable system in the transfer class [B]. Moreover, let kerjj R im-H T C ? q for some T (E HqXt. Then Bc imLT. Furthermore, one has B' C Be for every =

=

controllable behavior B' contained in B. We call Be the controllable part of B.

By Theorem 4.3.2(3), each controllable behavior B' has an image0 imLT' for some T' (E -HqXr.- Hence B' C 8 implies RT' representation B' TX for some X E -Htxr and B' so that T' imLT' C imcT. As a special BRc is factored as case, we obtain Be C imLT. On the other hand, if R 0 and in (4.3.2) and (4-3.3), then kerH R kerh Re imH T, whence RcT D Be. This concludes the proof. iMLT C kerL Rc PROOF:

=

=

=

=

=

=

=

=

=

Remark 4.3.12

Another characterization of controllable behaviors

can

be found

in

[111, C'

" only C and the of D Cl-functions 9 is having compact support where, again, space --c denotes the closure with respect to the topology on S. The only-if part follows in essence from the existence of image-representations and the denseThm.

3.5].

A behavior B C Sq is controllable if and

if B

=

B n Dq

102

4 Behaviors of

ness*of

D in S. The

Delay-Differential Systems

proof of

the other direction

form to the scalar case, where then kers p n D (Proposition 2.14(2)) is the key argument.

can

be reduced via

0 1 for each

a

diagonal

nonzero

p E 'H

Remark 4.3.13 In the

same

paper

[111],

controllable behaviors have been

tems with noncommensurate

type discussed in Remark

delays and

4.1.15(4).

investigated for

sys-

for convolution systems of the In this generality, it is not known whether even

the properties (a) controllability, (b) having a kernel-representation with constant rank on C, (c) having an image-representation, and (d) being the closure of its compact support part, Thms. 3.5, 3.6] that for R G next

equivalent. However, it has been shown in [111, each of the following conditions implies the

one:

(i) kerE R (ii) ker,, R (iii) kere (iv) (V)

are

(SI)pXq

R

kers R

=

im

,

Q for

some

Q

E

is controllable in the =

kers R

n J)q

-6 =

im,,Q

for

(S') q 1, x

sense

of Definition 4.3.4,

-0 I

some

Q

E

(Ef)qX1

LR(s) is constant on C, where LR denotes the Laplace (in this case, kerg R is called spectrally controllable).

rk

If R has full

one

a

R E=_ HpXq

row rank, then delay-differential operator

transform of R

"(v) =>. (iv)". In the special case of (see the Remarks 3.1.8 and 4.1.15) it is proven in [41, Thm 3.12] that "(iii) - * (iv) 4#. (v)", regardless of any rank constraint. The implication "(v) =* (h)", however, does not hold for general x operators R c Hp q, see the example in [41, Ch. 4]. Controllable systems are, in

also has

just the extreme opposite of ausystems. Controllability capability to switch from any trajectory to any other, in other words, the past of a trajectory has no lasting implications on the far future. On the other side, autonomy prohibits any switching at all, because, by definition, the past of a trajectory determines completely its future. These two extreme points on a scale of flexibility for behaviors can also be expressed in module-theoretic terms. It is easy to see that a system A kerc A is autonomous if and only if its annihilator in 71 is not trivial (indeed, if A is nonsingular, then det A e ann(A)\f0}; the other direction follows from Theorem 4.1.5(a)). On the other hand, it is not hard to show that a behavior B is controllable if and only if it is a divisible li-module, that is, if each a G H\f01 is a surjection on B. Next we show that each behavior can be decomposed into a direct sum of its controllable part and an autonomous subsystem. tonomous

=

a

certain sense,

describes the

4.3 T ansfer Classes and Controllable

Systems

103

Theorem 4.3.14 Let 8 C

behavior with controllable part Bc. Then there exists system A C Lq such that

Cq be

autonomous

a

B

Furthermore, let B

(4.3.3).

=

=

Bc

kerL R where R

(4.3.7)

(D A

BRc

=

an

(2

-Hpxq is factored

as

in

(4.3.2),

decomposition B Bc,nt, ED Baut into a controllable and an autonomous subsystem, the controllable system is given by Be ntr Bc, while the autonomous part is of the form Baut kerLA for some A Ej HqXq det B, up to units in H. satisfying det A Then in every direct

=

=

=

=

PROOF: Consider the factorization R

given by Be

controllable part of B is unimodular matrix

Re

U and partition the inverse

the.

matrix A

nonsingular

IU/Re] [QI7 Q]

the, identities

immediately verifies RcA

Iq

=

R

=

[Q', Q]

U-1

:=

GIq (H)

UI

U-1

as

BRc in (4.3.2), (4.3.3). Hence the kerL Re by (4.3.4). Complete Re to a

=

=

:---

as

[RI U/

IU/I Re

B

Thus

by

(4.3.7)

Theorem

3.2.8, 1

=

follows from Theorem

Consider

now a

with Theorem

gcrd(Rc, A) and BRc 4.1.5(c) and (d).

given decomposition 13

4.1.5(d)

one

and A

verifies

kerL A. Using

:=

Q'BRc

=

Define

QU',

+

one

=

on

13contr

the

one

once more

0(6contr)

(4-3.8)

Ip+q-

Q A

of the controllable term, observe that tion 4.3.11. On the other hand, using

Hqx(q-p)

E

as

[ Rc'] [ 0, Re] Q' QU

Q

HqXq and put A

G

[QIj Q]

well

such that

=

p

(D

=

R

lclm(Rc, A)

--

and

Baut. As for the uniqueness Bcontr 9 Be by Proposi-

hand

Theorem 3.2.8 in combination so

that

Bcontr

G

[B]

and there-

fore Be C Bcont, by Theorem 4.3.2. Hence Bcontr Be is the controllable part of B. As for the autonomous part, write Baut =: kerc A where A G Hq x q. We have =

to show that det

such that RcW

A

=

=

det B up to units in R. To this

[1p, 0]

and VAW

firstly, kerL RcW n kerL VAW tion 4.1.4. Secondly, one has kerL BRcW

=

=

Hence

[A,, 0]

f 01 gives

end, let V, W

where A4 E det A4 c

=

E

Glq (71)

H(q-p) X (q-p) Then, .

H',

see

kerL lclm(RcW,

also

Proposi-

VAW)

kerc [A,, 0].

and BRcW

R, which

=

kerL RcW + kerc VAW

are

left

divisor of the full-size minors of to units in

=

[A3A40] A,

is what

we

equivalent. Since det B is BRcW, this yields det A wanted.

the greatest =

det

A,

common

det B up 1:1

4 Behaviors of

104

Delay-Differential Systems

We close the section with Remark 4.3.15

(a)

The

decomposition (4.3.7)

is

quite standard

in behavioral

systems theory,

[87, 5.2.14] for systems described by ODEs. The sum can also be derived for multidimensional systems given by PDEs, but in this case the Thm.

see

decomposition

(b)

To

always direct,

is not

see

[123, Thm. 5.21.

regard the direct decomposition (4.3.7) as the "classical" decomposition of a system into its forced and free motions, see, e. g., [52, Prop. 3.1] in a slightly different context. Indeed, denoting by L+ the H-submodule of L consisting of all functions having support bounded on the left, it is easy to derive from (4.3.7) and (4.3.8) the relation kerL R n Lq kerC Rc n Lq extent,

some

one

can

"behavioral version" of the

=

This space

be viewed

the set of all forced motions of the system (including the forcing input, starting at some finite time to G R), while A kerL A contains the free motions (including input which has been actcan

as

=

ing

on

the system

forever).

In case,

kerc R

kerc [P, Q]

=

is

an

i/o-system

HP' (+P) and det Q =7 0, we know from Remark 4.2.4 that kerL R n Lq+ is the graph of the convolution op-

with

kernel-representation

[P, Q]

E=-

Q `P) (JO(l), Ji) restricted to LT. This way, we observe given by again that the formal transfer function is related merely to the controllable part of the system. Consequently, nonanticipation, as well, is a property related to the controllable part only. erator

4.4 Subbehaviors and Interconnections

So far

we

have

only been concerned with the analysis of

this and the next section two

systems,

will direct

our

a

single system.

In

attention to the interconnection of

of which

one

to-be-designed

we

controller.

being regarded the given plant, the other one the Indeed, a controller does constitute a system itself.

It processes (part of) the output of the to-be-controlled system and computes (part of) the inputs for that system with the purpose to achieve certain desired

properties of the overall system, like for instance stability. Thus, the system and the controller are interconnected to form a new system. In the behavioral framework the interconnection

ably defined behaviors.

can

be written

as

the intersection of two suit-

underlying idea is simply, that the trajectories of the interconnection have to satisfy both sets of equations, those governing the system and those imposed by the controller. Depending on the type of interconnection or on the description of the components, the resulting system might be described with the help of some auxiliary (latent) variables, which hopefully can be eliminated in a second step so that one ends up with a kernel-representation The

for the external variables of the interconnection. This elimination

procedure will be dealt with

at the

beginning

of the section.

4.4 Subbehaviors and Interconnections

Thereafter

we

turn to the interconnection of

105

systems and investigate the achiev-

ability of a given subsystem via regular interconnections from the overall system. The notion of regularity can be understood as requiring, in a certain sense, most efficient controllers. At the end of the section the dual of regular interconnections will be treated, are direct sum decompositions of behaviors. It will be shown that the

these

existence of direct

sum

decompositions

closely

is

related to the notion of skew-

primeness for matrices. The

following

theorem shows that

(and how)

considered

latent variables

can

be eliminated

exactly showing up in typparticularly important in the next latent variable where we chapter study systems of a specific type. In that conwill role be a text, special played by polynomial kernel-representations; therefore we also include the polynomial case in the theorem below. For the term "latent variable" we would like to recall the discussion following Definition 4.1 in the introduction to this chapter. in certain situations. The

cases

those

are

ical interconnections. The theorem will be

Theorem 4.4.1

(a)

The

image of a behavior under a delay-differential operator is again a bePrecisely, if Ri G Hp;,Xq for i 1, 2 are matrices of full row rank,

havior.

=

then

R1 (kerc R2)

=

kerc X,

where the matrix X C H"P-1 is such that XR, is

multiple of R,

(b)

least

a

common

left

and R2.

Moreover if Ri E

R[s, Z]pi,

the matrix X

be chosen in

can

Ri E RPxPi, i Fbrthermore, assume Let

V

V2

R2

qand

[M0-

Ri

rkC

R2

two matrices and

[VjT, V2TF

=

rkR(s,z)

Rj* (s)

for all

R2

s

E

C,

R[s, z]txP'.

1, 2, be

=

[ 1]

X

for

assume

rk

[Ri, R21

=

P-

is such that

c

some

M (,- HrXP2 with rank

r.

Then

B:= If s

we

E

have

C,

fw

c

LP'

additionally Ri V2

then the matrix

G

I Rjw

E

imLR21

R[s, z]PxPi

can

=

kerL (V2Rj)-

and rk R(s,,) R2 be chosen with entries in

=

rk CR* for all 2 (S)

R[s, z],

too.

As the proof will show, the condition on the gene?7ic rank of RI and R2 in (a), and hence also in (b), is not inherently necessary. It simply allows to

part

use of the least common left multiple, which has been defined for this only. Note that also the (extreme) case where rk [Rj, Rjf P1 + P2 is encompassed in the statement above, as in this situation the least common left multiple is the empty matrix while R, (kerC R2) is indeed all of LPI.

make case

=

4 Behaviors of

106

Delay-Differential Systems

I in (b) shows again that imCR2 is a behavior, a fact special case R, in Remark 4.1.9. In light of Theorem 4.3.2 we see that indicated already being the systems of this form (that is, having an image-representation), are just the

The

=

controllable systems.

possible

It is not

drop

to

pointwise rank condition imposed for the poly-

the

Z

nomial kernel-representations. For instance, imC

Proposition 4.3.11) representation can be found

in this

PROOF

(a)

and because of Theorem

as

THEOREm 4.4. 1:

OF

-

1

kerc [1,

8

4.1.5(a)

no

(by

polynomial

kernel-

case.

The first part is fairly standard and can be seen we know that there exist matrices Ui

By the Bezout property of H,

follows.

such that

U2] [R2Ri] [D]

U, U3 U4

=

for

0

and the leftmost matrix is in

by

Theorem

for

w

Ez-

3.2.8(b). Using

some

D CHrXq with rkD

Glp, +p2 (H).

provides lclm(Rl, R2) D, see Proposition 4.1.4,

This

surjectivity of

the

(4.4.1)

r

U3R,

=

we

get

Lq

w

E

(w)

R2)

R, (kerc

C

o

c

w

11

m'C

===,,

R2

(Ulw) U3W

E

m'C

[D0

kerL U3,

which proves the first assertion of (a). Let us now turn to R, and R2 being polynomial matrices. The existence of a polynomial kernel-representation for

R, (kerc R2) will be can

any

be chosen

proven

polynomial.

have established that

once we

This

accomplished

be

can

equation of the type (4.4. 1). Notice that

Theorem

4.1.13(2)

we

matrix

[R2 ki

[f?J, f? 2TT

[R

know that

T 1 ,

RTT 2

r

is

=

rk

as

[U3, U4]

(4.4. 1)

in

follows. We start with

[RIT, RiT. By

right equivalent

virtue of

over

H to

a

-

0

R[s, Z] (pl +p2)

G

0

x

q

-

The rank assumption on [R 1T , RTF and 2 the invariance of'the invariant factors under equivalence imply the coprimeness and

has full column rank

of the full-size minors of

[f?,T, f?jf

r.

in X

equation

[U3, U4]

4.1.13(l)

[03, 1 4].

Now

R,

=

now

Lemma

3.2.7(l)

to the

0

are polynomial so that by Theoequivalent to a polynomial matrix replace the unimodular matrix in (4.4.1) by

the matrix we can

I A21

[U3, U4]

shows that the full-size minors of rem

Applying

[U3, U41

is left

[ l 2] U3 U4

E

Glp, +P2 M)

4.4 Subbehaviors and Interconnections

and obtain from the first part of the

hence

(b)

a

proof the identity R, (kerC R2) polynomial kernel-representation.

(a) by observing

follows from

[R j j 0

that the matrix Let

us now

-R2

start with the

Definition 4.4.2

that B

[Ip, 0] (kerc [Ri, -R21).

=

=

107

kerC 6r31

Note also

has constant rank whenever R*2 has.

investigation of interconnecting systems.

(see [120,

p.

332])

The interconnection of two systems B1, B2 C- B is defined to be the system B : 13, n B2. The interconnection is called regular if o (B) o (131) + 0(132) =

=

-

The concept of a regular interconnection is rather natural in the behavioral setting as can be seen by Theorem 4.2.3. Indeed, the number q of external variables minus the rank of a kernel-representation represents the number of input variables of

system. If one thinks of one of the interconnecting', components as it is natural to require that each linearly independent equation of the controller should put a restriction onto one additional input channel, for the

a

controller,

otherwise the controller would be inefficient. Put another way, restrictions are imposed on what is not yet restricted. As a consequence, the resulting inter-

B, and B2 is left with regularity condition.

connection of

exactly

the

Using

once more

0(131)

+

0(132)

q

-

o(BI)

Theorem 4.1.5 and 3.2.8,

one

-

o(B2) input variables,

obtains

which is

o(Bj n132) +O(B1 +B2)

and'theiefore

o(Bj

n

B2)

=

Hence the interconnection is

O(BI)

+

regular

0(132) if and

B,

+

132

=

(4.4.2)

Lq

only if the components add

up to

the full space rq.

As

an

example

we

want to discuss the classical

feedback-configuration

of two

systems. It also exhibits how "interconnected" variables may,turn into latent variables of the interconnection in the sense that they are not describing the external behavior of the

Example

new

system.

4.4.3

Given the two systems B,

=

(U)

C

Lq

Yi

I

Qiyi

piui +

x (m+P) where q and p + m and [Pl, Q 1 j c RP sical feedback-interconnection given by u := ul =

by

ker,C

0

-1 1

OQ1 Pi

_O

P2

0

01,

[P2 Q21

the system 1

=

0

Q2_

7

Y27 Y1

E

HM X (P+m). The clas-

=

U2

=:

y is described

Delay-Differential Systems

4 Behaviors of

108

(U) Y

for the variables

and y

only,

i

U 11

Y2)

If

-

one

is interested in the

1 0

00100-0 -

B:

one can

B

:=

[U3 U2] U1

G

U4

kerC

0

Using Theorem 4.4.1(a),

where U

new

eliminates the latent variables ul and Y2

one

_O P2 find the

G1,,+p(H)

u

projection

0

Q2_

0

kernel-representation

kerc [U4P1, Q3P2

=

the

-1 1

Q1 P1

0

external variables

by taking

+

U4Q1],

is such that U

[Q2] [D01

for

=

P,

some

full

row

governing the external variables (u, y) of easily be seen that the external behavior B is an i/osystem with output y if and only if det(I Q, 1P1Q2 1P2) 6 0. This is the usual well-posedness condition for this type of feedback-configurations in the classical transfer function approach. In the same way one can handle series- and parallel-interconnections. As this is completely analogous to the case of systems described by ODEs in [87, rank matrix D. It describes the laws

the

system. It

new

can

-

Exa. 6.2.9, Ex. 6.3, Ex.

6.4],

the details will be omitted.

Obviously, an interconnection is a subsystem of either of its components. It is fairly simple to characterize algebraically those subsystems of a given system, which can be achieved as regular interconnection from that system. But it is also not hard to give a dynamical characterization purely in terms of the trajectories involved.

Theorem 4.4.4 Let

8

B C ,Cq be two behaviors and

C

Then the

(a)

following

There exists

a

statements

system B'

are

C

B, C V of B

assume

8

=

kerC A where A

G

H,25 x q.

equivalent: 8 n BI is

f q such that

a

regular

intercon-

nection of B and

(b) (C) (d)

the

8

image

=

B is a

Bc

+

A(8) 8, where Bc c :

[0, to)

_._

w

G

point of view, part (d)

us

we

is the most

with an of trajectories

provides

B there exist to

! 0, lb

E

B,

and

Cq such that wAo cAto ?b C B.

equivalent conditions is satisfied, regular interconnection from B.

behavioral

a

since it

controllable,

8-controllable, that is, for each

function

If any of these achievable via a

From

is

denotes the controllable part of B,

intrinsic criterion for

say the

subsystem 8

is

important characterization

regular interconnections;

it is

and does not resort to any kind of representation purely of the behaviors. Observe that 8-controllability can be understood as the capain terms

bility

to steer every

trajectory of B

into the

subspace 8

in finite time. In

light

4.4 Subbehaviors and Interconnections

of Remark 4.3.9 is the

same as

controllability

that

we see

101-controllability.

in the

sense

109

-

of the previous section

The characterization above is close to what

has been obtained for multidimensional systems in [92, Thm. 4.2] more the structural analogy between these classes of systems.

showing

once

equivalence of (a) and (b) can be derived by taking the duals of.the behavconsidering the corresponding problem in terms of finitely generated submodules of Hq. However, we think it is reasonable to stay on the systems side in order to use one and the same language throughout the proof The

iors and

PROOF is

THEOREm 4.4.4: Let 13

OF

rank. We may also

row

co'ntained

assume

Hop^xq ,thus f?

in

kerL R for

=

does not contain any

simplify the application of k onto a inclusion L C B implies a relation Xf?'=

R E Hpxq

some

f?

without restriction that

has full

having full

powers of

negative

latter will

concatenation

The

R where X E HPxf' is

rank matrix. Note that R

lclm(f?, R)

=

R(B)

and therefore

rank and

row

later

=

The

z.

in the

proof.

full

a

kerL X

row

CP

C

by

Theorem 4.4.1 (a).

"(a)

(b)"

=)>

ker,c f?

Let B'

8

=

=

kerc

=

kerc R' where R'

[R]

Hence Theorem 4.1.5 (a) Thus X is

"(b)

=,,.

block

a

controllable

by

and

R'

row

P

yields that the

of

a

"(b)

= ,

to R

is

matrices

f?

a

the form L

=

BRc be factored

lclm(Rc, k) (up Af?

E

Pick

a

=

"(c) = (d)"

i'v'E B sdch that

trajectory

v

:=

which proves

"(d) =: ,'(b)"

left

equivalent. =

f?(B)

is

3.2.5.

unimodular matrix

[XT, YTT (see

as

in

(4.3.2)

and

to unimodular left

factors),

w

right invertibility of X, IpXq and a right divisor of R

=

(4.3.3), Bc+B

see

Xf?

=

equivalent

4.1.5(d). lelm(Rc, f?) is of

Theorem

since every =

thus Bc

is

BRc.

G B.

By assumption there exist wc G 13c and trajectory w wc +,Cb. Controllability of 13c implies the existence of a =:

wcA'OcAt,,O

c-

Bc. As

a

consequence,

v

+ &

=,wA0'c'At0tb

G

B,

(d). Let

trajectory Cv

f?w

v

E

suitable function

c

c

f?(B)

for

such that w, defined on

kwi

point

Then

=

Let R

an

are

R'

unimodular matrix and therefore kerc X

But the latter follows from

this

[R]

and

Corollary

p'.

of the interconnection.

is the controllable part of B. Then the condition B

being

obtain

matrix of rank

a

p' by regularity

(a)" 'follows by completing X to Yk and defining R' (c)"

-HP'xq

virtue of Theorem 4.3.8 and

Corollary 3.2.5) kerL Rc

p +

=

E

kb

R(B)

B.

wAocAt.7-b

tO,to).

c

c

some w

:=

Now

for

By assumption there E

we can

some

t,

B for

some

to

apply Lemma

> 0

and

a

4.3.6 and

function

it is convenient, but not necessary, to have the entries of

exists 0 and

>

f?

c.

in

(At

'Ho in

order to avoid any backward shifts of the concatenating time instants.) Since f?t^v 0, the last part shows that every trajectory in can be steered to

f?(B)

=

zero, which

by Remark

4.3.9 is

equivalent

to

controllability of R(B).

4 Behaviors of

110

Delay-Differential Systems

Remark 4.4.5

Note that the map

f?(ker,c R) is

an

kerc

Rw

w

+

kerc R

isomorphism of H-modules. Therefore, "quotient behaviors

tified with real behaviors"

controllability

the

R/kerc k,

(with

a

different number of external

condition in part

(b)

above could be

can

be iden-

variables)

expressed

and

in terms of

the quotient behavior.

Since the image of a controllable,behavior is controllable again (see Corollary 4.3.7), the following additional characterization is immediate from the theorem

above. Notice that

(Definition via

regular

by part (b) below the

term

controllability

can now

be

twofold way. Firstly, it describes the ability to steer trajectories 4.3.4), and secondly, it expresses the achievability of all subsystems

understood in

a

interconnections. In other

words,

it

guarantees the very existence of

controllers.

Corollary 4.4.6 The following conditions

on a

system 13

C

Cq

are

equivalent.

controllable, (a) (b) each subbehavior 1 C B can be achieved via regular interconnection from B, (c) f 0} C B can be achieved via regular interconnection from B. B is

Remark 4.4.7

Consider

-

once more

the situation of Theorem 4.4.4. In

case

that

B n B, is

a

regular interconnection, the output number of 8 is, by definition of regularity, the sum of the output numbers of the components B and B'. This, however,

does not guarantee that the outputs of the given subsystem 8 are made up by the outputs of the two components. But this can always be achieved by a ,

suitable choice of the component B'. Even more can be accomplished. If 8 C B are both nonanticipating i/o-systems, then the controller B' can be chosen in this

form,

too

Thm.

9];

is worth

(and,

of course, such that the outputs match). This can easily the same way as described for systems of ODEs in [120,

exactly also Proposition 4.2.5(b) for the condition of nonanticipation. It mentioning that in general it is not possible to have all components

be shown in

see

strongly nonanticipating i/o-systems (see Remark 4.2.4) at the as can be seen by the example

same

time. This

fails'even for systems of ODEs

kerc In this

case

11

2s 3 +1 ,

s

2

82 s+

1]

C

kerc [2, '93 + 1,

B

strong nonanticipation of 8 and B requires by Proposition

that the second and third external variable

4.2.5(c)

the output of 8, while the second to find a strongly nonanticipating

are

is the output of B. But it is not possible interconnecting system B' having the third variable

one

S21..

as

output.

4.4 Subbehaviors and Interconnections

After these considerations be

regarded,

interconnections

on

we now

turn to

ill

problem,

a

that

precise below, as the dual of achievability via regular interconnections. Given a behavior 80 with subbehavior B, C Bo, we ask for conditions which guarantee that B, is a direct summand of Bo in the "behavioral sense", that is can

in

a sense

L30 In this I

case we

161

made

L32 for

0)

simply call B,

Bi -Hq, mpdules.A4o that M1 + -A42

a

some

(4.4-3)

behavior L32 C 130-

direct term of Bo. In terms of the duals Mi

=

question above can be posed as follows: given finitely generated C A41 g -Hq, find a finitely generated submodule M2 C -Hq such

the

C

achievability

Hq and M, n -M2

=

regular

=

Mo. This

is

interconnections where

exactly

the condition of

behaviors

now are replaced by regularity condition). The problem stated above on direct terms might not be of system-theoretic significance by itself, but nevertheless we believe it is natural to be investigated.

via

(see

modules

also

see

(4.4.2)

for the

Example 4.4.8 (a) For Bo =,Cq, the clas's of all direct terms of Bo is immediately seen to be the class of all controllable systems. Indeed, kerc R, E) kerc R2 C-q is equivalent to gcrd(RI, R2) Iq and lclm(Rl, R2) being the empty matrix. But this simply means that [Rj, RjT is unimodular so that by Corollary 3.2.5 and Theorem 4.3.8 the behaviors kerc R, and kerc R2 are controllable. =

=

(b)

In the

previous section it has been shown that the controllable part of a system is always a,direct term, the complementary term being autonomous, see

Theorem 4.3.14. The theorem below will show that

lable

(c)

subsystem

Consider

is

a

even

each control-

direct term.

system Bo 9 Llq given by Bo

kerc A, hence nonsingular. Choose a frequency A E C with k > 0. It is intuitively clear that there exists an exponential ord,x (det A*) solution w(t) woe,\' in Bo. We will show even more. By some matrix calculations it is possible to derive a direct decomposition of kerC.A that extracts exactly the solutions having frequency A. To this end, let U, V E A is diagonal. Extracting Glq('H) such that UAV diagqxq(al ......aq) from each a the (possible) root A with maximal multiplicity, we obtain a an

autonomous

the matrix A E

Hq Xq

,

=

is

=

=

factorization A

==

diagq

Xq

('al

where eti E H and

aq) diagq -

i ....

&,i* (A) (S

coprimeness of eti and

ker,c 6,j and

E)

(d).

kerc (s

-

A) ki

This in turn

the direct

sum

0. In

A)ki

x

q

((S

_

A)kj

particular,

)(S we

_

have

induces the direct

A)k,)

Ejq_

sum

3

-

A

k. The ki decompositions =

kerc ai for the components, see Theorem 4.1.5(c) implies kerc A kerc 3 E) kerc A and we finally get =

=

decomposition ker,c A

=

kerc

(3V-1)

(D

kerc

(AV-').

(4.4.4)

112

4 Behaviors of

Since

Delay-Differential Systems

det(AV-1)

==

Lemma 4.1.10 that

eratorAE R[s]9 xq

(s

kerL .

A)k c R[s] (up to a unit in H), we know by (AV-') = = kerLA for somepurely differential op-

-

Hence this behavior is

k-dimensional vector space

a

consisting solely of functions of the type w(t) p(t)eA' where p c(C[t]q. On the other hand, the first component kerL (ZAV-1) in (4.4.4) does not contain any (vector-valued) exponential polynomial of frequency A; this follows =

from the inclusion kerc

systems of ODEs

(,AV-')

g kerL

(det(,AV-1)1q).

can derive this way

For autonomous

successively complete direct finitely many various frequencies of the system. This is, of course, nothing else but the well-known expansion of the solutions into finite sums of exponential polynomials. Remark that the decomposition J4.4.4) implies the identities A lclm(.AV-1,AV-1) and I gcrd(.AV-1, AV-') by virtue of Theorem 4.1.5. In this particular case this is also clear from the fact that A and A are commuting. sum

one

decomposition according

a

to the

=

=

question posed above let us first rewrite (4.4-3). Choosing kerL Ri, we see that, as in the previkernel-representations Bi ous example, the decomposition (4.4.3) is equivalent to gcrd(RI, R2) 1. and Ro. Let furthermore, Ro XR, be the factorization implied lclm(Rl, R2) by the inclusion B, C B0. In the scalar case the existence of R2 satisfying the above requirements is identical to the coprimeness of X and R1. In the matrix case this generalizes to some skew primeness between these two matrices, which then provides a criterion for a direct sum (4.4.3) in terms of the given data R, and Ro. This is the content of Theorem 4.4.9 below. The role played by the quotient 80113, will be discussed in Remark 4.4.10 right after the proof In order to attack the

full

row

rank

=

=

=

The

=

(straightforward) equivalence (a)

#

(b)

is the

analogue of

[108,

result for two-dimensional discrete-time systems in

Thm.

a

corresponding

18.3.4].

Theorem 4.4.9

Let Ri G RP' ", i ated behaviors

Bi

0, 1, be two matrices with full row rank. Define the associkerL Ri g cq and assume XR, Ro for some X E 'HPO xP1, =

thus B, C B0. Then the

following

conditions

(a) B, is a direct term of B0, (b) the matrices X and Ri are skew-prime, 7P,

xP0

equivalent:

that is, there exist matrices F G

and G E 'HqxP' such that

lp:

(c)

are

there exists

a

=

(4.4.5)

FX + R, G,

matrix G G I-PxPi such that

Bo

=

Bi

ED

GRI (Bo).

Furthermore, every direct term B, C 13o is of the form B1 H(BO) for some H G -Hgxq. Moreover, every controllable subbehavior B, is a direct term of BO, and in case B0 is controllable, every direct term of Bo is controllable, too. =

4.4 Subbehaviors and Interconnections

113

skew-primeness condition does not depend on the choice of R, R0, which, being of full row rank, are left equivalent to every other chosen representation. Remark that the

and

"(a)

PROOF:

full

row

=: ,

(b)

"

Let

B0

=:

L31

E) 132 where

rank. Then Theorem 4.1.5

XR1. Rom Theorem 3.2.8

we

form

132

kerc R2 and R2

=

yields gcrd(RI, R2)

get that po

P1 + P2

-

=

1.

and

q and

an

G

'HP2 X q has

lclm(RI, R2)

=:::

equation of the

[G Z] [Ri] [Iq] C Y

where the leftmost matrix is in

R2

0

Glp,+P2(H)

partitioned according to G E Again Theorem 3.2.8 implies that the matrix R1 is an lclm(Ri, R2) and hence by the uniqueness of the least common left multiple we can assume without loss of generality that X. Completing [RJ, RiF to a unimodular matrix (which is possible by Corollary 3.2.5) we get after some elementary column transformations, if necessary, a matrix identity of the form ,HqXpi

and

.

=

[R2 NJF] [G[X YJZ] Ri

with matrices F and N of

"(b)

= -

(c)"

The equation

fitting

(4.4.5)

=

[Ip, L

0

0

IP2

sizes. This shows

]

(4.4.6)

(b).

shows that both matrices

[RI, F]

and

[(SF, Xrf

completed to unimodular matrices. Choosing the completions appropriately, we arrive again at Equation (4.4.6) with suitable matrices R2, N, Y, and Z. For the verification of the direct sum in (c) we use the identity Ro XR1 and calculate for wo c B0 can

be

=

(i) R1 GRI wo (I (ii) RI(I GRI)wo =

-

-

FX) R1 wo R, wo, implying FXRlwo (I RIG)Rlwo

=

=

=

-

the directness of the sum, =

0, hence Bo

is contained

in the sum,

0 by (ii), thus (iii) RoGRiwo Ro(GR1 I)wo X(RIG I)Rlwo GRI (Bo) 9 Bo. Since Theorem 4.4. 1 (a) guarantees that GRI (Bo) is a behavior, the implication "(c) =: , (a)" is clear. In order to establish the representation B1 H(BO) for a given direct term B, of B0, consider again (4.4.6) and define H := ZR2 I GR1. The inclusion Bi D H(Bo) is immediate by (ii) above, while the converse follows from B, E_ ker,c GRI g kerc (I ZR2). The remaining assertions are consequences of the =

-

=

=

-

=

=

-

-

above in combination with Theorem 4.3.8 and Cor 4.3.7.

El

Remark 4.4.10

Unfortunately we are not able to provide an intrinsic characterization for B, being a direct term of B0, that is to say a criterion purely in terms of the trajectories. However, the skew-primeness of the matrices X and R, can be given a behavioral interpretation. Note that the existence of a direct decomposition does not only require the splitting of the exact sequence

4 Behaviors of

114

Delay-Differential Systems 0

)

B,

)

L30

B01B,

)

)

0,

B01131

to be isomorphic to a behavior contained in B0 that, trivially with B1. From Remark 4.4.5 we know that the quotient can be regarded as the behavior Rj(Bo) contained in 01. Thanks to Equation (4.4.5) it is indeed possible to embed this space as a behavior in B0, complementary to B1. Precisely, the operator G induces an 7i-isomorphism from

but also the

additionally,

RI(BO)

quotient

intersects

onto the behavior

GRI(Bo)

C

L30

C

fq.

The theorem above tells how to check whether

or not B, is a direct term of B0, and, if so, how to determine a complementary term. One has to check the solvability of the skew-primeness equation and to find a solution, if it exists. Since this equation is linear this is not a problem (apart from computational issues, see Section 3.6). For matrices over K[x], where K is a field, a nice criterion for solvability has been derived in [94]. Studying the proof in [94], one remarks that it works equally well for the ring H(C) of entire functions and, as a conse-

quence, also for R. The result will be summarized next. We will confine ourselves to

sketching

the main idea of the

proof

in

[94] along

with its

situation. For the details the reader is asked to consult

adaptation

to

our

[94].

Theorem 4.4.11

H1 x n,

Let A E

B EE Hn x m, and C 2 7in x n be

given matrices. Then the

matrix

equation C=FA+BG is solvable

over

Ii if and

only

if the matrices

[B qA] [13 AO] 0

(4.4.7)

,

-H(n+1)x(,rn+n)

E

0

equivalent.

are

We remark that rem

by

the

3.2.1), equivalence

uniqueness of the elementary divisor form (Theoover H can easily be checked (easily again

of matrices

up to

practical computational issues) by calculating the

given

matrices.

SKETCH

OF

1) Necessity

0

For

sufficiency

one

the matrices A and B

and

bl,

.

.

.

,

invariant factors of the

PROOF: We follow the steps taken in [94]. follows easily (over every domain) since (4.4.7)

[I -.r] [B AC] 2)

(4.4-8)

0 A

1

may are

assume

in

-1 .

rkA

diagonal

ba, respectively. Hence a, to finding fij and

ing (4.4.7) reduces

0

=

] r =

a

implies

0"

0 A

>

0, rk B

=

3

>

0 and that

form with invariant factors a,.... , a, a, and bi b,3. Now, solv-

gij such that

L

4.5

fijaj

(cij)

Assigning +

bigij

=

the Characteristic Function

cij

115

(4.4.9)

,

bi for j > a and i >,3. The, solvability of (4.4.9) ring K[x] by showing that the equivalence of the matrices in (4.4.8) implies that for each irreducible polynomial -Y E K[x] which. occurs with maximal power r in aj and bi, the element -yr is also a divisor of cij. Thus, cij is in the ideal generated by aj and bi. As for the ring h, one can use

where C

and aj 0 is established in [94] for the

the

=

same

=

=

line of arguments to show that

minf ord,\ (aj*), ord,\ (V)} Hence and

ci*j

is in the ideal


. (d)" is in Proposition 4.5.5 since kerc A C kerc (b1q) is stable. R

=

=

=

=

b.

=

=

Notice that condition or

(a)

above is

if the set of rank deficiencies,

satisfied, if, for instance, kerL R

fA c C

I

rk R* (A)
0, i. e. -ri). Then R [aj, (71, D] is the ring of all linear, time-invariant

5.1.2

C' (R,

=

f (t

-

.

,

..

delay-diff rential operators of the form N

P"jo'j'

P

1

0

...

o-,"

o

o

D',

p,,i E R,

(5.1.1)

i=O

where

E'

ture of'an

means

this

sum

being

finite. The space A naturally carries the strucfor p as in (5.1.1) and f E A one

R[aj,...' al, DI-module. Precisely,

has N

E pv,i f (') (t

pf (t)

-

(v, -r)),

t E

R,

VEN' i=O

E,1=1 vj-rj denotes the standard scalar product..It is obvious that

(v,r)

where

Endc (A) mutually commute. Moreover, if -ri ......rl E R are linearly independent over Q, then al, al, D are algebraically independent elements in the ring Endc(A). To see this, let p be as in (5.1.1). Then p being the zero operator in Endc (A) implies in particular for the exponential functions eo,.\ the identity al, and D E

0'1'

.

0

=

.

.

,

pv,jA'e-A(vI'r)

peo,.\(t)

e\t

for alltERand all A EC.

IEN' i=O

Since zero.

(v, -r : (p,,r) R [a,,

Thus,

elements

are

6971

whenever

v

:

p in

N',

all coefficients p,,i G R must be

al, D] is a polynomial ring in 1 + 1 indeterminates. Its delay-di'fferential operators with 1 noncommensurate delays. From .

[25,

p.

The

following

.

.

,

it is known that the

operators

are

surjective

on

A.

class of systems arises in multidimensional systems a unified manner in [84].

have been studied in

theory. They

140

5 First-Order

Example

5.1.3

Let K be of

(Multidimensional Systems)

following

Consider the

(a)

Representations

situations.

of the fields R

one

or

'9

'9

K[ 49xl

C and let

aXj+j

]

be the

ring

partial differential operators acting on A C'(R'+',K) or on A the space of real- or complex-valued distributions on R1+1; =

D'(R'+'), (b)

Let K be any

(possibly finite)

field and let

1

a(n)tnj.....tn,+, a(n) 1 1+1

A:= nEN'+1

be the

K-algebra of formal

E

Kj,

where

n

=

(nl,..., nj+j),

power series in 1 + I indeterminates

over

K. Via

the tackward shifts with truncation

a(nl,.

zi

..'

)

tnt+1

nl+,)tni 1

1+1

nEN1+1L

a(nl,..., ni

tnt+,, 1+1

nl+,)tni 1

+

nEN1+1

the space A can be endowed with the structure of a K[zl,.. -, z1+1]-module. This is usually the framework for discrete-time multidimensional systems,

cf. In all

[123, 122].

cases

above the operator ring is a polynomial ring in 1 + I indeterminates. [841 that these situations have some strong algebraic

It is the main result of

structure in

common:

the module A constitutes

a

large injective cogenerator

category of K[z, s]-modules, see [84, (54) P. 33]. Part of this result,goes back to work of Ehrenpreis and Palamodov in the case of PDEs. The large in the

injective cogenerator property itself is

not needed for

our

purposes and

we

refer

[84] for the details. More important for us are the for. the operators acting on A. In essence, the correspondence consequences between kernels in Aq and -operators in K[z, SjPXq is quite similar to that for the interested reader to

delay-differential systems discussed in Section following from [84] for future reference.

(1) [84, (46),

p.

30]

For matrices

kerK[z, ,] RIT

=:

im

R,

E

4.1. We would like to extract the

K[z, S]pxq

K[z,s] Rj

and R2 c-

kerA R2

(2) In particular, if R G K[z, S]pxq has rank p, then (3) [84, (61), p. 36] For matrices Ri E K[z, S]piXq, i kerA R, 9 kerA R2 Recall the

analogous

R2

=

XR, for

some

one

has

im AR1.

=

im AR =

K[z, s]'xP

1, 2,

=

AP.

one

has

X E K [z,

S]P2 XPI.

Proposition 4.1.4, Theorem 4.1.5(a), and Remark 4.1.9 for the case where H is acting on L C' (R, C) As we saw in Remark 2.11, property (1) does not hold true for the operator ring R[U, -!2L] dt acting on L, preventing point-delay-differential systems to be covered by the paper [84]. For the construction. in the next section only the suriectivity (2) will be needed. Parts (1) and (3). will, be used merely for a more detailed discussion of multidimensional systems in Examples 5.1.10 and 5.2.6. results in

=

-

Multi-Operator Systems

5.1

141

we introduced the formal transfer function -Q-1P E i/o-system kerL [P, Q] C C'+P of DDEs. In the same way transfer function can (and will) be introduced for the general poly7

In Theorem 4.2.3

R(s, z)P11 the formal

nomial

of

an

setting of this

section. In this context the

following

situation will

play

a

crucial role. 5.1.4 (Transfer Functions) K[z, s] be any polynomial ring in 1 + 1 indeterminates. Then the space K(z, s) carries a natural K[z, s]-module structure given by multiplication.

Example Let

A

=

The same is true for the space N

I

fis

N E

Z, fi

c

K(z)J

i=-00

of formal Laurent series in s-1 with coefficients in the field spaces

are

divisible

K[z, s]-modules,

thus

our

abstract

K(z). Clearly,

both

approach applies. For

setting, behavioral theory coincides with the transfer function framework we will make precise in Example 5.1.8.

this as

Remark 5.1.5

Throughout this section, it does distinguished. Even more, if xi, over K, the same is true for yi, .

play

not

x1+1

...'

.

yl+ 1, where

,

.

(yi,..., y1+1T

having one of the variables algebraically independent elements

any role

are

=

A(xi,...' x1+1T

+ b

and b E K1+1. In particular, K[yl,...,Yl+,] instance, in Example 5.1.2, the polynomial ring can also be presented as R[D,ol 1,...,ol 1], where we replaced the shift operators by the corresponding difference operators and changed the ordering of the indeterminates. In this case, the list of,operators (zl,...,zi,s) reads as 1 is the distinguished operator. The 1, (D, a, a, 1), so that s al procedure of the next section would then result in a first-order realization with respect to the last difference operator ol 1, provided that certain necessary for

some

A

E

K[xl,...,xl+l].

G11+1(K)

For

-

-

-

.

.

.

,

-

=

-

-

conditions are satisfied.

general case of a divisible K[z, s]-module A. For R E kerA R is a submodule of Aq and can be regarded as an abstract version of a behavior of a dynamical system, generalizing those of Definition 4.1. If A is a function space, it consists of all trajectories in Aq that are governed by a system of (higher order) equations, e. g., delay-differential equations, partial differential equations, or partial difference equations in case of the examples above. In the general case, for instance in Example 5.1.4, there is no interpretation of kerA R in terms of trajectories.. In the following definition we introduce these systems formally along with the desired first-order representations. Let

us

return to the

K[z, S]pXq

the kernel

5 First-Order

142

Representations

Definition 5.1.6 Let R E K [z,

(a)

s]

"

('+P) be any matrix.

The module

fa

E

A+P I Ra

system)

in

A+P.

kerAR is called

(b)

a

behavior

(or

The behavior

kerA R,

there exists

number

a

(A, B, C, E)

a

or n

=

simply

R,

is said to be

realizable,

if

E N and matrices

K[Z]nxn

E

the matrix

01

K[Z]nxm

x

x

K[Z]pxn

x

K[z]Px'

such that

kerA R

8 '(A, B, C, E)

=

(5.1.2)

where

1(yU)

BA'(A, B, C, E) In

case

such matrices exist,

A-+P 3 x

E

we

call the

G

An

EUI

sx=Ax + Bu :

Y=Cx

+

quadruple (A, B, C, E)

a

(5.1-3) realization

of kerA R. The system

is said to

be

a

8 '(A, B, C, E)

sx

=

Ax + Bu,

y

=

Cx + Eu

first-order representation of kerA R and the behavior (5.1.4). The length n of

is called the external behavior of

the internal vector

x

is called the dimension of the realization

The matrix C(sI-A)-'B+E E

function of

The term

(5.1.4)

(5.1.4)

or

of

K(z, s)Pxl

(A, B, C, E).

issaid to be theformal transfer

(5.1.3).

first-order representation

or

first-order system refers, of

course,

to

the fact that the first equation in (5.1.4) is linear with respect to the operator induced by s. As has been discussed for DDEs in the introduction to this Chap-

ter, itdoes not make

sense

cases, where the matrices

A few remarks

are

to call

are

(5.1.4)

a

state-space system. Only for certain

constant, this

might be appropriate.

in order.

Remark 5.1.7

(i)

It is not clear whether each external behavior of

a

first-order system does

kernel-representation, always be eliminated. We will see examples above except possibly for delay systems with delays, where this is unknown. admit

a

words, whether latent variables can below that this is indeed the case for the

in other

noncommensurate

5.1

(ii)

Remember the notions of free and

Multi-Operator Systems

maximally free variables of

143

a

delay-'

differential system from Definition 4.2.1. These concepts generalize naturally to the context of operators acting on A and can be applied to the

(5.1.3).

A on A' it is immediate surjectivity of sI that each for u E A' there exists free, meaning the such that R. For AP E examples 5.1.2 5.1.4, again ker.A (u' y with the possible exception of systems with noncommensurate delays, the variables u are even maximally free, so that the last p variables constibehavior

Rom the

that the variables

-

u are

-

,

tute the

outputs of the system;

see

the discussion below. We know from

delay-differential systems of Chapter 4, that this implies that R'has rank p, see Theorem 4.2.3. That means that the number of outputs equals the

the number of

erality. However, rank

row ces

independent equations. Again,

this will be true in

more

procedure in the next section applies only kernel-representation, meaning that we are restricted to

the realization

R E K [z,

s]PI ('-4-P)

to start with. Put another way,

we

will

gen-

to

full

matri-

assume

in

Section 5.2 that the system is governed by exactly p linearly independent equations. Except for the case of transfer functions and systems with commensurate delays, this restriction is indeed crucial: since K [z, S] is not a principal ideal domain, it is in general not possible to eliminate lineaxly dependent rows of R without.changing the associated behavior kerA R, see Example 5.1.10 below.

(iii)

In accordance with tion

4.2.1),

we

our

definition of

always place

of the external variables; restrictive point of view.

Let

us

input/output systems (see

the free variables into the first

see

also Remark 4.2.2 for

discuss the definition for the list of

ext

(Y) U

(A, B, C, E) =

where

-Q-1P

=

c-

Am+P

m

components

comment

on

this

examples above.

Example 5.1.8 (nansfer Functions) Consider again Example 5.1.4 where A is either K(z, s) case, the external behavior of (5.1.4) is simply

B

a

Defini-

y

=

(C(sl

-

or

K(z)((s-1)).

In this

A)-'B + E)u

kerA [P, Q],

C(sI'- A)-'B

+ E is any factorization of the formal trans-

polynomial matrices (which, of course, exists). Thus, the external behavior B (A, B, C, E) admits a full row rank kernel-representation [P, Q] E K [z, s]Px 4+P). Obviously, for this special choice of A, realizing a behavior kerA [P, Q] is the same as realizing the rational function -Q-1P, that is, as finding matrices (A, B, C, E) satisfying -Q-1P C(sI A)-'B + E. Note also that in this case u is maximally free. fer function into

ext

=

-

5 First-Order

144

Example

Representations

(Delay-Differential Systems)

5.1.9

In the situation of

D and a,, Example 5.1.2, where s al are shift operators the first-order lengths -rl ......rl, system in (5.1.4) reads as =

of noncommensurate

E A,c'x + E'B,o,'u, vEN1

Y

VENI

E'C,,o,'x + 1]'E,,o,'u,

=

VEN'

VEN'

where

the notation a'

we use

:=

al"' 1

o

o

...

a,"

and A Z/ ,

1

B, C, and E,,

are

constant matrices with entries in R.

If 1

=

1,

we

is in fact

a

know from Theorem

4.4.1(a)

137 (A, B, C, E)

C E

behavior in the

that the external behavior

0 1

sense

(kerA [sI

-

of Definition 4. 1.

A,

-BI)

Moreover,

we

will

see

in

5.3.1 that it always admits a kernel-representation kerA [P, Q] where C- 'HP' ('+P) and Q is nonsingular. In particular, u is maximally free, see

Proposition

[P, Q]

Theorem 4.2.3. It remains

an

open

question whether similar results

noncommensurate

delays, cf. [127',

Example

(Multidimensional Systems)

5.1.10

Let A be any of the spaces in

p.

234]

Example

a

[41,

are

true for

systems with

3.1].

Sec.

5.1.3 with the

structure. Then each external behavior B

mits

and

"

A

corresponding moduleB, C, (A, E) of a system (5.1.4) ad-

kernel-representation of

system. This

can

be

seen as

rank p, the number of output variables y in the follows. Define the matrix

sl -A -B M:=

IM

0 -

Since each submodule of K[z,

[Y, P, Q]

E K [z,

s]

s]'+'+P is finitely generated, for

some

T

kerK[z,s]M It follows rk rk

0

=

p.

=

p

(5-1.5)

.

E_

C


(i) is trivial and (ii) == ' (iii) as (i) =: , (iii) are simple consequences of the Binet-Cauchy formula for as the minors of matrix products. SKETCH

PROOF: The

OF THE

well

The assertion

lp,q,

(iii)

= ,

(i)

M(p), of M

by Mp, thus det Mp

see

such that

EPE-Tp,q CAP)

K[z, S]qxp

where

Ep

with indices p

rows

are given by corresponding p x p-submatrices M(p). By assumption there exist numbers cp E K

be

can

seen as

follows: The minors of M

Definition 3.2.6. Denote the

P E

Then C constitutes

cz =

a

=

E pEl cpEp adiMp E identity 1p sitting on the elsewhere, hence MEp Mp.

1. Define the matrix C

=

:=

P,,

Kqxp is the matrix with the

(pi,

right

.

.

.

,

pp)

and

zeros

=

inverse of M.

The remaining implication (i) = - (ii) as well as the alternative formulation is the celebrated result of Quillen/Suslin, see [67, pp. 491]; we also want to mention [69]

for

an

algorithm computing. a unimodular completion.

PROOF

OF

PROPOSITION 5.1.11:

(a)

Rom

(5.1.2)

we

El

will first derive the iden-

tity M:=

QCadj(sI

-

A)B

+

det(sl

-

A)QE

+

det(sl

-

A)P

=

0.

(5.1.9)

0 for all u E A'. fact, by divisibility of A it is enough to show that Mu Thus, let u c A' be an arbitrary element and pick x G A' such that Bu Cx + Eu. Then Pu + Qy 0 and one (sl A)x; see Lemma 5.1.1. Put y easily verifies In

=

=

-

=

=

5.1

Mu

hence

(5.1.9)

det(sl

=

A) (QCx

QEu

+

Pu)

+

=

as an

rank, Lemma

equation

3.2.7

yields

(b)

0,

C(sI

over

det

the field

Q :7

0'

A)-'B + E

-

0 and

K(z, s). Since both matrices (5.1.8) is established.

Write again R [P, Q]. By Theorem 5.1.12, the matrix matrix unimodular to a completed

[-X U2] UI

c

R

and the assumptions

I

as

the matrix

91 -A -B

U1 U2 -X R

0

IM

C

E

Uj[sI-A,-B] +U2

be

identity

rIP1

P 0

-

:=

can

Gl,+m+p(K[z 8])

be rewritten

can

have full

[-X, R]

=

where T

147

follows. This in turn implies

[P' Q1 considered

-

Multi-Opeiator Systems

I

_

01] [CE

E

K[z, .] (n+m) x (n+m)

is

nonsingular.

Hence

(U),

sI -A -B

0 G

Y

B t (A, B, C, E)