XML Step by Step [2nd ed] 9780735614659, 0735614652

Superb organization and presentation of the material. All code functions, all examples are relevant. Excellent developme

202 94 3MB

English Pages 389 Year 2002

Table of contents :
00000___07be008c8276e95dbbf4cdee2cdf4385......Page 1
00002___79c277462295ccdeedc24e5b50330987......Page 2
00003___7e0229f16eefad21c64fe455ba7840d5......Page 3
00004___a933fe67c82b988d8a19eb1cf4460489......Page 4
00005___c78cce8114aad80865103a642f927492......Page 5
00006___05fca2542d7816c3088c08cb5a953c52......Page 6
00007___dbcd3471236536641cfe07a8b300c340......Page 7
00008___51a230a8ae343e365423c24fc4c892b8......Page 8
00011___81cd371e1c1bacf76929df14a6b72ecd......Page 9
00012___fa71951b85f2335bba3707db9c9945c3......Page 10
00013___26bf5c8899f363dbe89421db9772700b......Page 11
00015___5022a336bf7648f50eb2940bafca422e......Page 12
00016___8ffef2ab3f4bb8896512f1f52c960d61......Page 13
00018___b1f3d94f696c5e655031dbac60c8b37d......Page 14
00020___4c4b41b89831554d0ed8dd83097ce467......Page 15
00021___0c64d3975047800446fafdd265235893......Page 16
00022___1f3d089ae9c652aad2afe9c4f573bd2d......Page 17
00023___4740dcd998da64186e5528f819ce220e......Page 18
00025___b7b672ade2024f358bfd2d179c77558a......Page 19
00026___92dda05b322aef279ea8e2bb3247ae41......Page 20
00027___40313eef7c48a42502c35f6245d97598......Page 21
00029___6a2229da2f7ad2a89c6e7eeb4e1f0d14......Page 22
00030___7e5507d2755064a6b2e2b4c0b987641b......Page 23
00031___cf3da0e8f97dd392043e87fc5ae4593f......Page 24
00032___91eac5195e31cd5a6022cdc6e4b14f6b......Page 25
00033___33d8ccbeecdd88d993fe65249b0359ee......Page 26
00035___cdef2c5e5e588e4e137279c64bc71b53......Page 27
00036___c823951e12c7da285c8187105bd7a971......Page 28
00038___cdaf368e1d91100316a46597e6567496......Page 29
00039___cd2505397f0bb2edd28b22f4a484123f......Page 30
00040___54c8df44e5864ca869920507a5b26105......Page 31
00041___4611f2737c4a637b71ae06d401d2cd76......Page 32
00043___d4e4f989d951d35de0382e120b4b4e19......Page 33
00044___0fa30a5072ae1086425b8696c8142499......Page 34
00045___d434437e62a47d05906c4e24a2ce79ba......Page 35
00046___16afb0554435a9be7c6c791195b06c98......Page 36
00047___cdcd0ebde96c32c015750a023a28744c......Page 37
00048___1e76a009ede3ffe83276816004f0f59c......Page 38
00049___21b65988b7ba745715028e7f65b50cbf......Page 39
00050___7f3b086353629e6a6c304e929afb7209......Page 40
00051___f60fe2ef5ce81c2be1e01eb4bb3fba9f......Page 41
00052___0a86962fd629f749e29f8e3002544138......Page 42
00053___5a97e02eb10b4dc03ea597f8ac6d7ac6......Page 43
00056___14ab7a7909284fc6401b19d21537d29c......Page 44
00057___5f6c898cb2ac1ac0ad4b032580c00844......Page 45
00058___81a6a0b4992bf7f884bb9f59270852e7......Page 46
00059___188053450dbd26400de2c0fe0e6dd62c......Page 47
00060___e4190846d17ef81e3826a0871de71ffb......Page 48
00061___7bd385ff2b8e6cbfc9dcb68777e1561f......Page 49
00062___6fece48acf4f08b58afa9b4b8ec6e2c7......Page 50
00063___5d14fdc588d22e2c868a2bdfc4748378......Page 51
00064___902ff685222620c1dd2b1b92f54dbafb......Page 52
00065___e10449efde06e242167ac4a82fa281d9......Page 53
00066___96cce74f24f7883ee805b4f3669e9a05......Page 54
00067___066cd22e047fce2c5725e693e4616d2c......Page 55
00068___a38efa9725ce5691c587d6c86693f60c......Page 56
00069___67c87fa03c6e81ef8e3076a0a556f1bf......Page 57
00070___be1599b9b2f01d3ef1708601e5305c8b......Page 58
00071___077aac04b65b4b09bd9eee0cf602d2e7......Page 59
00073___db91d036847cd284782f8102d0d30226......Page 60
00075___347552d272eab581240ca12a63f8df8b......Page 61
00076___c568dc6735bad69e18abf376a59a325f......Page 62
00077___2e1012290c983657e4f9b410ebc0b64c......Page 63
00078___20d300bfc29acfe447daeb59c93b30cd......Page 64
00079___e15f6f912d51a56fe5cb6fd90eea9c96......Page 65
00080___3c6b864e647fda3991319c851aab5279......Page 66
00081___fce256505c2cb5384233649dac023d86......Page 67
00082___5ba9398b3ebff8ff8ca11989892ff915......Page 68
00083___b4946b94d61aaf72d0838bdf10af5ac8......Page 69
00085___e32d5b7ce84396a8aa892a5e87b4c93e......Page 70
00088___c4a89139ee94f4861239e3152005dec0......Page 71
00089___647ad2be306267d969c4cdd8e6ed6d9e......Page 72
00090___1c3463392d1f86fb4c763e5866a34c3f......Page 73
00091___7cbf1cf841d8c2ca6569e79e35408f84......Page 74
00093___556f32e761c0f178bc1286f34d9426f0......Page 75
00094___5a26e56a9385672f1b318341cdb43854......Page 76
00095___5bff626f10cccce7656a8f010f8944ac......Page 77
00097___563f97a2b51ba65253e353b1a62495a7......Page 78
00098___164f62b26bbec199f7a49da01ebb83e7......Page 79
00099___72ae2c75944b66959bf7e02105197df6......Page 80
00100___0af2e625881ebb172c11a19e248836d6......Page 81
00101___22afff2dca889c5107eebb853b74b583......Page 82
00103___75ce2ab24dff7d14181576fffe09395f......Page 83
00104___5e2dd566428426287b0541f4139b2be8......Page 84
00106___65cc5d3de5566589cc7e85237cc3df0e......Page 85
00107___70c944bc69ea188d7975088014b1bb4a......Page 86
00108___a0b8b732fc5222e4cadf7cc75cadc48d......Page 87
00109___8faf9c79c01752ff6d219e1fe54eae15......Page 88
00110___d82bab581409feea270f38e8087e4d82......Page 89
00112___e348ce0ec89e86b0eeb8199dfa566a76......Page 90
00113___ff0c0987c8c681c86732aeb2888eaf3f......Page 91
00114___df5be5ff959daec1e10663d147ec830f......Page 92
00115___b19b4d04bd50fc968e04f3eae262cc31......Page 93
00118___bb69e9e1ae026339cbf80695449133c7......Page 94
00119___93bbfa20b63dbc80a3f034da82c87988......Page 95
00120___da8b71bd337e6e577b9b5c1fb5b74b82......Page 96
00121___1e69f8d6e81eaf85980d287b45cf16dd......Page 97
00122___bc250eac32f66e0f9441ac7a1bf43507......Page 98
00127___e35f961ce8ecf2aa0a3d12068b66846d......Page 99
00128___10badc2d67bc077915fe5400d842db0d......Page 100
00130___e6625092f4c3eb36c495f51cb0356a3a......Page 101
00131___bac4ff3810c493a3425e2a810e1a0dd7......Page 102
00132___bd88eae7a16f27c908345c50ac1d540d......Page 103
00133___bc8dff5811bb593a839c66306096d60f......Page 104
00134___67bce0e289ba178cb4a048c2e54f0edb......Page 105
00135___b282dd4a6f6122ad4aa269d281c69cf4......Page 106
00136___2ea62ab44365c10fcb3dbdccc3291793......Page 107
00139___ea1a865d89cbcb941c4db58b23c2d239......Page 108
00140___ffbbe549d9e8051b881a37c3019cff06......Page 109
00141___9249403aedfb836b3db67b35e3cba673......Page 110
00142___cdc3bf65af3a82257274d1728ac38a0a......Page 111
00143___099a9b209a24ac2b2894f61b4f12a995......Page 112
00148___41edce29bea5e759793932bae1be497c......Page 113
00149___c8e2f302c10023ead709cfe71ade5f88......Page 114
00151___1bcdb5f0f4108c507f57027a4aaa3694......Page 115
00152___343b3bf70462e15969d6748130567061......Page 116
00153___88ec6879d6b7017928a0669c205d9539......Page 117
00154___f269ff02f399d0ab531694e2ea878607......Page 118
00155___b4995d2c742ce1ed07aec54a4b351af9......Page 119
00157___10ab4d2c516c109f7b8dd82452eb56e4......Page 120
00158___56fcfd1e500243d2eb7294b55ebc7e3d......Page 121
00159___69fd564fc4f7a1b8467e73fca68f5c3b......Page 122
00160___a3c8b4bfa2e04ceea6e67fea45b98732......Page 123
00161___c0532c1d09179559824cbfdd8bb4305c......Page 124
00163___be60a612cbc8a4af0dac55435b6559b1......Page 125
00166___b22e2f700a22e4a7976246f50fbf76dc......Page 126
00167___3662fbeded10b110273ede7e4dda7c4e......Page 127
00170___2f656900cd5d1c3ea9442a649aeee38d......Page 128
00171___09b08c76807f2b9d481a58f62fb20890......Page 129
00172___0b7080a1efc50b0591e95465b4ab6d5b......Page 130
00173___3a6a805f5299de537d8bed6a80032353......Page 131
00174___55a85aebafd14fcd36ba36916923377a......Page 132
00175___861e711aa07e532c1dae25c365f54453......Page 133
00176___485e9015fda2a6d59051d4d45c32c356......Page 134
00177___e445502d14e3b235c04a39f584dad3bf......Page 135
00178___ab78db8e659840ba832edb399dc05565......Page 136
00179___13b6e283b2d550763795b5b5eff3cf9e......Page 137
00180___95d67a797b94e6c8e9515ed5a8477154......Page 138
00181___278ac649f2c9884b6a83785cc59332a3......Page 139
00182___79816ce93b44fec0d0c8e31ab8f5d98c......Page 140
00183___415bb17412cf276fb8e8c090f6e64221......Page 141
00185___b12f3171d40c73549ab3da716f9fba73......Page 142
00186___4bef3f505c859ea4fafe48279e1144e1......Page 143
00187___82f888122936d0abc535e5e5eae01086......Page 144
00188___8a9e12e0051b3d740fdfdc4171729022......Page 145
00189___1999223eb76c18e674f582355ffd1c6b......Page 146
00190___52a7a3185f124b42a5b2a388abbfad7f......Page 147
00191___e0724028af8e44382cc65af5e4e83c81......Page 148
00192___424855c7bae91e006257f447bf912d1d......Page 149
00193___eae1ca623c962ef3a24785f350c04831......Page 150
00194___ac917ffbf721e0cf6feade65b3d88016......Page 151
00195___adcccd6a5ddeaacd60cde370689f2631......Page 152
00196___969f2ef9fd044c3c92bcc0f520e46157......Page 153
00198___c726c212ff1f2daebaf90d38f3b123f3......Page 154
00200___058a456418db59176ac64ef5cb59ced7......Page 155
00201___9df977eefb392537e9e77e6f395217a3......Page 156
00202___1c7e5dc61948599fdb78f13e57dc7452......Page 157
00203___cd2db706fd545f9f3cd20e3889e0e0e8......Page 158
00204___6f0553ed708412fbe360c1da1265d3e1......Page 159
00205___a7d068028f4e7a43c041973600e55183......Page 160
00206___cc19cdc94fbf80f7f506f8a32ef3f55d......Page 161
00207___dadea393f2dd53b921208f0150e85294......Page 162
00208___ea451dee11ee27f91a647f0ea6975dae......Page 163
00209___0ea9fa9d661961ebc510b18a8346a1c1......Page 164
00210___f6b6b16f33e4ab9f9253648964ce7c67......Page 165
00211___d3c5e33d080bd117ab88a6bf09eb1852......Page 166
00213___c18c4b62ff81772c0c35e399b9328ad5......Page 167
00214___06c52ec32a3110a176e8d792ffeb7675......Page 168
00215___25932ce2d4015e0e480ae0560bab16e2......Page 169
00216___1ee33e4de73cfb6c9fa63a3ad31b7be2......Page 170
00217___73b30a475252bbb2ce3ff238264df8e0......Page 171
00218___91f6d5a5a49059917da2c7e0bcc1ea11......Page 172
00220___44ae4e29e414c63ae6aa184440d8c689......Page 173
00221___a9a49e3921ce92d59d6c2b9dd4501596......Page 174
00222___02184fd9198c23ab7dd9c79cdc0fb211......Page 175
00223___34c60b48b00a3e18f659bafcb2f790c3......Page 176
00225___4afb0b9b30e178c55e8ca9be0291e7f9......Page 177
00226___652de5e57cc95c1847165c4dca7d1f6e......Page 178
00228___92ba36ece6899bae718fda64c3f7e305......Page 179
00229___aef34bbf6160ca0d6d1a7835e061b071......Page 180
00231___5654a23d8c2935cf83aeb3f6a3440415......Page 181
00232___dd940f82909b5212e1eafd80020c7979......Page 182
00233___b376ca4006b3c1c75ef903974d0bb6af......Page 183
00234___015b4701da5e948fc2d482d02a779584......Page 184
00235___d450c77a98fcf9795c6ab3005bb37ed2......Page 185
00236___f901a072b2c930d72e9e088ca0cf8187......Page 186
00237___a2fd3e5ac7b1072e82466570c8c54980......Page 187
00239___e54307556308812bb4e650207039b84a......Page 188
00240___c90b6cea1b8ce0f3f5d3650caf7dbcd6......Page 189
00242___bff86a23d731387c12f79a56a1090053......Page 190
00243___d44959c7951507e403d999536445fda4......Page 191
00244___2e7ee1bf216445b8e415f321fb8458d9......Page 192
00245___e3f5693c2dfaf8604e83edeb007f1e2c......Page 193
00247___87bae98867a49af25bf88f1bfe5d7b90......Page 194
00249___b0ac62e90be2ed0352940ea249b99559......Page 195
00250___09cf2d2b1022c10520918f71efd879b5......Page 196
00251___b59f858bb6b9cf962b3a5fbf0334fc55......Page 197
00252___3ed554aefb794d5a3a23e97739338970......Page 198
00255___79e0ff5c00f363885cf7f091f9afc844......Page 199
00256___8316c6e708288bdd24be9f29bdedfbdc......Page 200
00258___5d8b3072b45528ab3cfc6db9a07e64ba......Page 201
00260___267a1179046e90e811d28bf31a5725c0......Page 202
00261___00e8367ba74425d4a138fd1a8d0e20bd......Page 203
00262___717625a339e8d150763833f7e0585972......Page 204
00265___c6cba3266be4c472f9d6badb73d0ec82......Page 205
00266___5ab4a479413b590c42e9acd288a73770......Page 206
00267___19330eb7483232f34be10d411765eeaf......Page 207
00268___716a6b416b72a29bf36ae2ba6e7db5e2......Page 208
00269___799c86fddcb5e580b2c238e9de78aafe......Page 209
00270___d1c7196a52e32634f48809abc10a0f9d......Page 210
00271___f8f23945b19482c7ce1e237bb32b8f74......Page 211
00272___96c799638e8a25105cfef8e04d5ccecc......Page 212
00273___dbc4a0da526b9e4b79e70065313a3fd0......Page 213
00274___77ff27e1847eb5f8f7329281bff9d457......Page 214
00275___2f72fd02267769bf802460c98516c14b......Page 215
00276___0c21da9d7298629d225afc2fe01fbe5e......Page 216
00277___d99eddbd82b9937d53ff8edb51a8cc26......Page 217
00278___bcc6558d088faf98c0b8fc2164fed0ca......Page 218
00279___0828cf73371b6ba694e81e385e993aa1......Page 219
00281___1b9c5459e3a545e3fccb7697888b7eb3......Page 220
00282___7a2bd6d95f151864f06b9a613911bbcf......Page 221
00283___dfea613e8838563179c53eb9c77a30cb......Page 222
00284___58acafe74bf0a3fc1a508b087ec260f6......Page 223
00285___f89267ac195e2b7bc336caea6b01f5ff......Page 224
00286___694db58d4adb29b9161a7d94df2ce438......Page 225
00287___36ddb288b8fdf36887762235c92ae938......Page 226
00288___ed84cad9f97847cc4972adbd5e8f8ace......Page 227
00289___20c29e3f7c92494f418717040fc430b3......Page 228
00290___c9beaa71b619bca4bb69abeba836f523......Page 229
00291___2319b549f4d4e0ee2cb5c29775a34ca4......Page 230
00293___6af1c478d769f1c69f5e94b301f08082......Page 231
00294___c423dd63d5f033c9fa61bfd0eb4dce8f......Page 232
00298___15bfc076ef89454a414bcea447b30266......Page 233
00300___acd58012d735edc1ad183da7ab6fb54a......Page 234
00302___049753a5f15bcfdeb95cd307fe6f5802......Page 235
00303___116bfd91d85eb72b12dea5f5835ff9a2......Page 236
00304___f7f994e33698b6a3a7e77b5249f007d7......Page 237
00306___7ce0f95cb0c128047ea2d3377b468d7c......Page 238
00309___2581eed3898e38b2a504528bcbafa770......Page 239
00311___b2418b27be5c048316364ff59a198ae4......Page 240
00312___4104ae36382544a0a54770a6a8ba188a......Page 241
00313___4a043f11d258c02a4a38d6af47e07d6f......Page 242
00315___725baf2cef7e134c35a355b59f99de4d......Page 243
00317___4af53b911ae122ea34d167fdd7389206......Page 244
00318___a75c3783a958205adb44e5c5f9ee235c......Page 245
00319___b722c031367d3cbe1f55b5b786624928......Page 246
00321___dccbf53b15c0964ad0bde8ab9d2468fd......Page 247
00322___ab143829a0e5c0036c61501bbe89d520......Page 248
00324___b7474a238dfe9df066461b18d8d4de19......Page 249
00325___035cdfeeec684fe77cf33e6fae673771......Page 250
00326___1847af4f4d874c59eafcd8ea44a913fb......Page 251
00328___d946731ec26a9c595c4a11080f978eea......Page 252
00329___eb6f64f76d565d0cf2903a6ee805a760......Page 253
00331___10f2f585fb597110c63ffb0e28cb3c5b......Page 254
00332___4e72b0396c6679cd24787de3483253c6......Page 255
00333___f2eed76ad99ae969a3824c768fe9bbab......Page 256
00334___a117a2909bdfe45c0b06880e666ea2fc......Page 257
00335___4d6e40773dbcffbebdbb78f94cabf727......Page 258
00337___ebe86a4d886d354dce1431e5109210c8......Page 259
00338___5704ea3efb556698bd9fbf0966b3d348......Page 260
00339___11703aa064fc67841994281e1c0416ce......Page 261
00340___e2df09b31d1f080c295b2cd5fd54396a......Page 262
00342___00b96d195ef4c4911580606ad86d6815......Page 263
00343___d7ecee356836737969ff3990c36a400a......Page 264
00344___8e424597bbea182cc52b4c104fa78123......Page 265
00346___21c75cf4b32a0e54726a762d144bf255......Page 266
00347___2b8d81e67ee3e54629bed5a8ce757e55......Page 267
00349___c18b91929f732de1c2c63f03181e6856......Page 268
00350___39e26d34f1c738446459d5dbca57047e......Page 269
00352___1fc9d53ecaac9de5c1974bd464ed4053......Page 270
00353___d25db54768a4e1e807ea4a70a69ad935......Page 271
00354___c772c13b8704604d21eb1809c464d9d5......Page 272
00355___4069e0e14dcbfb12136a89ca56762674......Page 273
00358___884f8f0d522c70d529b568d3b3d92a5a......Page 274
00359___214078fbcb6af6a11ff24dafc2dc127f......Page 275
00360___ec07632e0c8a4aa0dfdc0b20d209ee82......Page 276
00361___7efde7975cdffda32e3477fffa565a2f......Page 277
00362___e76707a58b58b65ab1de1b8ffba2cea6......Page 278
00363___1201702125875a0e6cbf081e01ad8bf9......Page 279
00364___5b1c0580e3e9935a833c054aedc6221e......Page 280
00365___37b23b60aa5d060e6d8296c97a040dbe......Page 281
00366___452f653a321cfd5f50d6c8417b566e6f......Page 282
00367___f4b3ad294ab15ad791c60754a5c127bb......Page 283
00368___042eeab0d402da7003a0813291d35f98......Page 284
00369___d2a64720101ae9069cb78cc0b5405682......Page 285
00371___55f2af2434cef5c34caab7498a8c33ee......Page 286
00372___c4d2c718364d747392f18394d03fd0a9......Page 287
00374___55a5beb8c769f38ca1a7fa08f2a95e68......Page 288
00375___f663925d35b2b70fbf099a68c1a954cd......Page 289
00376___5d5a5b268aa32b64b15de757e9c7fd33......Page 290
00377___d1fa03f577c9436b77d33828313d11a5......Page 291
00379___cde01d14000deec4ed84cbcdb7f348fe......Page 292
00380___1fab3bdd8edfb31a2929278fb927740a......Page 293
00382___db0c5c0ae6fb59dbf0ec349316cac6f0......Page 294
00383___a3038694b59423f13f9f9027f8a97cf9......Page 295
00384___1b4b2ea5af214fdf2139cb1ae800be18......Page 296
00385___c25f1adb79469f582f495552d5fa6c48......Page 297
00386___e520a3a5827e55e031a606a657695908......Page 298
00387___8e88ee9140d3338e546c21bdfdfe90ab......Page 299
00389___915bf098deb376df48afac8a193cb0e7......Page 300
00391___03dcd813a558fec4b1226201eb506158......Page 301
00392___1e666d96278bb8a8edec1b7bd5dfa166......Page 302
00393___b18a8024d17689128b6f5a9a65f65b2d......Page 303
00394___2638670369fede57c5cf62f7bc09e14f......Page 304
00395___dc01d4b6566ab544bb64cddcdcf9a08e......Page 305
00396___5da5fa8db82e6cf2270b96ddd8e17834......Page 306
00397___223b43beec98891465d45b0178b0526b......Page 307
00398___b2fad8f872b8336c501dc836db9c1a61......Page 308
00399___34942314219dfa5620d6bd300d4b65ae......Page 309
00400___c70ca6c609c876eb934adf385db4384d......Page 310
00401___4af448a9e8d4816352e7384184111e99......Page 311
00403___e3bd1b6b08e3aa49b188fead4702af9e......Page 312
00404___978f46feeb40de2d01cc934306a9165b......Page 313
00406___6f6eae08404d02415012d10841fc55b2......Page 314
00407___0f9b374b5464aaf464826276d0eb2baf......Page 315
00408___6b4a7c592f2b5712f7fa0738003b7d4a......Page 316
00409___9e25fe2c468849932809d6ea88b372e9......Page 317
00410___2dd71990c1c3a77a3d2d85a93e6695bf......Page 318
00411___f3f6f7ee280af69eb70c5c8308500283......Page 319
00413___3edaba1af49d2a0c0abea12bad3cbc95......Page 320
00414___61e8b7045f940f4a04768113baea2573......Page 321
00416___a1a6a46c13892038d69e5227a77b3b14......Page 322
00418___7257c5abf3cf0b8d070c8162abeef635......Page 323
00419___f86220701282163a0f67695c54e44017......Page 324
00421___29feec1407e132b8b502986ba9953524......Page 325
00422___6a3f4700fc2667168cb3c110f6ede170......Page 326
00423___52fad83063feb0c7973bfa17841538c5......Page 327
00424___775fa343657870bf2486107947785e82......Page 328
00425___cd82ba030cf959a9b6cb9d500c680080......Page 329
00426___59098c2e10d63b662f9ffbdf9f7d0f18......Page 330
00427___bbe1fda848a775cda5e19353c1e83070......Page 331
00429___8ce448347adcb0e47bd595d9d76669b6......Page 332
00432___2d4bc846a67b2a45bd8eea7cdb23552e......Page 333
00433___e67839db35a9f56dab50e52adc35a29f......Page 334
00434___f4181f45abf82b4eb2b14cd9566b3462......Page 335
00435___e0e614a102d0946a076f0e7a09aa875c......Page 336
00436___9e5ac822665842f4c968d9f4a5fd95cd......Page 337
00437___bbdc54e9043a35ce197f8abe941c43e2......Page 338
00438___5a49c79d5b7e544ea7c11730c35029f5......Page 339
00439___01608872ec699f2904757877837161d4......Page 340
00441___d013642784387732242fed0e0892763c......Page 341
00442___dfab6ce47d2a8540243453ffe5d5bd7e......Page 342
00443___9679954b2158e055618a703b5370c7f4......Page 343
00444___0d53fbbea67fd239b3ef048ebeb0f450......Page 344
00446___be2467014ac66b40a2e7c1e500618498......Page 345
00447___0bfe546e35382280e0125712302ff6df......Page 346
00449___f1f47da1489f958b6405d994ee59ef50......Page 347
00450___0b0b9f358ae24c07df16b9657d7383a7......Page 348
00453___3b67f1f0a20705f5f760e9df0cd647a0......Page 349
00454___db1b731bfa3c82dd9d6284381e918efc......Page 350
00457___6ceef90c1eee582d8c45e66c92b0d348......Page 351
00459___c7c981eba189d95c3b5c57a6fda617eb......Page 352
00460___90173351901b4748a21bac8a2886e70a......Page 353
00462___d65b06fd99ccec095ba47ae0111771c3......Page 354
00464___8a74cec85565c06773c65e2a2feffbc2......Page 355
00465___28a091d23a08d3664862cfb53df66736......Page 356
00466___0c7862a313108d47ea0933110421b806......Page 357
00467___240dfc50cea7bd7bd438dadd257113e8......Page 358
00469___f3ceb52a7105819f3e10e5999f1c93ee......Page 359
00470___3e441a79251a2faac92a5c07eec13d45......Page 360
00471___b39b65a2542b4567293b81cdb6f892e2......Page 361
00472___ff5d0304cc95f39dbb1be97bc57c8d56......Page 362
00473___515d67638db74a5642dbda5ad8e1c745......Page 363
00474___c3c7e52159c95107841f013c86381194......Page 364
00475___cb96fef195df39ee533ae87311bc7329......Page 365
00478___1ad8fc0e58f3220d193af3ef0bc2d85a......Page 366
00479___45192961759393c2bf76e4f22c6acb17......Page 367
00480___db886977b15a0758353c77c4ea8c516a......Page 368
00483___c3b201ad0dd3eff955666cd3268d467a......Page 369
00484___4d0508d0f0265e7d6432b4b6824cc59c......Page 370
00486___460fbbf0aba55150b7ec7f994e5f9322......Page 371
00487___4a7d7f603ae9f1df1fba84be94df809c......Page 372
00489___2c2a4110a7a829b0848798d2ebfa5d66......Page 373
00490___15d2028e5381d3dc396857fae5392c2d......Page 374
00492___d527b1c148d94d14d7629ebd57ffbcbb......Page 375
00493___ed94851b9fa31692279c7d13bdddc5ce......Page 376
00494___f43d4276833ab17750b74bd8ce7e675e......Page 377
00495___5ca9808c7e819f2891b6feef4cc8f5b9......Page 378
00496___b39be075af3880ea8974508e82f1c584......Page 379
00497___598fa7230fddb5dc5059e255a645fa80......Page 380
00498___bbcad0082b13fc77246a7279d5314151......Page 381
00499___5f4717f1f117394cca463bbf6408f13f......Page 382
00500___58a9ef2b191e3f4abf06c4748b791ca2......Page 383
00501___75ef49730b339bdccb8b29a6ce9271a8......Page 384
00502___2153affe811880228962f92108fbb9e0......Page 385
00503___4c08b81825c8c2eeed9e8edf2452a788......Page 386
00505___07d06e9ecc28cf2822c026563f9c56ec......Page 387
00508___b499f5d5ebbbfda5b129870f1a271ff8......Page 388
00509___c53d4b770da765d5870fa22afdd86ebd......Page 389

Recommend Papers

Easy Step by Step Guide to Marketing (Easy Step by Step Guides) [2nd ed.] 0953298760, 9780953298761, 9781423746232

417 93 551KB Read more

Step by Step 9780795329852

The second volume in this enthralling collection of the British prime minister's journalistic work, tracing Hitler&

148 86 5MB Read more

Stohlman Step-by-Step 9781892214621

This book features detailed, illustrated explanations of how to carve patterns from the Al And Ann Stohlman Personal Pat

395 10 8MB Read more

Cooking Step by Step 9781465465689

A must-have cookbook for budding young chefs with over 50 mouth-watering recipes to help you cook with confidence! Intr

101 43 46MB Read more

Microsoft Word 2010 Step by Step (Step By Step (Microsoft)) [1 ed.] 0735626936, 9780735626935

Experience learning made easy-and quickly teach yourself how to create impressive documents with Word 2010. With STEP BY

477 36 18MB Read more

Crochet, Step By Step 9780744026863

443 33 57MB Read more

Fusion 360 Step by Step.

840 54 11MB Read more

JavaScript Step by Step (Step By Step (Microsoft)) [Second Edition] 0735645523, 9780735645523

Your hands-on, step-by-step guide to the fundamentals of JavaScript development. Teach yourself how to program with Java

525 83 3MB Read more

Microsoft Office Word 2003 Step by Step (Step By Step (Microsoft)) 0735615233, 9780735615236

Experience learning made easy—and quickly teach yourself how to use the word processing power in Word 2003. With STEP BY

541 28 14MB Read more

Database Solutions: A Step by Step Guide to Building Databases [2nd ed] 0321173503, 9780321173508, 9781405890342, 1405890347

Database Solutions: A step-by-step guide to building databases 2/eAre you responsible for designing and creating the dat

377 78 23MB Read more

XML Step by Step [2nd ed]
9780735614659, 0735614652

Author / Uploaded
Michael J. Young

Similar Topics
Computers
Web-design

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

File loading please wait...

Citation preview

PUBLISHED BY Microsoft Press A Division of Microsoft Corporation One Microsoft Way Redmond, Washington 98052-6399 Copyright © 2002 by Michael J. Young All rights reserved. No part of the contents of this book may be reproduced or transmitted in any form or by any means without the written permission of the publisher. Library of Congress Cataloging-in-Publication Data Young, Michael J. XML Step By Step / Michael J. Young.--2nd ed. p. cm. Includes index. ISBN 0-7356-1465-2 1. XML (Document markup language) I. Title. QA76.76.H94 Y68 2001 005.7'2--dc21

2001044924

Printed and bound in the United States of America. 1 2 3 4 5 6 7 8 9

QWT

6 5 4 3 2

Distributed in Canada by Penguin Books Canada Limited. A CIP catalogue record for this book is available from the British Library. Microsoft Press books are available through booksellers and distributors worldwide. For further information about international editions, contact your local Microsoft Corporation office or contact Microsoft Press International directly at fax (425) 936-7329. Visit our Web site at www.microsoft.com/mspress. Send comments to [email protected]. ActiveX, JScript, Microsoft, Microsoft Press, MSDN, Visual Basic, Visual Studio, and Windows are either registered trademarks or trademarks of Microsoft Corporation in the United States and/or other countries. Other product and company names mentioned herein may be the trademarks of their respective owners. The example companies, organizations, products, domain names, e-mail addresses, logos, people, places, and events depicted herein are fictitious. No association with any real company, organization, product, domain name, e-mail address, logo, person, place, or event is intended or should be inferred. Acquisitions Editor: David J. Clark Project Editor: Jean Cockburn

Body Part No. X08-24444

iii

Contents Preface ............................................................................................. vii Introduction ...................................................................................... xi Why Another XML Book? xi • What You’ll Learn in This Book xii • XML Step by Step, Internet Explorer, and MSXML xiv • Using the Companion CD xvi • Requirements xviii • How to Contact the Author xix • Microsoft Press Support Information xix

PART 1

Getting Started

1

Chapter 1

Why XML? ......................................................................................... 3 The Need for XML 4 • Displaying XML Documents 10 • SGML, HTML, and XML 11 • The Official Goals of XML 12 • Standard XML Applications 14 • Real-World Uses for XML 15 • XML Applications for Enhancing XML Documents 19

Chapter 2

Creating and Displaying Your First XML Document ........................ 21 Creating an XML Document 21 • Displaying the XML Document 29

PART 2

Creating XML Documents

43

Chapter 3

Creating Well-Formed XML Documents .......................................... 45 The Parts of a Well-Formed XML Document 46 • Adding Elements to the Document 50 • Adding Attributes to Elements 62 • Using Namespaces 69

Chapter 4

Adding Comments, Processing Instructions, and CDATA Sections ................................................... 81 Inserting Comments 81 • Using Processing Instructions 83 • Including CDATA Sections 86

iv

Chapter 5

Contents

Creating Valid XML Documents Using Document Type Definitions ................................................... 91 The Basic Criteria for a Valid XML Document 92 • The Advantages of Making an XML Document Valid 93 • Adding the Document Type Declaration 95 • Declaring Element Types 98 • Declaring Attributes 107 • Using Namespaces in Valid Documents 117 • Using an External DTD Subset 120 • Converting a Well-Formed Document to a Valid Document 125

Chapter 6

Defining and Using Entities ........................................................... 131 Entity Definitions and Classifications 131 •Declaring General Entities 135 • Declaring Parameter Entities 143 • Inserting Entity References 148 • Inserting Character References 153 • Using Predefined Entities 156 • Adding Entities to a Document 157

Chapter 7

Creating Valid XML Documents Using XML Schemas ...................................................................... 163 XML Schema Basics 165 • Declaring Elements 167 • Declaring an Element with a Simple Type 169 • Declaring Attributes 182 • Creating an XML Schema and an Instance Document 186

PART 3

Displaying XML Documents on the Web

193

Chapter 8

Displaying XML Documents Using Basic Cascading Style Sheets .............................................. 195 The Basic Steps for Using a Cascading Style Sheet 197 • Cascading in Cascading Style Sheets 211 • Setting the display Property 215 • Setting Font Properties 221 • Setting the font-variant Property 231 • Setting the color Property 232 • Setting Background Properties 235 • Setting Text Spacing and Alignment Properties 246

Chapter 9

Displaying XML Documents Using Advanced Cascading Style Sheets ....................................... 257 Setting Box Properties 258 • Using Pseudo-Elements (Internet Explorer 5.5 through 6.0 Only) 285 • Inserting HTML Elements into XML Documents 286 • Creating and Using a Full-Featured Cascading Style Sheet 291

Contents

Chapter 10

v

Displaying XML Documents Using Data Binding ....................................................................... 297 The Main Steps 298 • The First Step: Linking the XML Document to the HTML Page 299 • The Second Step: Binding HTML Elements to XML Elements 303 • Using Paging 309 • Using Scripts with the DSO 350

Chapter 11

Displaying XML Documents Using Document Object Model Scripts ......................................... 357 Linking the XML Document to the HTML Page 359 • The Structure of the DOM 360 • Accessing and Displaying XML Document Elements 367 • Accessing and Displaying XML Document Attribute Values 384 • Accessing XML Entities and Notations 388 • Traversing an Entire XML Document 392 • Checking an XML Document for Validity 398

Chapter 12

Displaying XML Documents Using XML Style Sheets ................................................................. 409 Using an XSLT Style Sheet—the Basics 411 • Using a Single XSLT Template 412 • Using Multiple Templates 432 • Using Other Select and Match Expressions 435 • Filtering and Sorting XML Data 440 • Accessing XML Attributes 451 • Referencing Namespaces in XSLT 457 • Using Conditional Structures 459

Appendix

Web Addresses for Further Information......................................... 461 General Information on XML 461 • Internet Explorer and MSXML 462 • XML Applications 462 • Namespaces 462 • URIs and URNs 462 • XML Schemas 463 • Cascading Style Sheets (CSS) 463 • Data Binding and the Data Source Object (DSO) 464 • ActiveX Data Objects (ADO) and the ADO recordset Object 464 • HTML and Dynamic HTML (DHTML) 464 • Microsoft JScript 464 • The Document Object Model (DOM) 465 • Extensible Stylesheet Language Transformations (XSLT) and XPath 465 • Author’s Web Site 465

Index .............................................................................................. 467

vii

Preface to the Second Edition I finished writing the first edition of XML Step by Step around the time of the final snowfall over the southern Rockies in the spring of 2000. Less than a year later, I was once again witnessing the last snowfalls of the season and working on XML Step by Step, this time starting the second edition. I wrote this preface to discuss my goals in writing the second edition, to describe what’s new in this edition, and to explain why Microsoft Press and I decided to create a second edition so soon after the first. My first goal in writing the second edition was to bring the book up-to-date with the many changes in XML technologies that had occurred since the book was originally published. The current version of the XML specification is still 1.0, as it was when I wrote the first edition. However, since I wrote that edition, the technologies used to display and work with XML have undergone many changes, and even the XML 1.0 specification itself has appeared in a second edition that includes error corrections and clarifications. The following are some of the important updates to the book. (Don’t worry if you haven’t heard of the XML technologies mentioned in this preface. They’re all explained in the book.) ■ To reflect the explosive growth of XML applications, I added 16 more XML applications to the list in Chapter 1. (Even with these additions, the list still represents only a small sampling of the current uses for XML.) ■ I wrote a new chapter (Chapter 7) covering XML schemas as defined by the World Wide Web Consortium (W3C) XML Schema specification, which achieved the status of recommendation in May 2001. An XML schema is used to define the content and structure of a class of XML documents. I also added several sections to Chapter 11 to explain how to check the validity of an XML document using an XML schema.

viii

Preface to the Second Edition

■ I completely revamped the final chapter in the book, which formerly covered XSL style sheets (based on the W3C’s December 1998 Extensible Stylesheet Language (XSL) Version 1.0 working draft), to cover the newer XSLT style sheets (based on the W3C’s November 1999 XSL Transformations (XSLT) Version 1.0 recommendation). I also covered many more style sheet features than before. ■ I updated the book to cover the XML features of Microsoft Internet Explorer versions 5.0 through 6.0, as well as MSXML 2.0 through 4.0. (MSXML is the software module that provides basic XML services for Internet Explorer. The first edition covered Internet Explorer versions 5.0 through 5.5, and MSXML 2.0 through 2.5.) Also, I recaptured each of the figures that shows a Windows element, such as a message box or the Internet Explorer window, using the Microsoft Windows XP Professional operating system. My second goal in revising the book was to provide new or expanded coverage on important technologies and techniques that were already available when I wrote the first edition, but that I was unable to include—or to fully cover—due to space limitations. The second edition is about 100 pages longer than the first. The following are some of the important new topics you’ll find in these additional pages: ■ I added coverage to Chapter 3 on the often confusing topic of how white space (sequences of space, tab, or line break characters) is handled in XML documents. ■ I greatly expanded the coverage on the increasingly important topic of namespaces. Namespaces are used to qualify names in XML documents so that naming conflicts can be avoided. Chapter 3 now includes a general discussion on namespaces (in the section “Using Namespaces”), and later chapters (Chapters 5, 8, 10, 11, and 12) now include information on using namespaces with specific XML technologies. ■ I added a sidebar to Chapter 3 covering the new all-inclusive URI Internet addressing scheme (“URIs, URLs, and URNs”).

xi

Introduction Extensible Markup Language, or XML, is currently the most promising language for storing and exchanging information on the World Wide Web. Although Hypertext Markup Language (HTML) is presently the most common language used to create Web pages, HTML has a limited capacity for storing information. In contrast, because XML allows you to create your own elements, attributes, and document structure, you can use it to describe virtually any kind of information—from a simple recipe to a complex database. And an XML document—in conjunction with a style sheet or a conventional HTML page— can be easily displayed in a Web browser. Because an XML document so effectively organizes and labels the information it contains, the browser can find, extract, sort, filter, arrange, and manipulate that information in highly flexible ways. XML thus provides an ideal solution for handling the rapidly expanding quantity and complexity of information that needs to be delivered on the Web.

Why Another XML Book? XML can be confusing. XML applications are appearing at an astounding rate, and XML is intimately tied to an ever-increasing number of related standards and technologies used to format, display, process, and enhance XML documents. Many of these related standards and technologies are still in their infant stages, and are rapidly changing and evolving. Most of the XML books that I have read attempt a comprehensive coverage of these technologies but get a bit lost in the maze. I believe that the typical XML book tries to survey too many XML technologies too superficially, without discriminating between the important and the unimportant, the practical and the impractical, the current and the future.

xii

Introduction

I wrote XML Step by Step to answer the most fundamental XML questions— what XML is, why it’s needed, and how it can be used—and to teach the most important, practical XML technologies available now. Although I was quite selective in choosing the topics to include in this book, I cover each of them in depth, and avoid partial solutions. (For example, because I tell you how to define XML attributes in Part 2, in Part 3 I show you how to access these attributes when you display the document.) I never truly understood XML until I started actually writing and displaying XML documents. Consequently, I gave this book a hands-on approach, including many step-by-step instructions, practical examples, and tutorial exercises. I avoided theoretical and abstract discussions that can be so difficult to understand with a topic like XML. The book and companion CD are also unique in providing a complete XML learning kit. This kit provides all the information, instruction, and software that you need to learn the practical basics of creating and displaying XML documents. The book also includes a comprehensive set of links to a wealth of XML information on the Web, which you can explore if you want to go beyond the basics.

What You’ll Learn in This Book Part 1 of this book (Chapters 1 and 2) provides a gentle introduction to XML and prepares you for the detailed information that comes later. Chapter 1 answers the basic questions I mentioned earlier—what XML is, why it’s needed, and how it’s being used to solve real-world problems. Chapter 2 provides a hands-on exercise that gives you a quick overview of the entire process of creating an XML document and displaying it in a Web browser. Part 2 (Chapters 3 through 7) focuses on the rules and techniques for creating XML documents. Chapters 3 and 4 show you how to create well-formed XML documents—documents that conform to the basic syntactical rules of XML. Chapters 5, 6, and 7 tell you how to create valid XML documents—documents that not only conform to the basic syntactical rules, but also match a specific document structure that you define either in the document itself or in a separate file. The chapters in Part 2 are based primarily on version 1.0 of the official XML specification developed by the World Wide Web Consortium (W3C). Part 3 (Chapters 8 through 12) teaches you the most important of the current techniques for displaying XML documents in Web browsers. Chapters 8, 9, and 12 explain how to display an XML document by linking a style sheet that provides the browser with formatting and other display instructions. Chapters 8

Introduction

xiii

and 9 cover cascading style sheets. A cascading style sheet (CSS) is a simple type of style sheet that allows you to precisely control the way the document content is formatted, but doesn’t allow you to modify that content. Chapter 12 explains style sheets created with XSLT (Extensible Stylesheet Language Transformations). An XSLT style sheet is a more advanced type of style sheet that allows you not only to format the document content (using CSS properties), but also to select and modify the content, giving you complete control over the displayed output. Chapters 10 and 11 teach you how to display an XML document by linking the document to a conventional HTML Web page that contains instructions for selecting and presenting the XML data. Chapter 10 explains how to do this using data binding, a straightforward technique that is suitable primarily for simple, symmetrically structured XML documents. Chapter 11 shows you how to display an XML document from an HTML page by writing a script that uses the XML Document Object Model (DOM), a much more flexible technique that allows you to display any type of XML document and any document component.

note Throughout this book, I use the term page to refer to HTML source and the term document to refer to XML source. I chose this convention to help clearly distinguish these two markup languages, which are often used in conjunction.

Part 3 focuses specifically on using the Microsoft Internet Explorer Web browser for displaying XML documents. (You’ll see more details on Internet Explorer in the following section of the Introduction.) Finally, the Appendix provides the addresses of Web sites containing an abundance of further information on most of the topics covered in this book. I also include all of these addresses in the chapters, each in the appropriate context. You’ll find a copy of the Appendix on the companion CD in the Resource Links folder, under the filename Appendix.htm. (Instructions for installing the companion CD files are given later in the Introduction.) You can visit any of these Web sites by opening Appendix.htm in your Web browser and simply clicking a link, rather than typing the address into the browser.

Introduction

xv

Internet Explorer uses the services of a separate software module to process and work with XML documents. This module is known as Microsoft XML Core Services, or just MSXML. (The product was formerly known as the Microsoft XML Parser.) When you set up Internet Explorer on a computer, it automatically installs a particular version of MSXML. For instance, Internet Explorer 6.0 automatically installs MSXML version 3.0. If you wish, you can also install a later version of MSXML so that the more advanced XML services it provides are available to the browser (in addition to the features provided by the originally installed MSXML version). For example, if you want to be able to check the validity of XML documents using the latest implementation of XML schemas (described in Chapter 7), you need to install MSXML version 4.0, which is the first version of MSXML to support this type of schema. This book covers MSXML versions 2.0 through 4.0. Because the companion CD provided with this book includes both Internet Explorer 6.0 and MSXML 4.0, you have everything you need to display the XML documents that you create using the techniques in the book. (See the next section for a description of the companion CD and for installation instructions.)

note You can download the latest version of Internet Explorer at http://www.microsoft.com/windows/ie/. You can download the latest version of MSXML through the MSDN Library on the Web at http://msdn.microsoft.com/library/.

Throughout this book, the unqualified expression Internet Explorer refers to Internet Explorer versions 5.0 through 6.0. Most of the techniques given in the book will work using any of these Internet Explorer versions, together with the version of MSXML that ships with the browser. If the book doesn’t include a reference to a specific required version, you can assume that the technique described will work with any of these versions. A few techniques, however, require Internet Explorer 5.5 through 6.0 (namely, some of the CSS properties given in Chapters 8 and 9). A few other techniques require Internet Explorer 6.0 (several of the CSS properties, plus the technique for using XSLT style sheets explained in Chapter 12). And one important technique requires that you install MSXML 4.0 (using the XML schemas described in Chapter 7 to validate XML documents). Whenever a technique requires a specific version of Internet Explorer or MSXML, the book clearly states the version requirement.

xvi

Introduction

note When the book states a version requirement, it refers to specific Internet Explorer or MSXML versions. For example, next to a feature it might state “Internet Explorer 6.0 only.” Almost certainly, a later version will also work. However, I’ve taken the conservative approach of mentioning only the versions I’ve actually tested.

Using the Companion CD The companion CD included in the back of XML Step by Step provides the following valuable resources to complement the information in the book: ■ Copies of the source files given in the numbered listings in the book. These listings (for example, Listing 2-1 in Chapter 2) provide example XML documents, style sheets, XML schema files, and HTML pages that display XML documents. Whenever I introduce a numbered listing, I also give the name of the file that contains that listing on the CD. (For example, Listing 2-1 is contained in the CD file Inventory.xml.) You’ll find all these files on the companion CD in the Example Source folder. ■ All the graphics files displayed by the example source files. These files are contained in the same CD folder as the source files, Example Source. ■ A copy of the Appendix in the Web page file Appendix.htm. This file is located in the Resource Links folder on the CD. ■ Internet Explorer version 6.0. You can use Internet Explorer 6.0 to display any of the XML documents and HTML pages provided on the companion CD. When you install Internet Explorer 6.0, it automatically installs MSXML version 3.0. (As explained earlier, MSXML is a separate software module that contains the XML processor and provides other core XML services for Internet Explorer.) ■ MSXML 4.0. You’ll need to install MSXML 4.0 in order to use the XML schemas presented in Chapter 7. ■ Microsoft XML SDK 4.0. This software development kit (SDK) provides several tools for working with XML and MSXML 4.0.

xviii

Introduction

installing MSXML 4.0 and the Microsoft XML SDK 4.0, installing Windows Installer 2.0 (which is required for installing MSXML and the XML SDK), visiting the Microsoft Press support Web site, and registering your book online with Microsoft Press. Select an option by clicking it and then simply follow the onscreen instructions.

caution To install MSXML 4.0 and the Microsoft XML SDK 4.0, you need to have Windows Installer 2.0 on your computer. If you havent previously installed Windows Installer 2.0 and you arent running Microsoft Windows XP (which includes Windows Installer 2.0), you must first use the CD installation program to install Windows Installer 2.0 before you can install MSXML 4.0 and the Microsoft XML SDK 4.0. Otherwise, when you attempt to install MSXML 4.0 and the Microsoft XML SDK 4.0, you will receive an error message indicating that the proper version of Microsoft Installer isnt present on your computer.

Requirements The following are the basic hardware and software requirements for using XML Step by Step and its companion CD: ■ To access the companion CD and to install the software that’s included on the CD, you need a computer that runs Microsoft Windows and has a CD-ROM drive and at least a 486/66-megahertz (MHz) processor (a Pentium processor is recommended). You can use Windows 98, Windows ME, Windows NT 4.0 (with Service Pack 6a or higher), Windows 2000, Windows XP, or a later version of Windows. You also need a Super VGA (800 × 600) or higher resolution monitor with 256 colors, as well as a mouse or other pointing device. ■ To view the Web sites referenced in the book, you need a connection to the Internet. However, viewing these sites isn’t required for successfully using the book; therefore, an Internet connection is optional. This book is meant to introduce you to XML, so you aren’t required to have prior knowledge of XML itself. However, several of the techniques that the book teaches for displaying XML documents use one or more of the following Web-authoring languages: HTML, Dynamic HTML (DHTML), and Microsoft JScript (the Microsoft version of the generic JavaScript scripting language).

PART 1 Getting Started

CHAPTER

3

Why XML?

1 Why XML? XML, which stands for Extensible Markup Language, was defined by the XML Working Group of the World Wide Web Consortium (W3C). This group described the language as follows: The Extensible Markup Language (XML) is a subset of SGML…Its goal is to enable generic SGML to be served, received, and processed on the Web in the way that is now possible with HTML. XML has been designed for ease of implementation and for interoperability with both SGML and HTML.

This is a quotation from version 1.0 of the official XML specification. You can read the entire document at http://www.w3.org/TR/REC-xml on the W3C Web site.

note As this book goes to press, the current version of the XML specification is still 1.0. The first edition of this specification was published in February 1998. The second edition, which merely incorporates error corrections and clarifications and does not represent a new XML version, was published in October 2000. You’ll find the text of the second edition at the URL given above (http://www.w3.org/TR/REC-xml). The XML specification has the W3C status of Recommendation. Although this status might sound a bit tentative, it actually refers to the final, approved specification. (The role of the W3C is to recommend standards, not to enforce them.)

As you can see, XML is a markup language designed specifically for delivering information over the World Wide Web, just like HTML (Hypertext Markup Language), which has been the standard language used to create Web pages since the inception of the Web. Since we already have HTML, which continues to evolve to meet additional needs, you might wonder why we require a completely new language for the Web. What is new and different about XML? What

5

Microsoft Internet Explorer displays this page as shown in the following figure:

Each element begins with a start-tag: a block of text preceded with a left angle bracket () that contains the element name and possibly other information. Most elements end with an end-tag, which is like its corresponding start-tag except that it includes only a slash (/) character followed by the element name. The element’s content is the text—if any—between the start-tag and end-tag. Notice that many of the elements in the preceding example page contain nested elements (that is, elements within other elements). Element name

Element name

An HTML element Start-tag

Content

End-tag

Why XML?

1

Chapter 1 Why XML?

6

XML Step by Step

The example HTML page contains the following elements: HTML element

Page component marked

HTML HEAD TITLE BODY H1 H2 P UL LI IMG A EM B

The entire page Heading information, such as the page title The page title, which appears in the browser’s title bar The main body of text that the browser displays A top-level heading A second-level heading A paragraph of text A bulleted list (Unordered List) An individual item within a list (List Item) An image A hyperlink to another location or page (an Anchor element) A block of italicized (EMphasized) text A block of bold text

The browser that displays the HTML page recognizes each of these standard elements and knows how to format and display them. For example, the browser typically displays an H1 heading in a large font, an H2 heading in a smaller font, and a P element in an even smaller font. It displays an LI element within an unordered list as a bulleted, indented paragraph. And it converts an A element into an underlined hyperlink that the user can click to go to a different location or page. Although the set of predefined HTML elements has expanded considerably since the first HTML version, HTML is still unsuitable for defining many types of documents. The following are examples of documents that can’t adequately be described using HTML: ■ A document that doesn’t consist of typical components (headings, paragraphs, lists, tables, and so on). For instance, HTML lacks the elements necessary to mark a musical score or a set of mathematical equations. ■ A database, such as an inventory of books. You could use an HTML page to store and display static database information (such as a list of book descriptions). However, if you wanted to sort, filter, find, and work with the information in other ways, each individual piece of information would need to be labeled (as it is in a database program such as Microsoft Access). HTML lacks the elements necessary to do this.

■ A document that you want to organize in a treelike hierarchical structure. Say, for example, that you’re writing a book and you want to mark it up into parts, chapters, A sections, B sections, C sections, and so on. A program could then use this structured document to generate a table of contents, to produce outlines with various levels of detail, to extract specific sections, and to work with the information in other ways. An HTML heading element, however, marks only the text of the heading itself to indicate how the text should be formatted. For example: Web Site Contents

Because you don’t nest the actual text and elements that belong to a document section within a heading element, these elements can’t be used to clearly indicate the hierarchical structure of a document. The solution to these limitations is XML.

The XML Solution The XML definition consists of only a bare-bones syntax. When you create an XML document, rather than use a limited set of predefined elements, you create your own elements and you assign them any names you like—hence the term extensible in Extensible Markup Language. You can therefore use XML to describe virtually any type of document, from a musical score to a database. For example, you could describe a list of books, as in the following XML document:

The first line is the XML declaration, which states that this is an XML document and gives the XML version number. (At the time of this writing, the latest XML version was 1.0.) The XML declaration is optional, although the specification states that it should be included. If you do include an XML declaration, it must appear at the very beginning of the document. The second line of the prolog consists of white space. To enhance readability, you can insert any amount of white space (spaces, tabs, or line breaks) between the components of the prolog. The XML processor ignores it. The third line of the prolog is a comment. Adding comments to an XML document is optional, but doing so can increase the document’s readability. A comment begins with the

tip In Part 2 of the book, you’ll find detailed instructions for writing not only wellformed XML documents but also valid XML documents, which meet a more stringent set of requirements.

Displaying the XML Document You can open an XML document directly within the Internet Explorer browser, just like you’d open an HTML Web page. If the XML document doesn’t contain a link to a style sheet, Internet Explorer will simply display the text of the complete document, including both the markup (the tags and comments, for example) and the character data. Internet Explorer color-codes the different document components to help you identify them, and it displays the document element as a collapsible/expandable tree to clearly indicate the document’s logical structure and to allow you to view various levels of detail. If, however, the XML document contains a link to a style sheet, Internet Explorer will display only the character data from the document’s elements, and it will format this data according to the rules you have specified in the style sheet.

Your First XML Document

■ Each element must have both a start-tag and an end-tag. Unlike HTML, XML doesn’t let you omit the end-tag—not even in situations where the browser would be able to figure out where the element ends. (In Chapter 3, however, you’ll learn a shortcut notation you can use for an empty element—that is, an element with no content.)

2

■ Elements must be properly nested. That is, if an element starts within another element, it must also end within that same element.

30

XML Step by Step

You can use either a cascading style sheet (CSS)—the same type of style sheet used for HTML pages—or an Extensible Stylesheet Language Transformations (XSLT) style sheet—a more powerful type of style sheet that employs XML syntax and can be used only for XML documents. (An XSLT style sheet lets you display attribute values and other information contained in an XML document, in addition to character data from elements.)

Display the XML Document Without a Style Sheet 1

In Windows Explorer or in a folder window, double-click the name of the file, Inventory.xml, that you saved in the previous exercise. Internet Explorer will display the document as shown here:

2

Experiment with changing the level of detail shown within the document element. Clicking the minus symbol (-) to the left of a start-tag collapses the element, while clicking the plus symbol (+) next to a collapsed element expands it. For instance, if you click the minus symbol next to the INVENTORY element, as shown here:

Chapter 2 Creating and Displaying Your First XML Document

31

Your First XML Document

2

the entire document element will be collapsed, as shown here:

Catch XML Errors in Internet Explorer Before Internet Explorer displays your XML document, its XML parser component analyzes the document contents. If the parser detects an error, Internet Explorer displays a page with an error message rather than attempting to display the document. Internet Explorer will display the error page whether or not the XML document is linked to a style sheet.

note The XML parser is the part of the XML processor that scans the XML document, analyzes its structure, and detects any errors in syntax. See the Note on page 26 for a definition of XML processor.

32

XML Step by Step

In the following exercise, you’ll investigate the Internet Explorer error-checking feature by purposely introducing an error into the Inventory.xml document. 1

In your text editor, open the Inventory.xml document that you created in a previous exercise. Change the first TITLE element from The Adventures of Huckleberry Finn

to The Adventures of Huckleberry Finn

The element-type name in the end-tag now no longer matches the element-type name in the start-tag. Remember that element-type names are case-sensitive! 2

Save the changed document.

3

In Windows Explorer or in a folder window, double-click the document filename Inventory.xml. Rather than displaying the XML document, Internet Explorer will now display the following error-message page:

Chapter 2 Creating and Displaying Your First XML Document

33

4

Because you’ll work with Inventory.xml again in later chapters, you should now restore the end-tag in the first TITLE element to its original form () and then resave the document.

note If an XML document contains more than one well-formedness error, Internet Explorer displays only the first one it encounters. You’ll need to fix the errors one at a time. After you fix each error, you’ll need to save the document and reopen it in Internet Explorer to check for additional errors. (You can quickly reopen the document or page that’s currently displayed in Internet Explorer by clicking the Refresh toolbar button or by pressing F5.)

Even though you didn’t link a style sheet to the XML document, Internet Explorer uses a default style sheet to display the document; hence the error page refers to “using XSL style sheet.” (XSL style sheets are similar to the more recent XSLT style sheets, which are covered in Chapter 12.)

tip As you work through the chapters in this book, keep in mind that you can quickly check whether an XML document is well-formed by simply opening it directly in Internet Explorer. (If you display an XML document through an HTML page, as described in Part 3, an XML document with an error will fail to display, but you won’t see an error message unless you explicitly write script code to show one.)

Your First XML Document

When you open an XML document directly in Internet Explorer, as you do in this chapter, the parser checks only whether the document is well-formed and then displays a message if it finds an error. It doesn’t check whether the document is valid.

2

note

36

XML Step by Step

The Marble Faun Nathaniel Hawthorne trade paperback 473 $10.95

Moby-Dick Herman Melville hardcover 724 $9.95

The Portrait of a Lady Henry James mass market paperback 256 $4.95

The Scarlet Letter Nathaniel Hawthorne trade paperback 253 $4.25

The Turn of the Screw Henry James trade paperback 384 $3.35

Listing 2-3.

Chapter 2 Creating and Displaying Your First XML Document

In Windows Explorer or in a folder window, double-click the Inventory01.xml filename to open the document.

Your First XML Document

Internet Explorer will open the Inventory01.xml document and display it according to the rules in the linked Inventory01.css style sheet, as shown here:

2

6

37

7

To get a feel for how you can change the XML document’s appearance by modifying the linked style sheet, open a new, empty text file in your text editor, and type in the modified CSS shown in Listing 2-4. (You’ll find a copy of this listing on the companion CD under the filename Inventory02.css.)

8

Use your text editor’s Save command to save the new style sheet on your hard disk, assigning it the filename Inventory02.css. The modified style sheet you just typed tells Internet Explorer to format the elements’ character data as follows: ■

Display each BOOK element with 12 points of space above it (margintop:12pt) and a line break above and below it (display:block), using a 10-point font (font-size:10pt).

■

Display the TITLE, AUTHOR, BINDING, and PRICE elements each on a separate line (display:block).

■

Display the TITLE element in a 12-point (font-size:12pt), bold (fontweight:bold), italic (font-style:italic) font. (Note that the 12-point fontsize specification made for the TITLE element overrides the 10-point specification made for the element’s parent, BOOK.)

■

Indent the AUTHOR, BINDING, and PRICE elements each by 15 points (margin-left:15pt).

38

XML Step by Step

■

Display the AUTHOR element in bold (font-weight:bold).

■

Do not display the PAGES element (display:none).

Inventory02.css /* File Name: Inventory02.css */ BOOK {display:block; margin-top:12pt; font-size:10pt} TITLE {display:block; font-size:12pt; font-weight:bold; font-style:italic} AUTHOR {display:block; margin-left:15pt; font-weight:bold} BINDING {display:block; margin-left:15pt} PAGES {display:none} PRICE {display:block; margin-left:15pt}

Listing 2-4. 9

In your text editor, open the Inventory.xml document. Add the following processing instruction to the end of the document prolog, directly above the INVENTORY element:

Chapter 2 Creating and Displaying Your First XML Document

39

10

To reflect the new filename you’re going to assign, change the comment near the beginning of the document from

Listing 2-5 shows the complete XML document. (You’ll find a copy of this listing on the companion CD under the filename Inventory02.xml.) 11

Use your text editor’s Save As command to save a copy of the modified document under the filename Inventory02.xml. Be sure to save it in the same file folder in which you saved Inventory02.css. Inventory02.xml

3

You can also enter an empty element—that is, one without content—into your document. You can create an empty element by placing the end-tag immediately after the start-tag, as in this example:

These two notations have the same meaning. Because an empty element has no content, you might question its usefulness. Here are two possible uses: ■ You can use an empty element to tell the XML application to perform an action or display an object. Examples from HTML are the BR empty element, which tells the browser to insert a line break, and the HR empty element, which tells it to add a horizontal dividing line. In other words, the mere presence of an element with a particular name—without any content—can provide important information to the application. ■ An empty element can store information through attributes, which you’ll learn about later in this chapter. An example from HTML is the IMG (image) empty element, which contains attributes that tell the processor where to find the graphics file and how to display it.

tip As you’ll learn in Chapter 8, a cascading style sheet can use an empty element to display an image. In Chapter 10, you’ll learn how to use data binding to access the attributes belonging to an empty or non-empty element. And in Chapters 11 and 12, you’ll learn how to use HTML scripts (Chapter 11) and XSLT style sheets (Chapter 12) to access elements (empty or non-empty) and their attributes and then perform appropriate actions.

Create Different Types of Elements 1

Open a new, empty text file in your text editor, and type in the XML document shown in Listing 3-2. (You’ll find a copy of this listing on the companion CD under the filename Inventory03.xml.) If you want, you can use the

Well-Formed Documents

Or, you can save typing by using an empty-element tag, as shown here:

60

XML Step by Step

Inventory.xml document you created in Chapter 2 (given in Listing 2-1 and included on the companion CD) as a starting point. 2

Use your text editor’s Save command to save the document on your hard disk, assigning the filename Inventory03.xml.

Inventory03.xml

note CDATA sections do not nest. That is, you cannot insert one CDATA section within another.

Adding Comments

sub-element content...

4

92

XML Step by Step

I recommend you start by learning how to create DTDs, because they embody the most basic concepts and are so ubiquitous in the XML world. You can then go on to learn XML schemas, which are potentially more complex than DTDs, but provide many more features (for example, the ability to define a data type for an element’s character data). XML schemas also offer the advantage of being written using the familiar syntax of standard XML elements and attributes. (As you’ll soon learn, DTDs employ a syntax of their own.)

tip If you decide to skip DTDs and learn only XML schemas, you should still read the first two sections of this chapter because they apply to both methods for defining valid documents.

The Basic Criteria for a Valid XML Document Every XML document must be well-formed, meaning that it must meet the minimal requirements for an XML document that are given in the XML specification. If a document isn’t well-formed, it can’t be considered an XML document. A well-formed XML document can also be valid. A valid XML document is a well-formed document that meets either of the following two additional requirements: ■ The prolog of the document includes a proper document type declaration, which contains or references a document type definition (DTD) that defines the content and structure of the XML document, and the rest of the document conforms to the content and structure defined in the DTD. ■ The document conforms to the document content and structure defined in an XML schema, which is contained in a separate file. (In Chapter 7 you’ll learn how to create a schema, and in Chapter 11 you’ll learn how to check whether a particular document conforms to a schema.)

Chapter 5 Creating Valid XML Documents Using Document Type Definitions

93

The Advantages of Making an XML Document Valid

Making an XML document valid also fosters consistency within that document. For example, a DTD or XML schema can force you to always use the same element type for describing a given piece of information (for instance, to always enter a book title using a TITLE element rather than a NAME element); it can ensure that you always assign a designated value to an attribute (for instance, hardcover rather than hardback); and it can catch misspellings or typos in element or attribute names (for instance, typing PHILUM rather than PHYLUM for an element name). Making XML documents valid is especially useful for ensuring uniformity among a group of similar documents. In fact, the XML standard defines a DTD as “a grammar for a class of documents.” Consider, for example, a Web publishing company that needs all its editors to create XML documents that conform to a common structure. Creating a single DTD or XML schema and using it for all documents can ensure that these documents uniformly comply with the required structure, and that editors don’t add arbitrary new elements, place information in the wrong order, assign the wrong data types to attributes, and so on. Of course, the document must be run through a processor that checks its validity. Including a DTD or XML schema and checking validity is especially important if the documents are going to be processed by custom software (such as a Web page script) that expects a particular document content and structure. If all users of the software use a common appropriate DTD or XML schema for their XML

Document Type Definitions

If, however, you want to make sure that your document conforms to a specific structure or set of standards, providing a DTD or XML schema that describes the structure or standards allows an XML processor to check whether your document is in conformance. In other words, a DTD or XML schema provides a standard blueprint to the processor so that in checking the validity of the document, it can enforce the desired structure and guarantee that your document meets the required standards. If any part of the document doesn’t conform to the DTD or XML schema specification, the processor can display an error message so that you can edit the document and make it conform.

5

Creating a valid XML document might seem to be a lot of unnecessary bother: You must first fully define the document’s content and structure in a DTD or XML schema and then create the document itself, following all the DTD or schema specifications. It might seem much easier to just immediately add whatever elements and attributes you need, as you did in the examples of wellformed documents in previous chapters.

94

XML Step by Step

documents, and if the documents are checked for validity, the users can be sure that their documents will be recognized by the processing software. For example, if a group of mathematicians are creating mathematical documents that will be displayed using a particular program, they could all include in their documents a common DTD that defines the required structure, elements, attributes, and other features. In fact, most of the “real-world” XML applications listed at the end of Chapter 1, such as MathML, consist of a standard DTD or XML schema that all users of the application use with their XML documents, so that checking the documents for validity ensures that they conform to the application’s structure and will be recognized by any software designed for that application.

note The Microsoft Internet Explorer processor will check a document for validity only if the document contains a document type declaration and you open the document through an HTML Web page (using the techniques you’ll learn in Chapters 10 and 11), or if you use an XML schema as explained in Chapter 11. If you open an XML document—one with or without a style sheet—directly in Internet Explorer (as you have done so far in this book and will do in Chapters 8, 9, and 12), the processor will check the entire document—including any document type declaration it contains—for well-formedness and will display a fatal error message for any infraction it encounters. However, the Internet Explorer processor will not check the document for validity, even if it contains a document type declaration. To test a document with a DTD or XML schema for validity and to see messages for any well-formedness or validity errors the document contains, you can use one of the validity checking scripts (contained in HTML Web pages) that are given in “Checking an XML Document for Validity” on page 396. (These scripts are also provided on the companion CD.) You might want to read the instructions in that section now so that you can begin checking the validity of the XML documents you create.

Chapter 5 Creating Valid XML Documents Using Document Type Definitions

95

Adding the Document Type Declaration A document type declaration is a block of XML markup that you add to the prolog of a valid XML document. It can go anywhere within the prolog—outside of other markup—following the XML declaration. (Recall that if you include the XML declaration, it must be at the very beginning of the document.)

Prolog

Document element

A document type declaration defines the content and structure of the document. If you open a document without a document type declaration (or XML schema) in Internet Explorer, the Internet Explorer processor will merely check that the document is well-formed. If, however, you open a document with a document type declaration in Internet Explorer, the processor will, under certain circumstances, check the document for validity as well as for well-formedness, and your document must therefore conform to all declarations within the document type declaration. (See the note at the end of the previous section for a description of the circumstances under which Internet Explorer checks for validity.) You won’t, for example, be able to include any elements or attributes in the document that you haven’t declared in the document type declaration. And every element and attribute that you do include must match the specifications (such as the allowable content of an element or the permissible type of an attribute value) expressed in the corresponding declaration.

Document Type Definitions

5

98

XML Step by Step

Well-Formedness and Validity Constraints Well-formedness constraints are a set of rules given in the XML specification that you must follow—in addition to the rules specified in the formal XML grammar—to create a well-formed document. Because an XML document must be well-formed, any violation of a well-formedness constraint or any other failure to achieve well-formedness is considered a fatal error. When the XML processor encounters a fatal error, it must stop normal processing of the document and not attempt to recover. Validity constraints are a further set of rules in the XML specification that you must follow if you’ve chosen to create a valid document by defining a DTD. (They don’t apply if you’ve chosen to create a valid document using an XML schema.) Because validity is optional for an XML document, a violation of a validity constraint is considered only an error, as opposed to a fatal error. When a validating XML processor (that is, one that checks documents for validity) encounters an error, it can simply report the problem and attempt to recover from it. Validity constraints consist of specific rules for creating a proper document type declaration with its DTD, and for creating a document that conforms to the specifications within your DTD.

Declaring Element Types In a valid XML document created using a DTD, you must explicitly declare the type of every element that you use in the document in an element type declaration within the DTD. An element type declaration indicates the name of the element type and the allowable content of the element (often specifying the order in which child elements can occur). Taken together, the element type declarations in the DTD map out the entire content and logical structure of the document. That is, the element type declarations indicate the element types that the document contains, the order of the elements, and the contents of these elements.

The Form of an Element Type Declaration An element type declaration has the following general form:

Chapter 5 Creating Valid XML Documents Using Document Type Definitions

99

Here, Name is the name of the element type being declared. (To review the rules for legal element names, see “The Anatomy of an Element” on page 53.) And contentspec is the content specification, which defines what the element can contain. The next section describes the different types of content specifications you can use. The following is a declaration of an element type named TITLE, which is permitted to contain only character data (no child elements would be allowed):

And here’s a declaration for an element type named GENERAL, which can contain any type of content:

5

As a final example, here’s a complete XML document with two element types. The declaration of the COLLECTION element type indicates that it can contain one or more CD elements, and the declaration of the CD element type specifies that it can contain only character data. Notice that the document conforms to these declarations and is therefore valid:

] >

Here’s a complete list of the keywords you can use to define tokenized type attributes and the constraints they impose on the attribute values: ■ ID. In each element the attribute must have a unique value. The value must begin with a letter or underscore (_) followed by zero or more letters, digits, periods (.), hyphens (-), or underscores. However, the XML specification states that ID attribute values beginning with the letters xml (in any combination of uppercase or lowercase letters) are “reserved for standardization.” Although Internet Explorer doesn’t enforce this restriction, it’s better not to begin names with xml to avoid future problems. In addition, a particular element type can have only one ID type attribute, and the attribute’s default declaration must be either #REQUIRED or #IMPLIED (described later in the chapter). You can see an example of this type of attribute in the INVENTORY document given above.

note #REQUIRED and #IMPLIED are the two forms of default declaration in which you don’t specify a default value for the attribute. It doesn’t make sense to give a default value to an ID type attribute, since the attribute must have a unique value in each attribute specification.

■ IDREF. The attribute value must match the value of some ID type attribute in an element within the document. In other words, this type of attribute refers to the unique identifier of another attribute.

Document Type Definitions

5

You could also insert the article title wherever you needed it by including an entity reference (&title;) as shown in this element:

Title: &title;

Types of Entities Entities can be a bit confusing at first because they come in so many different varieties. Although the material in this section might seem a little abstract at this point (before you’ve seen the details and examples), having this information to refer back to should make your study of entities considerably easier. Entities can be classified in three different ways: ■ General vs. parameter. A general entity contains XML text or other text or nontext data that you can use within the document element. Both examples of entities shown in the previous section (title and topics) are general entities. A parameter entity contains XML text that you can use within the DTD. In the XML specification, the unqualified term entity refers to a general entity. ■ Internal vs. external. An internal entity consists of a quoted string in the entity declaration in the DTD (such as the title entity in the previous section). An external entity is a separate file that has been declared as an entity in the DTD (such as the topics entity in the previous section). ■ Parsed vs. unparsed. A parsed entity contains XML text (character data, markup, markup declarations, or a combination of these).

Defining Entities

As you’ll see later, the entity mechanism is also useful for modularizing your XML documents: You can store blocks of markup declarations or content for elements in separate files and combine them in XML documents in various ways by declaring the files as external entities. And entities are indispensable for identifying non-XML data in an XML document, such as the graphics data for an image.

6

If you happen to be a programmer, you’ll recognize the similarity between the XML entity mechanism and defined constants in a programming language (such as those declared using the #define preprocessor directive in C).

134

XML Step by Step

When you insert a reference to a parsed entity into the document, the reference is replaced with the entity’s contents (also known as its replacement text), which become an integral part of the document. The XML parser scans the entity’s contents in the same way it scans text you have typed directly into the document. Both example entities shown in the previous section (title and topics) are parsed entities. An unparsed entity can contain any type of data: XML text or, more commonly, non-XML data. Non-XML data can be either text data (such as a title) or nontext data (such as graphics data for an image). Because an unparsed entity typically does not contain XML, you can’t insert its contents into the document using an entity reference and the XML parser doesn’t scan its contents. However, you can identify the entity by assigning the entity name to an ENTITY or ENTITIES type attribute, so that the application can access the entity’s name and description and do what it wants with the data. Because entities are classified in these three ways, and each classification has two categories, theoretically there are eight potential types of entities, as diagrammed here:

entity

general

internal

parsed

unparsed

parameter

external

parsed

unparsed

internal

parsed

unparsed

external

parsed

unparsed

However, XML does not provide the three entity types that are barred out in the figure, and thus XML actually has only five entity types, which you’ll learn how to define and use in this chapter:

Chapter 6 Defining and Using Entities

135

■ General internal parsed ■ General external parsed ■ General external unparsed ■ Parameter internal parsed ■ Parameter external parsed

You create an entity by declaring it in the document’s DTD, using a type of markup declaration similar to that used to declare elements and attributes. In the following sections, you’ll learn how to declare each type of general entity.

6

Declaring General Entities

Declaring a General Internal Parsed Entity

Here, EntityName is the name of the entity. You can select any name provided that you follow these rules: ■ The name must begin with a letter or underscore (_), followed by zero or more letters, digits, periods (.), hyphens (-), or underscores. ■ The XML specification states that names beginning with the letters xml (in any combination of uppercase or lowercase letters) are “reserved for standardization.” Although Microsoft Internet Explorer doesn’t enforce this restriction, it’s better not to begin names with xml to avoid future problems. ■ Remember that case is significant in all text within markup, including entity names. Thus, an entity named Bowser is a different entity than one named bowser.

Defining Entities

A declaration for a general internal parsed entity has the following form:

Chapter 6 Defining and Using Entities

137

The title entity contains character data plus an element (SUBTITLE). According to the declarations in the DTD, this content can be validly inserted within a TITLEPAGE, INTRODUCTION, or SECTION element, as shown here:

Title: &title; Author: Michael J. Young

Declaring a General External Parsed Entity A declaration for a general external parsed entity has the following form:

Here, EntityName is the name of the entity. You can select any name as long as you follow the general entity naming rules provided in the previous section. SystemLiteral is a system identifier that describes the location of the file containing the entity data. The system identifier can be delimited using either single quotes (') or double quotes ("), and it can contain any characters except the quote character used to delimit it. The system identifier specifies the URI (Uniform Resource Indicator) of the file containing the entity data. Currently, the most common form of URI is a traditional URL (Uniform Resource Locator). (See the sidebar “URIs, URLs, and URNs” on page 73.) You can use a fully qualified URL, such as:

Or, you can use a partial URL that specifies a location relative to the location of the XML document containing the URL, such as:

Defining Entities

Title: The Story of XML The Future Language of the Internet Author: Michael J. Young

6

The XML processor will replace the entity reference (&title;) with the entity contents, and process the contents just as if you had typed them into the document at the position of the reference, like this:

138

XML Step by Step

Relative URLs in XML documents work just like relative URLs in HTML pages. For more details on exactly how they work, see “Using an External DTD Subset Only” on page 121. The entity file contains the entity’s replacement text, which can include only items that can legally be inserted into an element (character data, nested elements, and so on, as described in “Types of Content in an Element” on page 54). As you’ll learn later in this chapter, you can ultimately insert a general external parsed entity only within an element’s content, and not within an attribute’s value.

note In a general external parsed entity file, you can optionally include a text declaration in addition to the entity’s replacement text. The text declaration must come at the very beginning of the file. For information, see the sidebar “Characters, Encoding, and Languages” on page 77.

As an example, the following DTD defines the external file Topics.xml as a general external parsed entity:

] >

Here are the contents of the Topics.xml file: Topics The Need for XML The Official Goals of XML Standard XML Applications Real-World Uses for XML

Chapter 6 Defining and Using Entities

139

This particular external entity file contains two of the items that you can include in an XML element: a nested element and a block of character data. Its contents can be validly inserted within an INTRODUCTION element (which can have any type of content), as shown in this example:

Here’s what this article covers: &topics;

Declaring a General External Unparsed Entity A declaration for a general external unparsed entity has this form:

Here, EntityName is the name of the entity. You can select any name, provided that you follow the general entity naming rules given in “Declaring a General Internal Parsed Entity” earlier in this chapter. SystemLiteral is a system identifier that describes the location of the file containing the entity data. It works the same way as the system identifier for describing the location of a general external parsed entity, which I explained in the previous section.

note The keyword NDATA indicates that the entity file contains unparsed data. This keyword derives from SGML, where it stands for notation data.

Defining Entities

Here’s what this article covers: Topics The Need for XML The Official Goals of XML Standard XML Applications Real-World Uses for XML

6

The XML processor will replace the entity reference (&topics;) with the replacement text from the external entity file, and process the text just as if you had typed it into the document at the position of the reference, like this:

140

XML Step by Step

NotationName is the name of a notation declared in the DTD. The notation describes the format of the data contained in the entity file or gives the location of a program that can process that data. I’ll explain notation declarations in the next section. The general external unparsed entity file can contain any type of text or nontext data. It should, of course, conform to the format description provided by the specified notation. For example, the DTD in the following XML document defines the file Faun.gif (which contains an image of a book cover) as a general external unparsed entity named faun. The name of this entity’s notation is GIF, which is defined to point to the location of a program that can display a graphics file in the GIF format (ShowGif.exe). The DTD also defines an empty element named COVERIMAGE, and an ENTITY type attribute for that element named Source:

] >

The Marble Faun Nathaniel Hawthorne

In the document element, the Source attribute of the COVERIMAGE element is assigned the name of the external entity that contains the graphics data for the cover image to be displayed. Because Source has the ENTITY type, you can assign it the name of a general external unparsed entity. In fact, the only way you can use this type of entity is to assign its name to an ENTITY or ENTITIES type attribute.

Chapter 6 Defining and Using Entities

141

note

6

Unlike an external parsed entity file, a general external unparsed entity file is not accessed directly by the XML processor. Rather, the processor merely provides the entity name, system identifier, and notation name to the application. Likewise, the processor doesn’t access a location or program indicated by a notation, but only passes the notation name and system identifier to the application. In fact, the Internet Explorer XML processor doesn’t even check whether a general external unparsed entity file, or the target of a notation, exists. The application can do what it wants with the entity and notation information. For example, it might run the program associated with the notation and have it display the data in the entity file. In Chapter 11, you’ll learn how to write Web page scripts that access entities and notations.

A notation describes a particular data format. It does this by providing the address of a description of the format, the address of a program that can handle data in that format, or a simple format description. You can use a notation to describe the format of a general external unparsed entity (as you saw in the previous section), or you can assign a notation to an attribute that has the NOTATION enumerated type (as described in “Specifying an Enumerated Type” in Chapter 5). A notation has the following general form:

Here, NotationName is the notation name. You can choose any name you want, provided that it begins with a letter or underscore (_), followed by zero or more letters, digits, periods (.), hyphens (-), or underscores. You should normally choose a meaningful name that indicates the format. For example, if you define a notation to describe the bitmap format, you might name it BMP. (However, the XML specification states that names beginning with the letters xml, in any combination of uppercase or lowercase letters, are “reserved for standardization.” Although Internet Explorer doesn’t enforce this restriction, it’s better not to begin names with xml to avoid future problems.) SystemLiteral is a system identifier that can be delimited using either single quotes (') or double quotes ("), and can contain any characters except the quotation character used to delimit it. You can include in the system identifier any format description that would be meaningful to the application that is going to display or handle the XML document. (Remember that the XML processor

Defining Entities

Declaring a Notation

Chapter 6 Defining and Using Entities

143

Declaring Parameter Entities You declare a parameter entity using a form of markup declaration similar to that used for general entities. In the following sections, you’ll learn how to declare both types of parameter entities.

Declaring a Parameter Internal Parsed Entity A declaration for a parameter internal parsed entity has the following general form:

■ The XML specification states that names beginning with the letters xml (in any combination of uppercase or lowercase letters) are “reserved for standardization.” Although Internet Explorer doesn’t enforce this restriction, it’s better not to begin names with xml to avoid future problems. ■ Remember that case is significant in all text within markup, including entity names. Thus, an entity named Spot is a different entity than one named spot. EntityValue is the value of the entity. The value you assign a parameter internal entity is a series of characters delimited with quotes, known as a quoted string or literal. You can assign any literal value to a parameter internal entity, provided that you observe these rules: ■ The string can be delimited using either single quotes (') or double quotes ("). ■ The string cannot contain the same quotation character used to delimit it. ■ The string cannot include an ampersand (&) except to begin a character or general entity reference. Nor can it include the percent sign (%) (for an exception, see the sidebar “An Additional Location for Parameter Entity References” on page 151).

Defining Entities

■ The name must begin with a letter or underscore (_), followed by zero or more letters, digits, periods (.), hyphens (-), or underscores.

6

Here, EntityName is the name of the entity. You can select any name, provided that you follow these rules:

146

XML Step by Step

The entity file contains the entity’s replacement text, which must consist of complete markup declarations of the types allowed in a DTD—specifically, element type declarations, attribute-list declarations, entity declarations, notation declarations, processing instructions, or comments. (I described these types of markup declarations in “Creating the Document Type Definition” in Chapter 5.) You can also include parameter entity references between markup declarations, and you can include IGNORE and INCLUDE sections. I described IGNORE and INCLUDE sections in “Conditionally Ignoring Sections of an External DTD Subset” in Chapter 5. (For exceptions to the guidelines given in this paragraph, see the sidebar “An Additional Location for Parameter Entity References” on page 151.)

note In a parameter external entity file, you can optionally include a text declaration in addition to the entity’s replacement text. The text declaration must come at the very beginning of the file. For information, see the sidebar “Characters, Encoding, and Languages” on page 77.

You can use parameter external entities to store groups of related declarations. Say, for example, that your business sells books, CDs, posters, and other items. You could place the declarations for each type of item in a separate file. This would allow you to combine these groups of declarations in various ways. For instance, you might want to create an XML document that describes only your inventory of books and CDs. To do this, you could include your book and CD declarations in the document’s DTD by using parameter external entities, as shown in this example XML document:

Adding Entities to a Document In the following exercise, you’ll get some hands-on experience with entities by adding several general entities to the Inventory Valid.xml example document that you created in Chapter 5.

Add Entities to the Example Document 1

In your text editor, open the Inventory Valid.xml document you created in “Converting a Well-Formed Document to a Valid Document” in Chapter 5. (The document is given in Listing 5-2 and on the companion CD.)

2

At the beginning of the document’s DTD (the block of text delimited with [] characters near the top of the document), add the following entity and notation declarations:

to:

Given this declaration, the following MOUNTAIN element is valid:

New Mexico 13161 Wheeler

as is this one:

180

XML Step by Step

As you’ve seen, you can control the occurrences of individual elements within an xsd:sequence, xsd:choice, or xsd:all group by assigning values—other than the default value of 1—to the minOccurs and maxOccurs attributes of the individual elements within the group. You can also include a minOccurs or maxOccurs attribute specification within the xsd:sequence, xsd:choice, or xsd:all element itself to control the number of occurrences of the entire group. (As with xsd:element elements, the default value of both attributes is 1.) You’ll see an example of this practice in the declaration of the PARA element given in the next section.

Declaring an Element with Mixed Content An element declared with mixed content can contain any type of character data, interspersed with any child elements that are declared in the type definition. To specify a mixed content element, declare the element with element content exactly as described in the previous section, but add the mixed="true" attribute specification to the xsd:complexType element’s start-tag. For example, according to the following declaration, a TITLE element can contain any type of character data before or after its one child element, SUBTITLE:

Here’s an example of a valid element: Moby-Dick Or, The Whale

Recall from Chapter 5 that if you declare an element with “mixed content” using a DTD, the specified child elements can occur in any order and with any number of repetitions (zero or more), interspersed with any amount of character data. The following schema declaration emulates a mixed content declaration in a DTD. Specifically, it stipulates that a PARA element can contain BOLD, ITALIC, or UNDERLINE child elements in any order and with any number of repetitions (zero or more), interspersed with character data.

If the PartNum attribute specification is omitted from the start-tag of the element for which it is declared, the XML processor will pass the value 0-00-0 to the application. If the PartNum attribute specification is included, the processor will pass the specified value. Note that to include a default attribute specification in the xsd:attribute element, the use attribute must be set to optional or omitted. The fixed attribute provides an alternative to the default attribute, as shown in this attribute declaration:

caution Keep in mind that a default attribute value, specified using the default or fixed attribute, won’t be passed to the application (such as a script in an HTML page) unless the XML document is validated against the schema when the document is loaded, as demonstrated in the validity-testing page given in “Checking an XML Document for Validity Using an XML Schema” on page 400.

note For a description of all xsd:attribute attributes, see the section “3.2.2 XML Representation of Attribute Declaration Schema Components” in the “XML Schema Part 1: Structures” page at http://www.w3.org/TR/xmlschema-1/.

To add an attribute to an element declaration, so that the attribute can (or must) be used with that element, you must place the xsd:attribute schema element within the element’s content model. How you do this depends upon the type of the element, as discussed in the following two sections.

XML Schemas

If the Priority attribute specification is omitted from the element’s start-tag, the XML processor will pass the value high to the application (as with the default attribute). If the Priority attribute specification is included, it must have the value high (and of course, the processor will pass this value to the application). Note that you can’t have both a fixed and a default attribute specification in the same xsd:attribute element.

7

184

XML Step by Step

Adding Attributes to an Element with Element, Mixed, or Empty Content To add an attribute to an element with element content, mixed content, or empty content, you simply include the attribute declaration within the element’s content model inside the xsd:complexType schema element. You must, however, include the attribute declaration after any xsd:sequence, xsd:choice, or xsd:all element. The following is an example of a declaration for an element (FILM) with element content that includes a declaration for an attribute (Class) at the end of its content model:

Here is a valid FILM element according to this declaration:

The Net Sandra Bullock

Chapter 7 Creating Valid XML Documents Using XML Schemas

185

The following is an example of a declaration for an element with empty content (IMAGE) that includes a declaration for an attribute (Source) as the only declaration within its content model:

Here’s an example of a valid element conforming to this declaration:

And here’s a conforming AUTHOR element: 473

The xsd:simpleContent element included as a child of xsd:complexType indicates that you are going to derive a new complex type from a simple type, rather than build up a content model as was done in the previous examples. The xsd:extension element within xsd:simpleContent does the actual deriving. Its base attribute specifies the simple type of the PAGES element’s character data. The xsd:attribute element within xsd:extension declares the attribute that PAGES can include along with the character data (thereby “extending” the derived type beyond mere character data).

Creating an XML Schema and an Instance Document In the following exercises you’ll first create an XML schema and then an XML instance document that conforms to the schema. These files illustrate almost all the techniques covered in this chapter.

Create the Schema 1

Open a new, empty text file in your text editor, and type in the XML schema given in Listing 7-3. (You’ll find a copy of this listing on the companion CD under the filename Inventory Schema.xsd.)

Chapter 7 Creating Valid XML Documents Using XML Schemas

187

Inventory Schema.xsd

The Raven Edgar Allan Poe 1845 This is a floating margin note.

Once upon a midnight dreary, while I pondered, weak and weary, Over many a quaint and curious volume of forgotten lore—

280

XML Step by Step

Display a Floating Image 1

In your text editor, open the Raven.css style sheet file given in Listing 9-1 and provided on the companion CD.

2

Modify the style sheet so that it appears as shown in Listing 9-5. The main addition to the original style sheet is the rule for the IMAGE elements: IMAGE {background-image:url(Raven.bmp); background-repeat:no-repeat; background-position:center; width:89px; height:58px; float:left}

IMAGE is an empty element (which you’ll add to the XML document later) designed for displaying a floating image. The element contains no text, but is assigned a background image to display instead (through the first three property settings in the rule). You assigned the element’s width and height properties the exact width and height of the image. Because the image file is bitmapped, it’s important to assign the size in pixels so that the whole image will be displayed on any monitor and in any graphics mode. Note that if you didn’t assign width and

282

XML Step by Step

Because you assigned the IMAGE elements the float:left property setting in the style sheet, they will float to the left of each STANZA element (which contain the following document text). 7

Use your text editor’s Save As command to save a copy of the modified document under the filename Raven02.xml. You can see the complete document in Listing 9-6. A copy is also provided on the companion CD under the filename Raven02.xml.

Raven02.xml

The second form of data island conforms more closely to the basic XML tenet of keeping the data itself (the XML document) separate from the formatting and processing information (the style sheet, or in this chapter, the HTML page). In particular, the second form makes it easier to maintain the XML document, especially when a single document is displayed in several different HTML pages. Therefore, you’ll see only the second form of data island in the examples in this book. You should assign to the data island’s ID attribute a unique identifier, which you’ll use to access the XML document from within the HTML page. (In the previous example, I assigned ID the value dsoInventory.)

Data Binding

10

The Adventures of Huckleberry Finn Mark Twain mass market paperback 298 $5.49

Leaves of Grass Walt Whitman hardcover 462 $7.75

The Legend of Sleepy Hollow Washington Irving mass market paperback 98 $2.95

The Marble Faun Nathaniel Hawthorne trade paperback 473 $10.95

Moby-Dick Herman Melville hardcover 724 $9.95

The Portrait of a Lady Henry James mass market paperback 256 $4.95

305

Data Binding

Chapter 10 Displaying XML Documents Using Data Binding

306

XML Step by Step

The Scarlet Letter Nathaniel Hawthorne trade paperback 253 $4.25

The Turn of the Screw Henry James trade paperback 384 $3.35

Listing 10-1.

When you bind the table to the XML document, the data belonging to each record element is displayed in a separate table row, with each of its child field elements displayed in a separate cell within that row. As an example, the HTML page in Listing 10-2 contains a table bound to the data in the Inventory.xml document of Listing 10-1. (You’ll find a copy of Listing 10-2 on the companion CD under the filename Inventory Table.htm.) Inventory Table.htm

314

XML Step by Step

|< First Page



< Previous Page



Next Page >



Last Page >|

Title	Author	Binding	Pages	Price

Listing 10-4.

Chapter 10 Displaying XML Documents Using Data Binding

315

Using a Nested Table to Display a Hierarchical Record Set In the previous sections, you learned how to use a single table to display an XML document structured as a simple record set. (A simple record set is described at the beginning of the earlier section “Using a Single HTML Table to Display a Simple Record Set.”) You’ll now learn how to use nested tables to display an XML document whose elements are structured in a hierarchical record set. In a hierarchical record set, each record can contain, in addition to an optional fixed set of fields, zero or more nested records (see the following Note). Listing 10-5 shows an example of an XML document structured as a hierarchical record set. (You’ll find a copy of this listing on the companion CD under the filename Inventory Hierarchy.xml.) In this document, the root element (INVENTORY) contains a series of CATEGORY records. Each CATEGORY record begins with a single CATNAME field, which contains character data only, and then has two or more nested BOOK records. Each nested BOOK record has exactly five fields (TITLE, AUTHOR, BINDING, PAGES, and PRICE).

As a general rule, if an element contains one or more children (or attributes, as explained later), or if the same element type occurs more than once within a given parent, the DSO interprets the element or elements as a record (or set of records), rather than as a field (or set of fields). In Inventory Hierarchy.xml, the CATEGORY and BOOK elements meet both of these criteria, so they are clearly considered to be records. (The CATNAME, TITLE, AUTHOR, BINDING, PAGES, and PRICE elements meet neither criteria, so they are considered to be fields.) A TABLE element is the only type of HTML element that you can bind to an XML record.

10

note

Inventory of Classic English Literature

Listing 10-6.

10

Classic English Literature

Title	Author	Binding	Pages	Price

319

Data Binding

Chapter 10 Displaying XML Documents Using Data Binding

320

XML Step by Step

In Listing 10-6, the outer table is bound to the XML document, as you can see in its start-tag: Mark Twain mass market paperback 298 $5.49

In data binding, the AUTHOR element would be interpreted like this: 1835Mark Twain

As a result, the DSO would store the element as a nested record, not as a field. (Recall that field elements can contain only character data and not child elements.) Therefore, the record set would become a hierarchical record set rather than a simple record set, and you would have to display the nested record using a nested table, as described in the section “Using a Nested Table to Display a Hierarchical Record Set” earlier in the chapter. You need one more fact, however, to be able to display both the character data (Mark Twain) and the attribute of such a nested record: The DSO uses the special name $TEXT to refer to all of the character data within an element, including all text within any child elements but not including attribute values. Thus, it would interpret the AUTHOR element as follows:

1835 Mark Twain

You could use $TEXT as the field name to bind a table cell to the AUTHOR record’s character data content.

note The previous discussion stated that the DSO treats an attribute as if it were a child element. One difference, however, is that $TEXT includes the text of all child elements but doesn’t include attribute values.

Chapter 10 Displaying XML Documents Using Data Binding

347

Listing 10-12 contains an HTML page that demonstrates all of the techniques given in this section. (You’ll find a copy of this page on the companion CD under the filename Inventory Attribute.htm.) This page displays the Inventory Valid.xml XML document (which you’ll find in Listing 5-2 and on the companion CD). Inventory Attribute.htm

For more information on data islands, see “The First Step: Linking the XML Document to the HTML Page” in Chapter 10. As you learned in Chapter 10, the ID that you assign to the data island refers to the document’s DSO. You use the XMLDocument member of the DSO to access the DOM, as shown in this line of script code: Document = dsoBook.XMLDocument;

tip If you want to access more than one XML document from an HTML page, you can insert a data island for each. You can even include more than one data island for a single XML document. (The latter technique might be useful for maintaining several different versions of the XML data if your page modifies the contents of the DOM data cached in memory. This chapter, however, doesn’t cover the techniques for modifying DOM data.)

Document Object Model Scripts

Thus, adding a data island to an HTML page causes Internet Explorer to create both a DSO (represented directly by the data island’s ID) and a DOM (accessed through the DSO’s XMLDocument member).

11

Specifically, the XMLDocument member contains the root object of the DOM, known as the Document node. You’ll use the Document node to access all the other DOM objects.

360

XML Step by Step

note When you open an XML document through a data island in an HTML page, Internet Explorer checks the document for well-formedness and also—if the document includes a document type declaration—for validity. If the document contains an error, the DOM nodes will be empty of data and a DOM script will fail in its attempt to display the XML data. However, you won’t see a message indicating the specific error in the XML document. To see a helpful description of any well-formedness or validity error in the linked XML document, you can test that document using one of the validity testing pages presented in “Checking an XML Document for Validity” on page 396.

The Structure of the DOM In the DOM, the programming objects that represent the XML document are known as nodes. When Internet Explorer processes a linked XML document and stores it within a DOM, it creates a node for each of the basic components of the XML document, such as the elements, attributes, and processing instructions. The DOM uses different types of nodes to represent different types of XML components. For example, an element is stored in an Element node and an attribute in an Attribute node. Table 11-1 lists the most important of these node types. Node type

XML document component that the node object represents

Node name (nodeName property)

Node value (nodeValue property)

Document

The entire XML document (This is the root node of the DOM node hierarchy. It’s used to access all other nodes.) An element

#document

null

Element type name (for example, BOOK)

null (any character data contained in the element is in one or more child Text nodes)

Element

continued

362

XML Step by Step

note In the Microsoft XML DOM documentation (cited in the Tip on page 356), each of the node names listed in the first column of Table 11-1 is prefaced with IXMLDOM—for example, IXMLDOMDocument, IXMLDOMElement, and IXMLDOMText. (These are the names of the programming interfaces for each node type.) Note also that the common node properties and methods (described later in this chapter) are listed under the name IXMLDOMNode.

You can obtain each of the node names (listed in the third column) from the node’s nodeName property. The names beginning with a # character are standard names for nodes representing XML components not named in the document. (For example, a comment is not given a name in an XML document. Therefore, the DOM uses the standard name #comment.) The other node names derive from the names assigned to the corresponding components in the XML document. (For example, the Element node representing an element of type BOOK would also be named BOOK.) You can obtain each of the node values (listed in the last column) from the node’s nodeValue property. Notice that for certain types of nodes (namely, Document, Element, DocumentType, Entity, and Notation nodes), nodeValue is always set to null, indicating that a node of that type doesn’t have a value (either because the corresponding document component doesn’t have a value, or because the DOM stores the component’s value in a child Text or Attribute node). You’ll learn more about many of the node types listed in Table 11-1 later in the chapter.

note For a type of node that has a value, such as an Attribute node, if the corresponding document component contains no text (for instance, an attribute is assigned an empty string), nodeValue is set to an empty string, not to null.

The DOM organizes the XML document’s nodes in a treelike hierarchical structure that mirrors the hierarchical structure of the document itself. It creates a single Document node that represents the entire XML document and serves as the root of this hierarchy. Note that the logical hierarchical structure of the XML elements, of which the document element is the root, is only a branch of the hierarchical structure of DOM nodes, which encompasses the entire document.

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

363

Consider, for instance, the example XML document in Listing 11-1. (You’ll find a copy of this listing on the companion CD under the filename Inventory Dom.xml.) This document contains an XML declaration, a comment, and a root element that includes child elements as well as attributes. The following figure shows the hierarchical organization of the nodes that the DOM creates to represent this document. For each component of the example document, the figure indicates the type of node used to represent the component (for example, Document, Comment, and Element) as well as the node’s name (given in parentheses—for example, #document, #comment, and INVENTORY). Document (#document )

ProcessingInstruction (xml )

Comment (#comment )

Element (INVENTORY)

Attribute (version )

Element (TITLE)

Element (PAGES)

Element (PRICE)

Element (BOOK)

Attribute (Binding )

Element (TITLE)

Element (AUTHOR)

Element (PRICE)

Element (AUTHOR)

Text Attribute Text Text Text (#text ) (Born ) (#text ) (#text ) (#text )

Attribute (Binding )

Element (PAGES)

Element (TITLE)

Text Attribute Text Text Text (#text ) (Born ) (#text ) (#text ) (#text )

Element (AUTHOR)

Text Attribute (#text ) (Born )

Element (PAGES)

Element (PRICE)

Text Text Text (#text ) (#text ) (#text )

11

Attribute (Binding )

Element (BOOK)

Document Object Model Scripts

Element (BOOK)

364

XML Step by Step

Inventory Dom.xml

DomDemo Fixed.htm

378

XML Step by Step

HTMLCode += "Title: " + Document.documentElement.childNodes(i).childNodes(0).text + "
" + "Author: " + Document.documentElement.childNodes(i).childNodes(1).text + "
" + "Binding: " + Document.documentElement.childNodes(i).childNodes(2).text + "
" + "Number of pages: " + "" + Document.documentElement.childNodes(i).childNodes(3).text + "
" + "Price: " + Document.documentElement.childNodes(i).childNodes(4).text + "

"; } DisplayDIV.innerHTML=HTMLCode;

Book Inventory

Listing 11-4.

The script in the example page uses the length property to determine the number of BOOK elements contained within the root element. (The length property is a member of the NodeList collection object provided by the childNodes property of the root element node. See Table 11-4.) The script contains a for loop that is executed once for each BOOK element and includes the code to display each of these elements:

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

379

for (i=0; i < Document.documentElement.childNodes.length; i++) { /* code to display a BOOK element... */ }

The script stores all of these blocks of HTML markup in the variable HTMLCode. After the for loop, when all blocks have been generated and stored in HTMLCode, the script assigns the HTML markup to the innerHTML property of the DIV element in the BODY of the page (this element has the ID DisplayDIV): DisplayDIV.innerHTML=HTMLCode;

Document Object Model Scripts

for (i=0; i < Document.documentElement.childNodes.length; i++) { HTMLCode += "Title: " + Document.documentElement.childNodes(i).childNodes(0).text + "
" + "Author: " + Document.documentElement.childNodes(i).childNodes(1).text + "
" + "Binding: " + Document.documentElement.childNodes(i).childNodes(2).text + "
" + "Number of pages: " + "" + Document.documentElement.childNodes(i).childNodes(3).text + "
" + "Price: " + Document.documentElement.childNodes(i).childNodes(4).text + "

"; }

11

Because the number of BOOK elements is unknown, the page can’t use a fixed set of SPAN elements in its BODY to display the data (as did the previous example page in Listing 11-3). Rather, for each BOOK element, the script dynamically generates the entire block of HTML markup necessary to display the element:

380

XML Step by Step

The DIV element then immediately renders the HTML and displays the results, which you saw in the previous figure. To convince yourself that the page works regardless of the number of BOOK elements the XML document contains, you might edit the data island in this page so that it displays Inventory Big.xml, which contains twice as many BOOK elements as Inventory.xml:

Using Other Ways to Access Elements The example scripts you’ve seen so far have accessed Element nodes by traversing the node hierarchy using the childNodes or the firstChild node property to move from one node to an adjoining node. Keep in mind that you can use the lastChild, previousSibling, nextSibling, and parentNode node properties in analogous ways. Table 11-2 describes all these properties.

note The childNodes, firstChild, and lastChild properties allow you to access only nonattribute child nodes, while the previousSibling and nextSibling properties let you access any type of sibling node. For a Document or Attribute node, the parentNode property is always set to null.

Another way to access XML elements is to use the getElementsByTagName property to extract all elements that have a particular type name (such as TITLE). This method is available for a Document node, as described in Table 11-3, as well as for an Element node, as described in Table 11-6. If you call the method for a Document node, it returns a collection of Element nodes for all elements in the document that have the specified type name. For example, the following statement obtains a collection of nodes for all elements in the document that have the name BOOK: NodeList = Document.getElementsByTagName(“BOOK”);

If you call getElementsByTagName for an Element node, as shown in the following example, it returns a collection of nodes for all matching elements that are descendents of the Element node: NodeList = Element.getElementsByTagName(“AUTHOR”);

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

381

tip If you pass the value “*” to getElementsByTagName, it returns a collection of nodes for all elements (all elements in the document if you call the method for a Document node and all descendent elements if you call it for an Element node).

Element node method

Description

Example

getAttribute (attr-name)

Returns the value of the element’s attribute that has the specified name.

AttValue = Element.getAttribute ("InStock");

getAttributeNode (attr-name)

Returns the Attribute node representing the element’s attribute that has the specified name.

Attribute = Element.getAttributeNode ("InStock");

getElementsByTagName (type-name)

Returns a NodeList collection of Element nodes for all of this element’s descendent elements that have the specified type name. If you pass "*", it returns nodes for all descendent elements.

AuthorElementCollection = Element.getElementsByTagName ("AUTHOR");

note If the type name of the elements you want to retrieve includes a namespace prefix, you’ll need to include that prefix in the value you pass to getElementsByTagName, as in the following example: NodeList = Document.getElementsByTagName("inventory:BOOK");

For information on namespaces, see “Using Namespaces” on page 69.

The getElementsByTagName method supplies the Element nodes in the form of a NodeList collection object. You can therefore access the individual nodes using any of the techniques discussed in “Using a NodeList Object” on page

Document Object Model Scripts

11

Table 11-6. Useful methods provided by Element nodes. With Element nodes you can also use any of the common node properties listed in Table 11-2.

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

383

GetElements.htm

INVENTORY (BOOK)*> BOOK (TITLE, AUTHOR, BINDING, PAGES, PRICE)> BOOK Review ENTITY #IMPLIED> TITLE (#PCDATA)> AUTHOR (#PCDATA)> BINDING (#PCDATA)> PAGES (#PCDATA)> PRICE (#PCDATA)>

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

389

Get Entity Information

Listing 11-8.

Each BOOK element in the example XML document contains an ENTITY type attribute named Review, which is assigned the name of an unparsed entity that contains a review of the particular book. The example HTML page includes a script that demonstrates the basic steps a DOM script must perform to extract all of the information about an entity when it encounters an attribute that has the ENTITY or ENTITIES type. Specifically, the script extracts information on the unparsed entity assigned to the Review attribute within the first BOOK element. It displays this information in an “alert” message box that looks like this:

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

391

Here’s a brief explanation of the basic steps that the script performs: 1

The script obtains the Attribute node for the Review attribute in the first BOOK element: Attribute Document.documentElement.childNodes(0).attributes(0);

2

The script uses the dataType node property (see Table 11-2) to determine whether the attribute has the ENTITY type: if (Attribute.dataType == “entity”) { /* obtain entity information ... */ }

The script performs the remaining steps only if the attribute does have the ENTITY type. That is, the remaining steps are included in the if statement, and are executed only when the if condition is true. 3

The script obtains the Entity node for the DTD declaration of the entity assigned to the attribute:

4

The script obtains the entity’s system identifier, which specifies the URI of the file containing the entity data. The system identifier is stored as the value of an Attribute node named SYSTEM: DisplayText += “entity file = “ + Entity.attributes.getNamedItem(“SYSTEM”).nodeValue + “\n”;

Document Object Model Scripts

The Document property doctype (explained in Table 11-3) provides a DocumentType node representing the document type declaration. The DocumentType property entities provides a NamedNodeMap collection of Entity nodes for all entity declarations in the DTD. The Entity node for the specific entity assigned to the attribute is obtained by passing the entity’s name (Attribute.nodeValue) to the NamedNodeMap’s getNamedItem method, which you saw in Table 11-7.

11

Entity = Document.doctype.entities.getNamedItem(Attribute.nodeValue);

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

393

Create the Node-Traversing Page 1

Open a new, empty text file in your text editor, and type in the HTML page shown in Listing 11-9. (You’ll find a copy of this listing on the companion CD under the filename ShowNodes.htm).

2

Use your text editor’s Save command to save the document on your hard disk, assigning it the filename ShowNodes.htm.

ShowNodes.htm

DTD Validity Tester

Listing 11-10.

The HTML page contains a script that it runs when the browser first opens the window for the page:

The script first obtains the Document node for the XML document that is linked through the data island: Document = dsoTest.XMLDocument;

Document Object Model Scripts

message = "parseError.errorCode: " + Document.parseError.errorCode + "\n" + "parseError.filepos: " + Document.parseError.filepos + "\n" + "parseError.line: " + Document.parseError.line + "\n" + "parseError.linepos: " + Document.parseError.linepos + "\n" + "parseError.reason: " + Document.parseError.reason + "\n" + "parseError.srcText: " + Document.parseError.srcText + "\n" + "parseError.url: " + Document.parseError.url; alert (message);

11

By the time the script receives control, the Internet Explorer XML processor has already loaded—or attempted to load—the XML document, and the document’s error status is contained in the Document node’s parseError property. The parseError property is a programming object with its own set of properties. The script concludes by displaying all of the parseError properties:

402

XML Step by Step

The parseError properties fully describe the XML document’s error status. If the document is error-free, parseError.errorCode is set to zero and the other properties are set to either zero or blank. If the document has one or more errors, parseError.errorCode contains the numeric code for the first error, and the other properties describe this error.

Checking an XML Document for Validity Using an XML Schema This section presents an HTML page that checks a document for validity with respect to an XML schema. The page contains a script that opens both an XML document and an XML schema file, and checks whether the XML document is well-formed and whether it conforms to the schema. The page reports any wellformedness errors in the XML document as well as any failure of the XML document to conform to the constraints defined in the XML schema. Internet Explorer itself reports any errors contained in the XML schema file.

caution To use the XML validity-testing page presented here, MSXML version 4.0 must be installed on the computer. For information on MSXML, see “XML Step by Step, Internet Explorer, and MSXML” in the Introduction.

How to Use the XML Schema Validity-Testing Page 1

In your text editor, open the XML schema validity-testing page, Validity Test Schema.htm. (You’ll find this file on the companion CD and in Listing 11-11 in the next section.)

2

Edit the values of the three variables at the beginning of the script as follows:

Variable to edit

Set to this value

XMLFileURL XMLNamespaceName

The URL of the XML document If the XML document’s root element belongs to a namespace (a default namespace or one assigned through a namespace prefix), then set XMLNamespaceName to the corresponding namespace name. Otherwise, set it to an empty string. For information on namespaces, see “Using Namespaces” on page 69. The URL of the XML schema file

SchemaFileURL

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

403

Typically, the validity-testing page, the XML document, and the XML schema file are all located in the same folder; in this case you would need to assign XMLFileURL and SchemaFileURL just the simple filenames. For example, to test the validity of the Book Instance.xml document against the Book Schema.xsd XML schema (assuming that the Book Instance.xml document element doesn’t belong to a namespace and that all three files are in the same folder), you would set the variables as follows: var XMLFileURL = “Book Instance.xml”; var XMLNamespaceName = “”; var SchemaFileURL = “Book Schema.xsd”;

3

Use your text editor’s Save command to save the modified page.

4

Open the page in Internet Explorer: If the XML document is well formed and conforms to the schema, you’ll see the following message box:

■

If the XML document contains a well-formedness or validity error, you’ll see a message box similar to the following one:

Document Object Model Scripts

■

11

You’ll now see one of the following three messages:

404

XML Step by Step

■

If the XML schema itself contains an error—either a well-formedness error or a violation of one of the rules of the XML Schema definition language—the message box will not appear. Rather, Internet Explorer will display an error icon at the left end of its status bar. Double-clicking this icon: will display an error message such as the following one:

(If you check the Always Display This Message When A Page Contains Errors option, from then on the error message will appear automatically and you won’t need to double-click the error icon in the status bar to display the message.)

note If the XML schema contains an error and you have a debugger installed, such as the one provided with Microsoft Visual Studio .NET, you might see a different type of error message than the one described here.

How the XML Schema Validity-Testing Page Works The XML schema validity-testing page is given in Listing 11-11. (You’ll find a copy of this listing on the companion CD under the filename Validity Test Schema.htm.)

Chapter 11 Displaying XML Documents Using Document Object Model Scripts

405

Validity Test Schema.htm

Listing 12-1.

Using XSLT Style Sheets

Moby-Dick

Herman Melville

hardcover 724 $9.95

Listing 12-2.

Here’s how Internet Explorer displays the XML document, following the instructions in the style sheet:

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

415

Every XSLT style sheet must have the following document element:

416

XML Step by Step

note Keep in mind that a location path consisting of the root operator (/) does not match the node for the document (or root) element of the XML document. Rather, it matches the XSLT root node representing the entire document, of which the document element is an immediate child. (The XSLT root node is thus equivalent to the root Document node of the Document Object Model, discussed in Chapter 11.)

When Internet Explorer processes a style sheet, it first looks for a template that matches the XSLT root node. If it finds such a template, it then carries out the transformation instructions contained in that template. In the example style sheet, the template matching the XSLT root node is the sole template and it therefore contains instructions for transforming the complete XML document. As you’ll learn later, a style sheet sometimes divides the task of transforming the document among several templates, each of which matches a particular XML document component or set of components and transforms the corresponding branch or branches of the document. For information on the way the browser applies the templates contained in a style sheet, see the following sidebar “How Internet Explorer Applies XSLT Templates.”

How Internet Explorer Applies XSLT Templates The simple style sheet in Listing 12-1 contains a single template that matches the XSLT root node. It’s permissible, however, for a style sheet to omit the template matching the XSLT root node, to have several templates, or even to have no templates at all. It’s important to understand how the browser transforms the document in each situation. In all cases, the browser begins by “applying a template” to the XSLT root node. The following are the steps that it takes when it applies a template to the root node or to any other node: 1

It looks for a template defined in the style sheet that matches the node.

2

If it finds a matching template, it turns over the transformation of the node to that template. That is, it executes the transformation instructions contained in the template.

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

3

417

If it doesn’t find a matching template, it uses the appropriate built-in template. A built-in template is one whose behavior is defined by the XSLT specification, rather than in an xsl:template element that you include in your style sheet. The particular built-in template the browser uses depends upon the type of the node that is being transformed, as follows: ■

The built-in template for the XSLT root node applies a template to each child node of the root node—that is, for each child node it performs steps 1 through 3. The child nodes of the root node include the document element node, and possibly one or more comment or processing instruction nodes.

■

The built-in template for an element node applies a template to each child node of the element node—that is, it performs steps 1 through 3 for each of these nodes. The possible child nodes of an element node include a text node, which represents any character data contained directly within the element, as well as nested element nodes, comment nodes, and processing instruction nodes.

■

The built-in template for a text node displays the text—that is, it outputs the character data associated with the text node’s parent element node.

The built-in template for a comment or processing instruction node does nothing—that is, it doesn’t display the node’s text content. (These types of nodes don’t have child nodes.)

For instance, when the browser processes the example style sheet in Listing 12-1, it immediately finds a template matching the XSLT root node and simply turns over control to that template. If that style sheet also contained a template matching the BOOK element node, that template would never be used (unless the root node template contained an xsl:apply-templates element that selected the BOOK element node, as explained later in the chapter). If the example style sheet contained only a template matching the BOOK node, continued

Using XSLT Style Sheets

■

12

The built-in template for an attribute node also displays the associated text (in this case, the attribute’s value). However, if an element has an attribute, the attribute node is not considered to be a child of the element node; therefore, the built-in template for an element node does not apply a template to the attribute node. The browser applies a template to an attribute node only if one of the templates that you write explicitly selects the attribute node within an xsl:apply-templates element, as explained later in the chapter.

418

XML Step by Step

continued

comment—the browser wouldn’t find a matching template and would use its built-in template, which would do nothing. For the second child node— the one representing the BOOK document element—the browser would find the matching template in the style sheet and would turn over handling of the BOOK node and all its children to that template. The browser would not look for any additional templates (unless the BOOK template contained an xsl:apply-templates element). You can deduce from the three steps outlined in this sidebar what the browser would do if the linked XSLT style sheet contained just an empty xsl:stylesheet element with no templates. You’re right: The browser would display all of the character data found in the XML document’s elements, but would not display any attribute values or the content of any comments or processing instructions.

The following is the complete template in the example style sheet:

Book Inventory

Book Inventory Author:
Title:

Using XSLT Style Sheets

Assume that the style sheet used to display this document contains the following template:

12

Mark Twain

mass market paperback 298 $5.49

The Adventures of Tom Sawyer

Mark Twain

mass market paperback 205 $4.75

The Ambassadors

Henry James

mass market paperback 305 $5.95

424

XML Step by Step

Price:
Binding type:
Number of pages:

This template uses the technique discussed in the previous section. Notice that the location path assigned to each select attribute starts with the document element, which in this case is INVENTORY (for example, INVENTORY/BOOK/AUTHOR). Each location path, however, describes a collection of three different elements. In XSLT, a collection of elements or other nodes is known as a node-set. For example, the location path INVENTORY/BOOK/AUTHOR describes a node-set consisting of all three AUTHOR elements. The xsl:value-of element uses only the first element in the node-set assigned to its select attribute. The style sheet would thus display the content of only the first BOOK element, as shown here:

One technique for displaying all of the elements or other types of nodes in a node-set is to use the xsl:for-each element, which repeats the output from the elements contained within it, once for each node in a specified node-set. The XSLT style sheet in Listing 12-3 demonstrates this technique. This style sheet is linked to the XML document in Listing 12-4. (You’ll find copies of both listings on the companion CD under the filenames XsltDemo02.xsl and XsltDemo.xml.)

426

XML Step by Step

The Adventures of Huckleberry Finn

Mark Twain

mass market paperback 298 $5.49

The Adventures of Tom Sawyer

Mark Twain

mass market paperback 205 $4.75

The Ambassadors

Henry James

mass market paperback 305 $5.95

The Awakening

Kate Chopin

mass market paperback 195 $4.95

12

Billy Budd

Herman Melville

mass market paperback 195 $4.49

A Connecticut Yankee in King Arthur's Court

Mark Twain

mass market paperback 385 $5.49

Joan of Arc

Mark Twain

trade paperback 465 $6.95

Leaves of Grass

Walt Whitman

hardcover 462 $7.75

The Legend of Sleepy Hollow

427

Using XSLT Style Sheets

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

mass market paperback 256 $4.95

Roughing It

Mark Twain

mass market paperback 324 $5.25

The Scarlet Letter

Nathaniel Hawthorne

trade paperback 253 $4.25

The Turn of the Screw

Henry James

trade paperback 384 $3.35

Listing 12-4.

The template in the style sheet of Listing 12-3 contains the following for-each element:

Title:

12

429

Using XSLT Style Sheets

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

430

XML Step by Step

Author:
Binding type:
Number of pages:
Price:

The xsl:for-each element has two main effects: ■ The browser carries out the instructions within the xsl:for-each element once for every XML element (or other type of node) that is contained in the node-set described by the location path assigned to the xsl:for-each element’s select attribute. In this example, the instructions are repeated once for each BOOK element within the INVENTORY document element. The location path assigned to the xsl:for-each element’s select attribute works just like a location path assigned to the xsl:value-of element’s select attribute, except that you typically assign a location path that describes a node-set that includes more than one node. ■ Each time the browser carries out the instructions within the xsl:foreach element, one of the nodes contained in the node-set assigned to the select attribute becomes the current context node within the scope of the xsl:for-each element. The nodes become the current context node in the order of their appearance in the XML document. In the example style sheet, each BOOK element (within the INVENTORY element, within the XSLT root node) would become the current context node in turn, as shown here:

This template would match a TITLE element that is a child of a PREVIOUS element located anywhere in the document. (Similarly, if you wanted a template that matched only the TITLE element that’s an immediate child of BOOK, you would set its match attribute to the relative location path BOOK/TITLE.)

Using XSLT Style Sheets

If you wanted a template that matched only the TITLE element within PREVIOUS, you could set the match attribute equal to the relative location path PREVIOUS/TITLE, as in this example:

12

The location path in the following start-tag would select the first SHIRT element in the example because one of its COLOR child elements contains “green:”

To select only SHIRT elements in which all child COLOR elements contain “green,” you could use the filter clause given in the following start-tag:

(This location path wouldn’t select either of the SHIRT elements in the example document.) The not is a Boolean operator you can use in a filter clause to reverse the true or false value of the expression in the parentheses that follow the not operator. You can test the value of a specific child element by including a number in square brackets ([]) following the element name, where [1] refers to the first child element with the specified name. For example, the location path in the following start-tag selects SHIRT elements in which the second COLOR child element contains “green:”

(This location path would select the first SHIRT element in the example document.) If a location step yields a node-set containing more than one node (a common situation), you can select an individual node by appending a filter clause that contains just the number of the node you want to select, where [1] selects the first node in the set. For instance, the location path in the following start-tag selects all the child elements of the first BOOK element within INVENTORY:

And the following xsl:value-of element displays the text content of the second BOOK element within INVENTORY. (Without the filter clause, the xsl:value-of element would display the content of the first BOOK element.)

A final type of filter clause contains just an element name. It indicates that the preceding node must have at least one child element with the specified name,

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

445

without regard to the child elements’ content. For example, consider an XML document that contains the following document element:

XML Step by Step companion CD

Visual Basic--Game Programming companion disk

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

447

Listing 12-6.

Using XSLT Style Sheets

Author:
Title:
Binding type:
Number of pages:
Price:

12

Book Inventory

Book Inventory Trade Paperback Books

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

449

Number of pages:
Price:

Listing 12-7.

Both style sheets are designed to be linked to the XML document in Listing 12-4 (XsltDemo.xml). They both use the following select attribute specification to cause the browser to display only the trade paperback books: select=”INVENTORY/BOOK[BINDING=’trade paperback’]”

And they both use the following xsl:sort elements to sort the BOOK elements:

The following is the first part of the output, which is the same for both style sheets:

Using XSLT Style Sheets

These three xsl:sort elements cause the browser to sort the BOOK elements in ascending alphabetical order by the authors’ last names, then in ascending alphabetical order by the authors’ first names, and then in descending numerical order by the numbers of pages in the books (so that a particular author’s books are arranged from longest to shortest).

12

450

XML Step by Step

The style sheet in Listing 12-6 uses an xsl:for-each element to display the multiple BOOK elements. In this style sheet, the select attribute specification is added to the xsl:for-each start-tag, and the xsl:sort elements are inserted at the beginning of the xsl:for-each element’s content. This causes the transformation instructions within xsl:for-each to be applied once for each trade paperback book, in the specified sort order. The style sheet in Listing 12-7 uses an apply-templates element, together with a separate matching template, to display the multiple BOOK elements. In this style sheet, the select attribute specification is added to the xsl:apply-templates starttag, and the xsl:sort elements are placed within the xsl:apply-templates element. This causes the browser to apply a template to each BOOK element that stores a trade paperback book, in the specified sort order. For each of these BOOK elements, the browser finds and applies the matching template defined in the style sheet:

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

451

Books in Stock

Books In Stock

Title	Author	Binding Type	Number of Pages	Price
	(born ) Chapter 12 Displaying XML Documents Using XSLT Style Sheets 453

Listing 12-8. XsltDemo06.xml

The style sheet displays the BOOK elements within an HTML table rather than in a list of SPAN elements as in the previous examples. It displays the value of the Born attribute, following the value of the AUTHOR element, by using the xsl:value-of element. The following elements create the table cell displaying these values:
(born )

Here’s the way the Internet Explorer displays the document:

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

455

Adding an Attribute to a Literal Result Element You can add an attribute with a known, constant value to a literal result element by simply entering the attribute specification into the element starttag that appears in the style sheet. You’ve seen many examples in the style sheets shown in this chapter, such as the STYLE attribute in the following literal result SPAN element:

Using XSLT Style Sheets

If, however, you want to obtain the attribute’s value from the XML file, you need to use the xsl:attribute element. For example, consider an XML document with the following document element, in which the COLORCODE elements indicate the color that should be used for displaying each book’s title:

12

The Ambassadors red

The Awakening

continued

458

XML Step by Step

Collection Inventory Books

CDs

Listing 12-10.

The example style sheet globally declares (in the xsl:stylesheet start-tag) two namespace prefixes that it can use to reference the two namespaces found in the XML document:

The style sheet uses the book namespace prefix to refer to the XML elements that belong to the default namespace named http://www.mjyOnline.com/books, and it uses the cd namespace prefix to refer to the XML elements that belong to the explicit namespace named http://www.mjyOnline.com/cds. (In the style sheet, you can use any namespace prefixes you want, provided that the namespace names match those in the XML document.) Here’s an example of how the style sheet uses the book namespace prefix:

Chapter 12 Displaying XML Documents Using XSLT Style Sheets

459

And here’s an example of how it uses the cd namespace prefix:

Using Conditional Structures This final section introduces the XSLT conditional elements, which allow you to selectively include transformation instructions in your style sheet based on conditions such as the value of an element or attribute. The simplest conditional element is xsl:if, which allows you to include or exclude a block of instructions based on a single test. Assume, for example, that the following template is in a style sheet linked to the XsltDemo06.xml XML document given in Listing 12-9. This template displays a simple list of the book titles. The xsl:if element within the template displays “out of stock!” to the left of the title for any BOOK element whose InStock attribute is set to no:

460

XML Step by Step

titles, indicating the size category of each book by displaying one, two, or three asterisks to the title’s left. (One asterisk indicates a length less than or equal to 300 pages, two asterisks a length greater than 300 but less than or equal to 500, and three asterisks a length greater than 500.)

When these three conditional elements are arranged as in the example, the browser uses the instructions contained in the first xsl:when element with a test attribute that evaluates to true. If none of the xsl:when elements have true test values, it uses the instructions contained in the xsl:otherwise element at the end (if it’s included). In the example, the “instructions” consist of only one to three asterisk characters to be copied to the output.

Appendix

XML Schemas ■ For information on XML schemas as supported by Internet Explorer and MSXML 4.0, see the topic “XML Schemas” in the XML SDK documentation provided by the MSDN Library on the Web at http://msdn.microsoft.com/library/. ■ You’ll find the complete text of the W3C XML Schema specification in the following three pages on the Web: “XML Schema Part 0: Primer” at http://www.w3.org/TR/xmlschema-0/, “XML Schema Part 1: Structures” at http://www.w3.org/TR/xmlschema-1/, and “XML Schema Part 2: Datatypes” at http://www.w3.org/TR/xmlschema-2/.

Cascading Style Sheets (CSS) ■ To learn about the CSS features supported by Internet Explorer, see the topic “Cascading Style Sheets” in the MSDN Library on the Web at http://msdn.microsoft.com/library/. ■ For a list of all the Internet Explorer color names that you can use for assigning colors in a cascading style sheet, see the topic “Color Table” in the MSDN Library on the Web at http://msdn.microsoft.com/library/. ■ The W3C publishes the specification for Cascading Style Sheets Level 1 (CSS1) at http://www.w3.org/TR/REC-CSS1. ■ The W3C publishes the specification for Cascading Style Sheets Level 2 (CSS2) at http://www.w3.org/TR/REC-CSS2. ■ For an introduction to the techniques for using cascading style sheets with HTML, see the topic “2.1 A brief CSS2 tutorial for HTML” in the W3C page “2 Introduction to CSS2” at http://www.w3.org/TR/REC-CSS2/intro.html.

463

464

Appendix

Data Binding and the Data Source Object (DSO) ■ For information on using data binding and the DSO with Internet Explorer, see the topics “Binding the XML Data Source Object to Data,” “Use the C++ XML Data Source Object,” and “Use the Master/Detail Feature with the C++ XML Data Source Object” in the XML SDK documentation provided by the MSDN Library on the Web at http://msdn.microsoft.com/library/.

ActiveX Data Objects (ADO) and the ADO recordset Object ■ For general information on ADO and the ADO recordset object, see the topic “Microsoft ActiveX Data Objects (ADO)” in the MSDN Library on the Web at http://msdn.microsoft.com/library/. ■ The Microsoft home page for information on ADO is located at http://www.microsoft.com/data/ado/.

HTML and Dynamic HTML (DHTML) ■ For information on working with HTML and DHTML as implemented in Internet Explorer, see the topic “HTML and Dynamic HTML” in the MSDN Library on the Web at http://msdn.microsoft.com/library/. ■ To read the official specification for the latest version of HTML 4, see the following Web site, provided by the W3C: http://www.w3.org/TR/html4/.

Microsoft JScript ■ For information on JScript and other Microsoft Web scripting technologies, see the topic “Scripting” in the MSDN Library on the Web at http://msdn.microsoft.com/library/.

467

Index Symbols : (colon) in attribute names, 64 , (comma) in cascading style sheet selectors, 205 / (slash), in cascading style sheets comments, 201 [ ] (square brackets), 96 & (ampersand), 133, 136 < > (angle brackets), 5, 335 * (asterisk), in cascading style sheets comments, 201 @ (at sign) in attribute names, 451 % (percent character), 136

A A HTML element, 6, 328 absolute positioning property, 272–276 Access, 8 AML (Astronomical Markup Language), 18 ampersand (&), 335, 143 ancestor elements, 205, 212 angle brackets (< >), 5, 335 anonymous type definitions, 173 ANY keyword, 100 APPLET HTML element, 328 applications, SGML. See also SGML (Structured Generalized Markup Language) defined, 11 applications, XML. See also SGML (Structured Generalized Markup Language); XML (Extensible Markup Language) defined, 11–12, 14–15 apply-templates element. See xsl: apply-templates element asterisk (*) character, in cascading style sheets comments, 201

Astronomical Markup Language (AML), 18 at signs (@) in attribute names, 451 attribute-list declarations defined, 107 in document type definitions (DTDs), 97 form of, 108–109 illustrated, 108 multiple, 109 Attribute node, 361, 396 attributes adding to elements, 62–68 binding HTML elements to, 344–349 character references in, 155, 157 converting content to, 66–68 declaring in schemas, 169–170, 181–185 declaring in valid XML documents, 107–117 default declaration forms AttValue default declaration form, 117 #FIXED AttValue default declaration form, 117 illustrated, 116 #IMPLIED default declaration form, 116 overview, 116 #REQUIRED default declaration form, 116 enumerated type, 110, 114–116 form of declaration, 108–109 in Inventory Valid.xml example, 127 in Inventory04.xml example, 66–68 naming, 63–64 overview, 62–63 referencing in XSL patterns, 451, 452, 452–453, 454 required, 107, 116 rules for creating, 63–64 rules for values, 65–66 specifying type, 107, 110–117 string type, 110 tokenized type ENTITIES type attribute, 113, 134, 140, 388, 390 ENTITY type attribute, 112, 140, 388, 390, 391 ID type attribute, 111 IDREF type attribute, 111–112 IDREFS type attribute, 112

Index

precedence in, 205, 208, 211–215 properties for backgrounds, 235–245 for borders, 263–267 for boxes, 257–285 for displays, 215–218 inheritance of settings, 203–204, 213 in Inventory01 example, 201–202 for margins, 259–262 overview, 203 for padding, 268–269 for size, 270–272 specifying keyword values, 219 for text alignment, 246–256 for text spacing, 246–256 Raven.css example, 260, 260–261 Raven01.css example, 277, 277 Raven02.css example, 280–281, 281 Raven04.css example, 294, 294–295 rules for defined, 200 example, 200–201 illustrated, 200 multiple, 204 steps for using, 197–211 using to display XML documents, 34–41 versions of, 196 vs. XSL style sheets, 10, 409–410, 411, 421 white space in, 58 case sensitivity in cascading style sheets, 203 in element-type names, 29, 54 in text markup, 54, 136 CDATA sections defined, 55 examples, 56, 87 form of, 86–87 where to place, 88 CDATASection node, 361 CDF (Channel Definition Format), 17 Channel Definition Format (CDF), 17 channels, role of XML, 17 character data adding, 55 defined, 27, 54 encoding and languages, 77–80

469

examples, 53, 55, 62 inserting special characters, 153–156 in mixed element content, 105–107 retrieving, 373–374 in schemas, 184–186 character references defined, 55 example, 55 inserting, 153–156 predefined, 156–157 as type of entity, 150 Chemical Markup Language (CML), 18 Chess Markup Language (ChessML), 18 child elements attributes as, 451 in Book.xml example, 369, 370, 375–376 defined, 52 in element content, 100, 101–105 and filters, 443 in Inventory03.xml example, 62 and property inheritance in cascading style sheets, 203 specifying mixed content, 105–107 childNodes node property, 365, 370, 371, 374, 380 clear property, in cascading style sheets setting, 283–284 specifying CSS keyword values, 284, 284 CML (Chemical Markup Language), 18 Collection Default Valid.xml (Listing 5-1) listing, 118–119, 118–120 Collection Default.xml (Listing 3-5) listing, 74–75, 74–75 collections NamedNodeMap collection objects getNamedItem NamedNodeMap method, 386, 386, 392 item NamedNodeMap method, 386, 386 length NamedNodeMap property, 386, 386 list of methods and properties, 386 nextNode NamedNodeMap method, 386 reset NamedNodeMap method, 386 using,385–386, 386, 392 NodeList collections objects item NodeList method, 374

470

Index

length NodeList property, 374, 378 list of methods and properties, 374 nextNode NodeList method, 374 reset NodeList method, 374 using, 374 vs. NamedNodeMap collection objects, 386 Collection.xml (Listing 3-4) listing, 71–72, 71 colon (:) in attribute names, 64 color background (See background-color property, in cascading style sheets) text (See color property, in cascading style sheets) color property, in cascading style sheets and inheritance, 232 setting, 232–234 specifying color values, 233–234 specifying CSS keyword values, 219 comma (,) in cascading style sheet selectors, 204 Comment node, 361 comments in cascading style sheets, 201 in XML documents defined, 56 examples, 56, 82 form of, 82 inserting, 25, 81–83 in Inventory Valid.xml example, 127 in Inventory.xml example, 25, 25 overview, 25 in Parts.xml example, 47, 49 as type of element content, 56, 62 where to place, 82–83 comparison operators, 442, 442 conditional structures in XSLT, 459–460 content model, 101 court documents (Open XML Court Interface), 17 CSS. See cascading style sheets (CSS) current-record data binding, 323. See also single-record data binding

D data binding and document type definitions, 337–344 how it works, 308 overview, 10, 308 single-record, 303, 322–327 steps for using binding HTML elements to XML elements, 297–298, 303–349 linking XML documents to HTML pages, 297, 299–302 table vx. single-record, 303– 304 (See also table data binding) techniques for non-table HTML elements, 328–336 types of, 303–304 white space in, 58 data islands defined, 299 and Document Object Model (DOM), 359–360, 369 sample forms, 299–302 Data Source Object (DSO) programming model addNew recordset method, 336 cancelUpdate recordset method, 336 delete recordset method, 336 move recordset method, 324 movefirst recordset method, 324 movelast recordset method, 324 movenext recordset method, 324 moveprevious recordset method, 324 overview, 302 and single-record binding, 322 updating cached XML data, 336 using scripts with, 350-356 vs. Document Object Model (DOM), 357 databases and XML, 7–8, 16 DATAFORMATAS HTML attribute, 334–335 DATAPAGESIZE HTML attribute, 309 dataType node property, 365 .dbf files, 8 declaration block, in cascading style sheets, 200–201 declarations in schemas, 167–181

472

Index

DOM. See Document Object Model (DOM) DomDemo Fixed.htm file (Listing 11-3) displaying in Internet Explorer, 369, 369 listing, 367–369 script, 367–369 DomDemo Variable.htm file (Listing 11-4) displaying in Internet Explorer, 376, 377 listing, 376, 377–378 script, 377–378 double quotes, 49, 159. See also quoted strings DSOs. See Data Source Object (DSO) programming model duplicate names, 142 Dynamic HTML (DHTML), 373

E EDT-ML (Electronic Thesis and Dissertation Markup Language), 18 electronic business cards, 18 Electronic Thesis and Dissertation Markup Language (EDT-ML), 18 element content choice form of model, 102–105 overview, 100–101 sequence form of model, 101–102 specifying mixed content, 105–107 types of, 54–56 Element nodes accessing, 370, 375–376, 380–384 defined, 361 getAttribute method, 381 getAttributeNode method, 381 getElementByTagName method, 381 element type declarations declaring, 96, 97, 98–107 examples, 99 form, 99 in schemas, 167–181 specifying content, 100–107 element-type names, 53–54 elements. See also document element adding attributes, 62–68, 183–186 ancestor, 205, 212 in cascading style sheet rules, 200–206 CDATA sections in, 55

character references in, 55, 153–156 comments in, 56, 62 creating types of, 59–62 declaring types, 96, 97, 98–107, 167–181 displaying variable numbers of, 376–380, 422–430 empty, 59, 64, 100 general entity references in, 55, 148–150, 150 naming, 8, 53–54 processing instructions in, 56 pseudo, 285 retrieving character data, 375–376 type names, 53–54 types of content, 54–56 white space in, 56–58 empty elements, 59, 63, 100 EMPTY keyword, 100 encoding languages and characters, 77–79 end-tags, 5, 27, 28–29, 53 entities accessing, 388–392 adding to Valid.xml example, 157–162 declaring, 97, 135–147 external, defined, 133–135 as files, 132 general, defined, 133–135 general external parsed as available type of entity, 134 declaring, 138–139 example, 138–139 naming, 138 referencing, 148, 149, 152–153 specifying system literal, 138 specifying URL, 138 syntax, 138 general external unparsed as available entity type, 134 declaring, 139–141 examples, 142, 152–153 Inventory Entity example, 388–392 naming, 139 referencing, 148, 150 specifying notation, 141 specifying system literal, 139, 392 syntax, 139

Index

internal, defined, 133–135 naming, 135 overview, 131, 132–133 parameter, defined, 133–135 parameter external parsed as available type of entity, 137 declaring, 145–147 example, 146–147 naming, 145 referencing, 148, 150 specifying system literal, 145 specifying URL, 145 syntax, 145 when to use, 146 parsed, defined, 133–135 as quoted strings, 132 referencing, 148–153 types of, 133–135 general vs. parameter, 133–135 internal vs. external, 133–135 parsed vs. unparsed, 133–135, 141 unparsed, defined, 133–135 uses for, 132–133 entities DocumentType property, 391 ENTITIES type attribute, 113, 133–135, 140, 388, 390 Entity node, 361, 391 entity references illustrated, 55 inserting, 148–153 predefined, 156–157 ENTITY type attribute, 112–113, 135, 140, 390–391 enumerated type attributes, 110, 114–116 EOF DSO recordset property, 325 equations, mathematical, 6 errors, XML, catching in Internet Explorer, 31–33 events, defined, 302 extensible, defined, 7 Extensible Forms Description Language (XFDL), 17 Extensible Markup Language. See XML (Extensible Markup Language) Extensible Stylesheet Language. See XSLT (Extensible Stylesheet Language Transformations)

473

external DTD subsets, 120–125, 147–149 external defined, 133–135 general, parsed as availlable type of entity, 134 declaring, 138–139 example, 138–139 naming, 138 referencing, 148–153, 149 specifying system literal, 138 specifying URL, 138 syntax, 138 general, unparsed as available type of entity, 134 declaring, 139–141 examples, 142, 152–153 Inventory Entity example, 388–392 naming, 139 referencing, 148, 150 specifying notation, 141 specifying system literal, 139, 392 syntax, 139 parameter, parsed as available type of entity, 134 declaring, 145–147 example, 146–147 naming, 145 referencing, 148, 150 specifying system literal, 145 specifying URL, 145 syntax, 145 when to use, 146 parsed vs. unparsed, 133–135, 141

F Federal Express, 18 filtering XML data referencing attributes in, 451 using XSLT style sheets, 440–445 XsltDemo04.xsl example, 447–448, 450–452 XsltDemo05.xsl example, 448–449, 450–451 XsltDemo06.xsl example, 452, 452–453, 454 FindBooks script, 352–355 firstChild node property, 365, 377, 380 firstPage TABLE element method, 309

474

Index

#FIXED AttValue default declaration form, 117 float property, in cascading style sheets and block elements, 259, 277 creating margin notes, 277–279 displaying floating images, 280–282 setting, 276–283 specifying CSS keyword values, 277 font-family property, in cascading style sheets, 213, 222–224 font-size property, in cascading style sheets absolute vs. relative size values, 228 example, 201–202 and inheritance, 203 setting, 224–229 specifying percentage values, 227 specifying size values, 227 font-style property, in cascading style sheets, 202, 214, 229 font-variant property, in cascading style sheets, 231 font-weight property, in cascading style sheets, 202, 229–231 fonts, setting CSS properties, 221–231 for-each element. See xsl:for-each element FRAME HTML element, 329 functions in XSLT style sheets, 432

G GedML (Genealogical Data in XML), 18 Genealogical Data in XML (GedML), 18 general entities adding to Valid.xml example document, 157–162 as available types of entities, 134 declaring, 135–142 defined, 133–135 external parsed as available type of entity, 134 declaring, 138–139 example, 138–139 naming, 138 referencing, 148, 149, 152–153 specifying system literal, 138 specifying URL, 138 syntax, 138

external unparsed as available entity type, 134 declaring, 139–141 examples, 142, 152–153 Inventory Entity example, 388–392 naming, 139 referencing, 148, 150 specifying notation, 141 specifying system literal, 139, 392 syntax, 139 internal parsed assigning values, 136–137 as available type of entity, 134 declaring, 135–137 examples, 137, 152–153 naming, 135 referencing, 148, 149, 152–153 syntax, 136 referencing, 55, 148, 149–150, 152–153 Geography Markup Language (GML), 18 getAttribute Element node method, 381 getAttributeNode Element node method, 381 getElementByTagName as Document node method, 372, 380, 382 as Element node method, 381 GetElements.htm file (Listing 11-5) displaying in Internet Explorer, 382, 382 listing, 383–384 getNamedItem NamedNodeMap method, 386, 387, 391, 392

H HEAD HTML element, 6 height property, in cascading style sheets setting, 270–272 specifying values, 271–272 hierarchically structured documents, 7, 8–9, 16 HRMML (Human Resource Management Markup Language), 17 .htm files DomDemo Fixed.htm file displaying in Internet Explorer, 369, 369 listing, 367–369 DomDemo Variable.htm file displaying in Internet Explorer, 376, 377

Index

listing, 376, 377–378 script, 377–378 GetElements.htm file displaying in Internet Explorer, 382, 382 listing, 383–384 Inventory Attribute.htm file displaying in Internet Explorer, 348, 348 listing, 347, 347–348, 348–349 Inventory Big Table.htm file displaying in Internet Explorer, 310, 310 listing, 313–314 Inventory Entity.htm file listing, 388, 389–390 script, 391–392 Inventory Find.htm file displaying in Internet Explorer, 352, 352 FindBooks script in, 352–355 listing, 350, 350–352, 352 Inventory Hierarchy Valid.htm file creating from Inventory Hierarchy.htm file, 342 displaying in Internet Explorer, 344, 344 listing, 342, 342–343, 344 Inventory Hierarchy.htm file displaying in Internet Explorer, 321, 321 listing, 318, 318–319 making changes to, 342 Inventory Single.htm file displaying in Internet Explorer, 327, 327 listing, 326, 326–327 Inventory Table.htm file displaying in Internet Explorer, 308, 308 listing, 306, 306–307 ShowNodes.htm file displaying in Internet Explorer, 397, 397 listing, 393, 393–394, 394–397 Validity Test DTD.htm file, 398–401, 4–401 Validity Test Schema.htm, 402–404, 405–406, 406–407 HTML. See Hypertext Markup Language (HTML) Human Resource Management Markup Language (HRMML), 17 Hypertext Markup Language (HTML) angle brackets () in cascading style sheets, 5 binding elements to XML attributes, 344–349

475

binding elements to XML elements, 10 binding elements to XML fields, 328, 330–335 BUTTON element, 324, 329 creating pages that display XML documents one at a time, 325–327 DATAFORMATAS attribute, 334–335 DATAPAGESIZE attribute, 309 displaying documents in Internet Explorer, 5 elements of, 4–7 inserting elements into XML documents, 286–287 limitations of, 6–7 linking XML documents to HTML pages, 297, 299–303, 359–360 lists of elements, 6, 328, 328–330 ONCLICK attribute, 324, 325 relationship to SGML, 11 relationship to XML, 3–4, 7, 12 rendering markup contained in XML fields, 328, 336–337 as SGML application, 11 and single-record data binding, 322–327 SPAN element, 307, 330, 334, 342–343, 373 start-tags and end-tags, 5 TABLE element, 303, 307, 308–310 firstPage method, 309 lastPage method, 309 nextPage method, 309 previousPage method, 309

I ID type attribute, 111 IDREF type attribute, 111–112 IDREFS type attribute, 112 IFRAME HTML element, 329 IGNORE keyword, 124–125 iLingo, 17 IMG HTML element, 329, 331, 331–333 #IMPLIED default declaration form, 116 @import directive defined, 208 and order of precedence, 208, 214 specifying URL value, 208–209 importing style sheets, 208, 214

476

Index

INCLUDE keyword, 124–125 inherited properties for text spacing and alignment, 246 vs. noninherited properties in cascading style sheets, 203–204, 213 innerText HTML property, 373, 395 INPUT HTML element, 329 INPUT TYPE=BUTTON HTML element, 329 INPUT TYPE=HIDDEN HTML element, 329 INPUT TYPE=PASSWORD HTML element, 329 INPUT TYPE=RADIO HTML element, 329 INPUT TYPE=TEXT HTML element, 329, 336 instance document, 186, 190–192 instructions. See processing instructions insurance-related data, exchanging, 17 internal DTD subsets, 122–124 internal entities defined, 133–135 general, parsed assigning values, 136–137 as available type of entity, 134 examples, 136–137, 152–153 naming, 135 referencing, 148, 150, 152–153 syntax, 136 parameter, parsed assigning values, 144 as available type of entity, 134 declaring, 143–145 example, 144–145 naming, 143–144 referencing, 148, 149 syntax, 143–147 Internet Explorer catching XML errors, 31–33 default XSL style sheet, 33 displaying Book.htm document, 322–323, 321 displaying DomDemo Fixed.htm document, 376, 377 displaying DomDemo Variable.htm document, 374, 375 displaying GetElements.htm document, 41, 41

displaying HTML documents, 5 displaying Inventory Attribute.htm document, 347, 347 displaying Inventory Big Table.htm document, 310, 310 displaying Inventory Big.xml document, 310, 310 displaying Inventory Find.htm document, 350, 350 displaying Inventory Hierarchy Valid.htm document, 342–343 displaying Inventory Hierarchy.htm document, 318–319 displaying Inventory Image.htm document, 331–332 displaying Inventory Single.htm document, 325–327 displaying Inventory Table.htm document, 306–307 displaying Inventory.htm document, 304–306, 376–377 displaying Inventory01.xml document, 37, 37, 198–200 displaying Inventory02.xml document, 41, 41 displaying Inventory03.xml document, 61, 61 displaying Inventory04.xml document, 66, 66–68 displaying Leaves.xml document, 238 displaying Raven.xml document, 261–262 displaying Raven01.xml document, 280, 280 displaying Raven02.xml document, 283, 283, 285, 285 displaying Raven03.xml document, 289 displaying Raven04.xml document, 294, 294 displaying ShowNodes.htm document, 397, 397 displaying XML documents, 10, 12, 26, 29–41 with cascading style sheets, 34–41, 37, 41, 202, 202 overview, 29–30 without cascading style sheets, 30, 30–31 displaying XsltDemo.xml document, 446, 446

Index

displaying XsltDemo01.xml document, 414, 414 displaying XsltDemo06.xml document, 455, 455 order of precedence for processing cascading style sheets rules, 214–215 and XML comment text, 82–83 XML error-checking feature, 31–33 and XML processing instructions, 84 and XML processor, 307 Inventory Attribute.htm file (Listing 8-14) listing, 346–347 Inventory Attribute.htm file (Listing 10-12) displaying in Internet Explorer, 348, 348 listing, 345, 347, 345–348 Inventory Attributes.xml file (Listing 11-6), 383–384, 385 Inventory Attributes.xml file (Listing 11-6), 385 Inventory Big Table.htm file (Listing 10-4) displaying in Internet Explorer, 310, 310 listing, 313–314, 313–314 Inventory Big.xml file (Listing 10-3) displaying in Internet Explorer, 308 Inventory Big.xml file (Listing 10-3) displaying in Internet Explorer, 308, 308 listing, 311–313, 311–313 Inventory DOM.xml file (Listing 11-1) hierarchical DOM organization, 363, 363 listing, 364, 364 and ShowNodes.htm example, 396, 397 Inventory Entity.htm file (Listing 11-8) listing, 388–390, 388–390 script, 391–392 Inventory Entity.xml file (Listing 11-7), 388, 388–389 Inventory Find.htm file (Listing 10-13) displaying in Internet Explorer, 350, 350 FindBooks script in, 352–356 listing, 350–352, 350–352 Inventory Hierarchy Valid.htm file (Listing 10-11) creating from Inventory Hierarchy.htm, 339–340 displaying in Internet Explorer, 342, 342 listing, 338–341, 338–341

477

Inventory Hierarchy Valid.xml file (Listing 10-10) creating from Hierarchy.xml, 337–338 listing, 342–343, 342–343 Inventory Hierarchy.htm file (Listing 10-6) listing, 318–319, 318–319 making changes to, 337–342 Inventory Hierarchy.xml file (Listing 10-5) listing, 315–318, 315–318 making changes to, 335–337 Inventory Image.htm file (Listing 10-9) displaying in Internet Explorer, 333, 333 listing, 332–333, 332–333 Inventory Image.xml file (Listing 10-8) listing, 331–332, 331–332 Inventory Instance.xml (Listing 7-4) listing, 190–192, 190–192 Inventory Schema.xsd (Listing 7-3) listing, 187–189, 187–189 Inventory Single.htm file (Listing 10-7) displaying in Internet Explorer, 327, 327 listing, 326–327, 326–327 Inventory Table.htm file (Listing 10-2) displaying in Internet Explorer, 308, 308 listing, 306–307, 306–307 Inventory Valid Entity.xml file (Listing 6-1) creating from Inventory Valid.xml file, 157–158 listing, 160–162, 160–162 and ShowNodes.htm example, 393–397 Inventory Valid.xml file (Listing 5-1) making changes to, 157–158 Inventory Valid.xml file (Listing 5-2) creating from Inventory.xml file, 125–129 listing, 127–129, 127–129 Inventory01.css file Listing 2-2, 34, 34 Listing 7-1, 201, 201 Inventory02.css file (Listing 2-4), 38, 38, 67 Inventory.xml file displaying in Internet Explorer, 30, 30, 308, 308, 377, 377 and Document Object Model (DOM), 376–380 Listing 2-1, 23–24, 23–24 Listing 8-1, 302

478

Index

Listing 10-1, 304–307, 304–307 Inventory01.xml file displaying in Internet Explorer, 37, 37, 202, 202, 207, 207 Inventory02.xml file (Listing 2-5) displaying in Internet Explorer, 40–41, 41 listing, 39–40, 39–40 Inventory03.xml file (Listing 3-2) displaying in Internet Explorer, 61, 61 listing, 59–61, 60–61 Inventory04.xml file (Listing 3-3) displaying in Internet Explorer, 68, 68 listing, 66–67, 67, 67 ISO/IEC 10646 character set, 153–154 item NamedNodeMap method, 386, 386 item NodeList method, 374

J JavaScript, 10. See also script code JScript, 325. See also script code

L LABEL HTML element, 329 languages, and encoding, 77–80 lastChild node property, 365, 380 lastPage TABLE element method, 309 Leaves.css file (Listing 8-3), 236, 236 Leaves.xml file (Listing 8-4) displaying in Internet Explorer, 238, 238 listing, 236, 236–237 legal documents (Open XML Court Interface), 17 length NamedNodeMap property, 386, 386 length NodeList property, 374, 378 length Text node property, 376 letter-spacing property, in cascading style sheets inheritance, 246 setting, 246–247 line-height property, in cascading style sheets inheritance, 246 setting, 254–255 specifying size values, 254–255 lists, bulleted and numbered, 219–221 literal result elements, 455–456

literals as attribute values, 65–66, 137 as entity values, 136, 139, 141–142, 144 quote marks as delimiters, 49 location path expression in XSLT style sheets, 437–438 lower case. See case sensitivity

M margin-bottom property, in cascading style sheets Raven.xml example, 260–262 setting, 259–262 specifying percentage values, 259–260 specifying size values, 227, 259 margin-left property, in cascading style sheets Raven.xml example, 260–262 setting, 259–262 specifying percentage values, 259–260 specifying size values, 227, 259 margin note, creating, 277–279 margin properties, in cascading style sheets, 257, 259–262 margin-right property, in cascading style sheets Raven.xml example, 260–262 setting, 259–262 specifying percentage values, 259–260 specifying size values, 227, 259 margin-top property, in cascading style sheets example, 202 and inheritance, 203 Raven.xml example, 260–262 setting, 259–262 specifying percentage values, 259–260 specifying size values, 227, 259 markup, defined, 27 markup declarations overview, 96 and parameter entities, 144, 145 and standalone document declarations, 49, 159 types of, 96–97 markup languages, 3. See also Hypertext Markup Language (HTML); SGML (Structured Generalized Markup

Index

Language); XML (Extensible Markup Language) MARQUEE HTML element, 329 match XSL attribute, 415, 433–435, 438 mathematical equations, 6, 18 Mathematical Markup Language (MathML), 18 MathML (Mathematical Markup Language), 18 .mdb files, 8 methods defined, 300 Document nodes getElementsByTagName method, 373, 380–383 nodeFromID method, 373 DSO programming model addNew recordset method, 336 cancelUpdate recordset method, 336 delete recordset method, 336 move method, 324 movenext method, 324, 324 moveprevious method, 324, 324 Element nodes getAttribute method, 381 getAttributeNode method, 381 getElementsByTagName method, 381 HTML TABLE elements firstPage method, 309 lastPage method, 309 nextPage method, 309 previousPage method, 309 NamedNodeMap collection objects getNamedItem NamedNodeMap method, 386, 391, 392 item NamedNodeMap method, 386, 386 list of methods and properties, 386 nextNode NamedNodeMap method, 386 reset NamedNodeMap method, 386 NodeList collection objects item NodeList method, 374 list of methods and properties, 374 nextNode NodeList method, 374 reset NodeList method, 374 substringData Text node method, 376 Microsoft Access, 6, 8, 16 Microsoft JScript, 325. See also script code Microsoft Visual Studio applications, 21

479

minimalist XML documents, 51 move DSO recordset method, 324 movefirst DSO recordset method, 324 movelast DSO recordset method, 324 movenext DSO recordset method, 324 moveprevious DSO recordset method, 324 multiple attribute-list declarations, 109 multiple cascading style sheets, 211 multiple rules, 204 multiple XSL templates, 432–435 Music XML, 18 musical scores, 6, 7

N name tokens, 113–114 NamedNodeMap collection objects getNamedItem NamedNodeMap method, 386, 386, 391–392 item NamedNodeMap method, 386, 386 length NamedNodeMap method, 386, 386 list of methods and properties, 386 nextNode NamedNodeMap method, 386 reset NamedNodeMap method, 386 using, 385–386, 386, 391–392 namespaces and colon in attribute names, 64 referencing, 206 using, 69–77, 117–120, 286–287 xsl: namespace designation, 415 in XSLT, 457–459 naming attributes, 63–64 duplicate names, 142 element types, 8, 53–54 entities, 137 named type definitions in schemas, 173 notation, 141 nested elements, 27, 28, 50–52, 54, 315–322 news and information, exchanging, 18 News Markup Language (NML), 18 nextNode NamedNodeMap method, 386 nextNode NodeList method, 374 nextPage TABLE element method, 309 nextSibling node property, 365, 380 NML (News Markup Language), 18

480

Index

NMTOKEN type attribute, 113 NMTOKENS type attribute, 113–114 nodeFrom ID Document node method, 373 NodeList collection objects item NodeList method, 374 length NodeList property, 374, 376 list of methods and properties, 374 nextNode NodeList method, 374 reset NodeList method, 374 using, 374 vs. NamedNodeMap collection objects, 386 nodeName node property, 360–362, 365, 387 nodes defined, 360 methods, 366 (See also NodeList collection objects) name characteristics, 362 obtaining names, 362 obtaining values, 362 organizing in XML documents, 362–363 as programming objects, 364 properties attributes node property, 365, 382, 385 childNodes node property, 365, 371, 374, 380 dataType node property, 365 firstChild node property, 365, 375, 380 lastChild node property, 365, 380 nextSibling node property, 365, 380 nodeName node property, 365, 360–362, 387 nodeTypeString node property, 365, 397 nodeValue node property, 360-362, 365, 375, 387 ownerDocument node property, 365 parentNode node property, 365, 380 previousSibling node property, 365, 380 text node property, 365, 371, 375–376 xml node property, 365 ShowNodes.htm example, 393–398 types of, 360, 360–361, 362 nodeType node property, 365 nodeTypeString node property, 365, 397 nodeValue node property, 360–361, 365, 387 notation declaring, 141–142

in Inventory Entity.htm example, 389–390 naming, 141 overview, 141 specifying system literal, 141–142 specifying URL, 145 NOTATION keyword, 115 Notation node, 361, 392 notations Document Type property, 392 numbered lists, creating, 219–221

O OFX (Open Financial Exchange), 17 OMF (Weather Observation Markup Format), 18 ONCLICK HTML attribute, 324 Open Financial Exchange (OFX), 17 Open Software Description (OSD), 17 Open XML Court Interface (OXCI), 17 opening XML documents in Internet Explorer, 10, 12, 26, 29–41, 210–211 OSD (Open Software Description), 17 ownerDocument node property, 365 OXCI (Open XML Court Interface), 17

P package tracking, 18 padding-bottom property, in cascading style sheets setting, 268–269 specifying percentage values, 268–269 specifying size values, 268–269 padding-left property, in cascading style sheets setting, 268–269 specifying percentage values, 268–269 specifying size values, 268–269 padding properties, in cascading style sheets, 257, 268–269 padding-right properties, in cascading style sheets setting, 268–269 specifying percentage values, 268–269 specifying size values, 268–269 padding-top properties, in cascading style sheets

Index

setting, 268–269 specifying percentage values, 268–269 specifying size values, 268–269 paging, 309–315 parameter entities as available types of entities, 134 declaring, 143–147 defined, 133–135 external parsed as available type of entity, 134 declaring, 145–147 naming, 145 referencing, 148, 150 specifying system literal, 145 specifying URL, 145 syntax, 145 when to use, 146 internal parsed assigning values, 144 as available type of entity, 134 declaring, 143–145 example, 145 naming, 143–144 referencing, 148, 150 syntax, 143 locations, 151 parent elements, 52. See also child elements parentNode node property, 365, 380 parsed entities defined, 133–135 general, external as available type of entity, 134 declaring, 137–139 example, 138–139 naming, 137 referencing, 148, 149, 152–153 specifying system literal, 137 specifying URL, 137 syntax, 137 general, internal assigning values, 136–137 as available type of entity, 134 character references in, 156 declaring, 135–137 examples, 137, 152–153 naming, 135

481

referencing, 148, 149, 152–153 syntax, 136 parameter, external as available type of entity, 134 declaring, 145–147 example, 146–147 naming, 145 referencing, 148, 149 specifying system literal, 145 specifying URL, 145 syntax, 145 when to use, 146 parameter, internal assigning values, 143 as available type of entity, 134 declaring, 143–145 example, 145 referencing, 148, 149 syntax, 143 vs. unparsed entities, 133–135, 141 parseError property, 366, 399–401 parser, XML, 31–33, 55 Parts.xml file (Listing 3-1) listing, 47–49, 47–49 well-formed document example, 46, 49 path operators and filtering, 438, 440 overview, 419 and sorting, 443, 445–446 patterns, in XSL templates and filters, 440–445 overview, 413 root of document, 412–415 and select attribute, 418, 428 and sorting, 445–446 percent character (%), 136 positioning properties, in cascading style sheets, 250, 258–259, 272–273, 272–276 precedence, in cascading style sheets, 205, 207–209, 211–214 predefined entity references, 156–157, 335 previousPage TABLE element method, 309 previousSibling node property, 365, 380 Printing Industry Markup Language (PrintML), 17

482

Index

PrintML (Printing Industry Markup Language), 17 processing instructions defined, 26–27, 56 form of, 84 overview, 83 Parts.xml example, 47, 49 as type of element content, 56 uses for, 84–85 where to place, 85–86 xml-stylesheet, 209–211 ProcessingInstruction node, 361, 397 processor. See XML processor prologs adding document type declarations, 95–99 adding xml-stylesheet processing instructions, 209–210, 211, 214, 411 Inventory.xml example, 23–25, 26 overview, 25–26, 26 Parts.xml example, 47 properties in cascading style sheets background-color property, 203, 232–234, 235 background-image property, 209, 235–236, 238 background-position property, 203, 235, 241–245 background-repeat property, 203, 235, 239–243 border-color property, 233, 263, 267 border-style property, 219, 263–265 border-width property, 219, 263–266 box properties, 203, 251, 257–259, 268 clear property, 283–284 color property, 219, 232–235 defined, 200–201 display property, 203, 215–218 float property, 276–283 font-family property, 213, 222–224 font-size property, 224 font-style property, 229 font-variant property, 231 font-weight property, 229 height property, 270 illustrated, 200, 202

inherited vs. noninherited, 203–204, 213 letter-spacing property, 246–247 line-height property, 246, 254–255 margin-bottom property, 259–262 margin-left property, 259–262 margin-right property, 259–262 margin-top property, 202, 203, 259–262 overview, 201–202, 202 padding-bottom property, 268–269 padding-left property, 268–269 padding-right property, 268–269 padding-top property, 268–269 positioning properties, 272–276 specifying keyword values, 219 text-align property, 246, 250–252 text-decoration property, 246, 255–256 text-indent property, 246, 253–254 text-transform property, 246, 255 vertical-align property, 203, 246, 248–250 width property, 270–272 defined, 302 DocumentType node entities Document Type property, 291 notations Document Type property, 292 nodes, 360–367 attributes node property, 365, 384, 385 childNodes node property, 365, 371, 374, 380 dataType node property, 365 firstChild node property, 365, 375, 380 lastChild node property, 365, 380 nextSibling node property, 365, 380 nodeName node property, 362, 365, 387 nodeType node property, 365 nodeTypeString node property, 365, 397 nodeValue node property, 362, 365, 375, 387 ownerDocument node property, 365 parentNode node property, 365, 380 previousSibling node property, 365, 380 text node property, 366, 371, 375–376 xml node property, 364 and STYLE attribute, 206–207, 212 psudo-elements, 285

Index

Q quoted strings as attribute values, 65–66 as entity values, 132, 136, inserting quotes using character reference, 159 quote marks as delimiters, 49

R Raven01.css file (Listing 9-3), 277 Raven01.css file (Listing 9-4), 278 Raven02.css file (Listing 9-5), 280–282 Raven04.css file (Listing 9-9), 294–295 Raven.css file (Listing 9-1), 260, 260 Raven01.xml file (Listing 9-4) creating from Raven.xml file, 278 displaying in Internet Explorer, 279, 279 listing, 278–279, 278–279 Raven02.xml file (Listing 9-6) creating from Raven.xml file, 280–283 displaying in Internet Explorer, 283, 283, 284, 284 listing, 280–282 Raven03.xml file (Listing 9-7) displaying in Internet Explorer, 289 listing, 289–290, 289–290 Raven04.xml file (Listing 9-8) displaying in Internet Explorer, 294, 294 listing, 291–293, 291–293 Raven.xml file (Listing 9-2) displaying in Internet Explorer, 261, 262 listing, 260–261 testing for validity, 398 Raven.xml file (Listing 9-4) making changes to, 278 Real Estate Transaction Markup Language (RETML), 17 recordsets DSO overview, 298 and Inventory Single.htm example, 325–326 methods and properties, 321, 321–322, 323, 334, 334 sample script, 350–356

483

storing XML data as, 300–301 using nested table to display, 315–318 using simple table to display, 303–312 recursive function calls, 397 references. See character references; entity references relative positioning property, 272–276 #REQUIRED default declaration form, 116 reset NamedNodeMap method, 386 reset NodeList method, 374 root element, 8, 27. See also document element rules for cascading style sheets, 200–206 applying to multiple elements, 204 defined, 200 illustrated, 200 multiple elements in, 204 precedence among, 205, 208, 211–215 and imported style sheets, 208, 214

S saving documents, 22, 23, 39 script code in Book.htm file, 367–369 and Data Source Objects, 350–356 in DomDemo Variables.htm file, 375–376, 376–377 FindBooks script, 350–354 in Inventory Entity.htm file, 388–391 in Inventory Find.htm file, 382–383 overview, 10 ShowElements script, 380, 381 in ShowNodes.htm file, 393–396 in Validity Test DTD.htm file, 398 SELECT HTML element, 328 select XSL attribute xsl:apply-templates element, 432, 433, 449 xsl:for-each element, 429–430, 433, 450–451 xsl:value-of element, 418–419, 421, 433 selector, in cascading style sheets contextual, 205 defined, 200 illustrated, 200 SGML (Structured Generalized Markup Language)

Index

inheritance, 246 setting, 250–253 specifying CSS keyword values, 249–250 text editors, 21, 22 text-indent property, in cascading style sheets inheritance, 246 setting, 253–254 specifying size values, 253–254 text node property, 360-361, 369, 370–373 Text nodes defined, 360 length property, 374 role in retrieving character data, 375, 375–376 substring Data method, 376 text-transform property, in cascading style sheets inheritance, 251 setting, 255 specifying CSS keyword values, 255 Theological Markup Language (ThML), 18 ThML (Theological Markup Language), 18 TITLE HTML element, 6 tokenized type attributes ENTITIES type attribute, 113, 133–135, 140, 388, 390 ENTITY type attribute, 112, 133–135, 140, 388, 390, 391 ID type attribute, 111 IDREF type attribute, 111–112 IDREFS type attribute, 112 NMTOKEN type attribute, 113 NMTOKENS type attribute, 113–114 specifying, 110–111 Topics.xml external file example, 132, 138–139 treelike structure. See hierarchically structured documents type definitions, in schemas, 168–181 anonymous type, 173 complex type, 175–176 named type, 173 simple type, 168–172 TYPE=CHECKBOOK HTML element, 331

485

U uniform resource identifiers (URIs) in general external parsed entities, 137 in general external unparsed entities, 142 in parameter external parsed entities, 145 specifying, 122 Uniform Resource Locators (URLs) definition, 73 and imported style sheets, 208–209 partial, 209, 210 relative, 209, 210 specifying values, 209 in xml-stylesheet processing instruction, 209–211, 411 Uniform Resource Names (URNs), 73 unparsed entities defined, 133–135 using, 386–391 vs. parsed entities, 133–135, 141 upper case. See case sensitivity URIs. See uniform resource identifiers (URIs) URLs. See Uniform Resource Locators (URLs) URNs. See Uniform Resource Names (URNs)

V valid XML documents advantages of, 93–94 converting well-formed documents, 125–129 criteria for, 92 testing for validity, 398–407 vs. well-formed XML documents, 45, 91–92, 125–129 Validity Test DTD.htm file (Listing 11-10), 398, 400, 400–401 Validity Test Schema.htm (Listing 11-11), 402–404, 405–406 validity testing, 398–407 value-of element. See xsl:value-of element VBScript, 10 Vector Markup Language (VML), 17 version numbers, XML, 25 vertical-align property, in cascading style sheets inheritance, 203, 246

488

Index

Raven03.xml file displaying in Internet Explorer, 289 listing, 289–290, 289–290 Raven04.xml file displaying in Internet Explorer, 291, 291 listing, 292–293, 292–293 Topics.xml external file example, 132, 138–139 XsltDemo.xml file displaying in Internet Explorer, 431 listing, 425–429, 429 and XsltDemo04.xsl example, 446–447, 446–447 and XsltDemo05.xsl example, 448–449 XsltDemo01.xml file displaying in Internet Explorer, 414, 414 XsltDemo06.xml file displaying in Internet Explorer, 455, 455 listing, 451–452, 451–452 XML for Publishers and Printers (XPP), 17 XML HTML element. See data islands XML Linking Language (XLink), 19 xml node property, 366 XML parser, 31–33, 55 XML Path Language (XPath), 410 XML Pointer Language (XPointer), 19 XML processor, defined, 26 XML Schema, 19, 163–192 basics, 165–167 creating an XML schema, 186–191 declaring elements, 167–181 defined, 19 and validity testing, 402–407 xml-stylesheet processing instruction, 209–211, 214, 411 XPointer (XML Pointer Language), 19 XPP (XML for Publishers and Printers), 17 .xsd files Book Schema.xsd file listing, 165, 165 Inventory Schema.xsd listing, 186–187, 186–188 xsl:apply-templates element examples, 446, 448–449 and select attribute, 432 and sort attribute, 445–446

using, 432–435 xsl:for-each element examples, 446–449 and select attribute, 432 and sort attribute, 445–446 using, 424, 430–431 xsl:stylesheet element, 413 processing instructions, 409–410 XSLT (Extensible Stylesheet Language Transformations) cascading vs. XSLT style sheets, 10, 12, 407–408, 409, 410, 418 comparison operators, 440–442, 459 conditional structures, 409, 459 creating style sheets, 411 default Internet Explorer style sheet, 33, 416, 417 defined, 19 linking style sheets to XML documents, 9 linking to XML documents, 409–410 namespaces, 462 overview, 407–408 templates in Internet Explorer, 414–416 version 1.0 recommendation, 410 XSLT style sheets, white space in, 58 XsltDemo06.xml file (Listing 12-9) displaying in Internet Explorer, 454, 455 listing, 453–454 XsltDemo01.xsl file (Listing 12-1), 413, 413, 414, 418 XsltDemo02.xsl file (Listing 12-3), 424. 425, 432, 436 XsltDemo03.xsl file (Listing 12-5), 432 XsltDemo04.xsl file (Listing 12-6),446 XsltDemo05.xsl file (Listing 12-7), 446, 448 XsltDemo06.xsl file (Listing 12-8), 452, 452–453, 459 XsltDemo.xml file (Listing 12-4) displaying in Internet Explorer, 450 listing, 425–429 and XsltDemo04.xsl example, 446, 450 and XsltDemo05.xsl example, 446, 448 xsl:template element, 415, 417, 435, 439 xsl:value-of element, 420–421, 424, 432, 444, 451

Biography Michael J. Young has been writing books for computer users and developers since 1986. His more than two dozen titles include Visual Basic—Game Programming for Windows, the best-selling Running Microsoft Office series, and Microsoft Office XP Inside Out, all from Microsoft Press. (He wrote the Office books together with Michael Halvorson.) His developer books have covered MS-DOS, Windows, C, C++, Visual Basic, Java, XML, animation, game, and graphics programming. He is the author of the popular first edition of XML Step by Step, which won the top award, “Distinguished Technical Communication,” in the 2000-2001 International Technical Publications Competition of the Society for Technical Communication. Books planned for the future will center on the use and programming of Microsoft Office, as well as XML, Java, and other Web publishing topics. Michael graduated from Stanford University with a degree in philosophy. He later studied computer science at several California colleges, completing coursework through the first year of graduate studies. Currently, he lives and works in Taos, New Mexico. You can contact him and find out more about what he does through his Web site at http://www.mjyOnline.com.