Lady’s Gitweb - Shushe/blob - README.markdown

   1 <!--
   2 SPDX-FileCopyrightText: 2024, 2025, 2026 Lady <https://www.ladys.computer/about/#lady>
   3 SPDX-License-Identifier: CC0-1.0
   4 -->
   5 # ⛩📰 书社
   6
   7 <b>A make·file for X·M·L.</b>
   8
   9 <dfn>⛩📰 书社</dfn> aims to make it easy to generate websites with
  10   X·S·L·T and G·N·U Make.
  11 It is consequently only a good choice for people who like X·S·L·T and
  12   G·N·U Make and wish it were easier to make websites with them.
  13
  14 It makes things easier by :⁠—
  15
  16 - Automatically identifying source files and characterizing them by
  17     type (X·M·L, text, or asset).
  18
  19 - Parsing supported text types into X·M·L trees.
  20
  21 - Enabling easy inclusion of source files within each other.
  22
  23 It aims to do this with zero dependencies beyond the programs already
  24   installed on your computer†.
  25
  26 † Assuming an operating system with a fairly featureful, and
  27   Posix‐compliant, development setup (e·g, Macintosh ≥ version 10.8).
  28 In fact, on Linux you will probably need to install a few programs:
  29   `libxml2-utils`, `xsltproc`, `sharutils`, and `pax`.
  30
  31 ## Nomenclature
  32
  33 <i lang="cmn-Hans">书社</i> is a Chinese word meaning “publishing
  34   house”.
  35
  36 The first character, <i lang="cmn-Hans">书</i>, is the simplified form
  37   of “document”.
  38
  39 The second character, <i lang="cmn-Hans">社</i>, contemporarily means
  40   “association”, but historically referred to the god of the soil and
  41   related altars or festivities.
  42 In Japanese, it is an alternate spelling for <i lang="ja">やしろ</i>,
  43   the word for “Shinto shrine”.
  44
  45 The name <i lang="cmn-Hans">书社</i> was chosen to play on this pun, as
  46   it is intended as a publishing program for webshrines.
  47
  48 In Ascii environments, ⛩📰 书社 should be written `Shushe`, following
  49   the pinyin transliteration.
  50
  51 ## Prerequisites
  52
  53 In most cases, ⛩📰 书社 aims to require only functionality which is
  54   present in all Posix‐compliant (`POSIX.1-2001`) operating systems.
  55 There are a few exceptions.
  56 Details on particular programs are given below; if a program is not
  57   listed, it is assumed that any Posix‐compliant implementation will
  58   work.
  59
  60 ### `diff`
  61
  62 This is a Posix utility, but ⛩📰 书社 depends on functionality
  63   introduced after `POSIX.1-2001` (the `-u` option, introduced in
  64   `POSIX.1-2008`).
  65 Macintosh systems somewhat interestingly implement this option
  66   correctly in legacy mode (`COMMAND_MODE=legacy`) but incorrectly by
  67   default (despite claiming `POSIX.1-2008` conformance for this
  68   utility).
  69 [Note this erroneous comment claiming nanosecond & timezone are
  70   extensions rather than standardized.][rdar-92753335]
  71 Despite this, the default Macintosh implementation will still work with
  72   ⛩📰 书社, with the caveat that the timestamp will only include a
  73   fractional component when a Posix‐compliant (e·g, Macintosh legacy or
  74   G·N·U) implementation is used.
  75
  76 ### `file`
  77
  78 This is a Posix utility, but it was considered optional in
  79   `POSIX.1-2001` (altho it was made mandatory in `POSIX.1-2008`) and
  80   ⛩📰 书社 currently depends on unspecified behaviour.
  81 It requires support for the following additional options :⁠—
  82
  83 - **`-C`**, when supplied with `-m`, must be useable to compile a
  84     `.mgc` magicfile for use with future invocations of `file`.
  85
  86 - **`--files-from`** must be useable to provide a file that `file`
  87     should read file·names from, and `-` must be useable in this
  88     context to specify the standard input.
  89
  90 - **`--mime-type`** must cause `file` to print the internet media type
  91     of the file with no charset parameter.
  92
  93 - **`--separator`** must be useable to set the separator that `file`
  94     uses to separate file names from types.
  95
  96 These options are implemented by the [Fine Free File Command][F3C],
  97   which is used by most operating systems.
  98
  99 ### `git`
 100
 101 This is not a Posix utility.
 102 Usage of `git` is optional, but recommended (and activated by default).
 103 To disable it, set `GIT=`.
 104
 105 ### `make`
 106
 107 This is a Posix utility, but it is considered an optional Software
 108   Development utility and ⛩📰 书社 currently depends on unspecified
 109   behaviour.
 110 ⛩📰 书社 requires specifically the G·N·U version of `make`, and
 111   depends on functionality present in version 3.81 or later.
 112 It is not expected to work in previous versions, or with other
 113   implementations of Make.
 114
 115 ### `pax`
 116
 117 This is a Posix utility, but it is not included in the Linux Standard
 118   Base or installed by default in many distributions.
 119 ⛩📰 书社 only requires support for the `ustar` format.
 120
 121 ### `uudecode` and `uuencode`
 122
 123 These are Posix utilities, but they were considered optional in
 124   `POSIX.1-2001` (altho they are made mandatory in `POSIX.1-2008`) and
 125   they are not included in the Linux Standard Base or installed by
 126   default in many distributions.
 127 The G·N·U [Sharutils][] package provides one implementation.
 128
 129 ### `xmlcatalog` and `xmllint`
 130
 131 These are not a Posix utilities.
 132 They are a part of `libxml2`, but may need to be installed separately
 133   on some platforms (e·g by the name `libxml2-utils`).
 134
 135 ### `xsltproc`
 136
 137 This is not a Posix utility.
 138 It is a part of `libxslt`, but may need to be installed separately on
 139   some platforms.
 140
 141 ## Basic Usage
 142
 143 Place source files in `sources/` and run `make install` to compile
 144   the result to `public/`.
 145 Compilation involves the following steps :⁠—
 146
 147 1. ⛩📰 书社 compiles all of the magic files in `magic/` into a single
 148     file, `build/magic.mgc`.
 149
 150 2. ⛩📰 书社 processes all of the parsers in `parsers/` and determines
 151     the list of supported plaintext types.
 152
 153 3. ⛩📰 书社 identifies all of the source files and includes and uses
 154     `build/magic.mgc` to classify them by media type.
 155
 156 4. ⛩📰 书社 parses all plaintext and X·M·L source files and includes
 157     and then builds a dependency tree between them.
 158
 159 5. ⛩📰 书社 uses the dependency tree to establish prerequisites for
 160     each output file.
 161
 162 6. ⛩📰 书社 compiles each output file to `build/result`.
 163
 164 7. ⛩📰 书社 copies most output files from `build/result` to
 165      `build/public`, but it does some additional processing instead on
 166      those which indicate a non‐X·M·L desired final output form.
 167
 168 8. ⛩📰 书社 copies the final resulting files to `public`.
 169
 170 You can use `make list` to list each identified source file or include
 171   alongside its computed type and dependencies.
 172 As this is a Make‐based program, steps will only be run if the
 173   corresponding buildfile or output file is older than its
 174   prerequisites.
 175
 176 ## Name·spaces
 177
 178 The ⛩📰 书社 name·space is `urn:fdc:ladys.computer:20231231:Shu1She4`.
 179
 180 This document uses a few name·space prefixes, with the following
 181   meanings :⁠—
 182
 183 |     Prefix | Expansion                                     |
 184 | ---------: | :-------------------------------------------- |
 185 | `catalog:` | `urn:oasis:names:tc:entity:xmlns:xml:catalog` |
 186 |    `exsl:` | `http://exslt.org/common`                     |
 187 | `exslstr:` | `http://exslt.org/strings`                    |
 188 |    `html:` | `http://www.w3.org/1999/xhtml`                |
 189 |     `rdf:` | `http://www.w3.org/1999/02/22-rdf-syntax-ns#` |
 190 |     `svg:` | `http://www.w3.org/2000/svg`                  |
 191 |   `xlink:` | `http://www.w3.org/1999/xlink`                |
 192 |    `xslt:` | `http://www.w3.org/1999/XSL/Transform`        |
 193 |    `书社:` | `urn:fdc:ladys.computer:20231231:Shu1She4`    |
 194
 195 ## Setup and Configuration
 196
 197 ⛩📰 书社 depends on the following programs to run.
 198 In every case, you may supply your own implementation by overriding the
 199   corresponding (allcaps) variable (e·g, set `MKDIR` to supply your own
 200   `mkdir` implementation).
 201
 202 - `awk`
 203 - `cat`
 204 - `cd`
 205 - `cksum`
 206 - `cp`
 207 - `date`
 208 - `diff`
 209 - `file`
 210 - `find`
 211 - `git` (optional; set `GIT=` to disable)
 212 - `grep`
 213 - `ln`
 214 - `mkdir`
 215 - `mv`
 216 - `od`
 217 - `pax` (only when generating archives)
 218 - `printf`
 219 - `rm`
 220 - `sed`
 221 - `sleep`
 222 - `test`
 223 - `touch`
 224 - `tr`
 225 - `uuencode`
 226 - `uudecode`
 227 - `xargs`
 228 - `xmlcatalog` (provided by `libxml2`)
 229 - `xmllint` (provided by `libxml2`)
 230 - `xsltproc` (provided by `libxslt`)
 231
 232 The following additional variables can be used to control the behaviour
 233   of ⛩📰 书社 :⁠—
 234
 235 - **`SRCDIR`:**
 236   The location of the source files (default: `sources`).
 237   Multiple source directories can be provided, so long as the same
 238     file subpath doesn’t exist in more than one of them.
 239
 240 - **`INCLUDEDIR`:**
 241   The location of source includes (default: `sources/includes`).
 242   This can be inside of `SRCDIR`, but needn’t be.
 243   Multiple include directories can be provided, so long as the same
 244     file subpath doesn’t exist in more than one of them.
 245
 246 - **`DATADIR`:**
 247   If set to the location of a directory, ⛩📰 书社 will run a two‐stage
 248     build.
 249   In the first stage, only files in `SRCDIR` which match
 250     `FINDDATARULES` (see below) will be built, with files in `DATADIR`
 251     serving as includes.
 252   In the second stage, the remaining files in `SRCDIR` will be built,
 253     with the files built during the first stage, in addition to any
 254     files in `INCLUDEDIR`, serving as includes.
 255   Files built during the first stage are copied into `DESTDIR`
 256     alongside those from the second stage when installing.
 257
 258   This functionality is intended for sites where the bulk of the site
 259     can be built from a few data files which are expensive to create.
 260
 261 - **`BUILDDIR`:**
 262   The location of the (temporary) build directory (default: `build`).
 263   `make clean` will delete this, and it is recommended that it not be
 264     used for programs aside from ⛩📰 书社.
 265
 266 - **`DESTDIR`:**
 267   The location of directory to output files to (default: `public`).
 268   `make install` will overwrite files in this directory which
 269     correspond to those in `SRCDIR`.
 270   It _will not_ touch other files, including those generated from files
 271     in `SRCDIR` which have since been deleted.
 272
 273   Files are first compiled to `$(BUILDDIR)/public` before they are
 274     copied to `DESTDIR`, so this folder is relatively quick and
 275     inexpensive to re·create.
 276   It’s reasonable to simply delete it before every `make install` to
 277     ensure stale content is removed, assuming copies are quick on your
 278     file·system.
 279
 280 - **`THISDIR`:**
 281   The location of the ⛩📰 书社 `GNUmakefile`.
 282   This should be set automatically when calling Make and shouldn¦t ever
 283     need to be set manually.
 284   This variable is used to find the ⛩📰 书社 `lib/` folder, which is
 285     expected to be in the same location.
 286
 287 - **`MAGIC`:**
 288   A white·space‐separated list of magic files to use (default:
 289     `$(THISDIR)/magic/*`).
 290
 291 - **`EXTRAMAGIC`:**
 292   The value of this variable is appended to `MAGIC` by default, to
 293     enable additional magic files without overriding the existing
 294     ones (default: empty).
 295
 296 - **`FINDRULES`:**
 297   Rules to use with `find` when searching for source files.
 298   The default ignores files that start with a period or hyphen‐minus,
 299     those which end with a cloparen, and those which contain a hash,
 300     buck, percent, asterisk, colon, semi, eroteme, bracket, backslash,
 301     or pipe.
 302   It is important that these rules not produce any output, as anything
 303     printed to `stdout` will be considered a result of the find.
 304
 305 - **`EXTRAFINDRULES`:**
 306   The value of this variable is appended to `FINDRULES` by default, to
 307     enable additional rules without overriding the existing ones
 308     (default: empty).
 309
 310 - **`FINDINCLUDERULES`:**
 311   Rules to use with `find` when searching for includes (default:
 312     `$(FINDRULES)`).
 313
 314 - **`EXTRAFINDINCLUDERULES`:**
 315   The value of this variable is appended to `FINDINCLUDERULES` by
 316     default, to enable additional rules without overriding the existing
 317     ones (default: empty).
 318
 319 - **`DATAOPTS`:**
 320   Additional options to use when calling Make during the first stage of
 321     a two‐stage build using `DATADIR` (default: empty).
 322
 323   This can be used to override variables which are only applicable
 324     during the second stage.
 325   Note that when supplying this variable on the shell, it will need to
 326     be twice‐quoted.
 327
 328 - **`DATAEXT`:**
 329   A list of file extensions which signify “data” files during a
 330     two‐stage build using `DATADIR`  (default: `rdf`).
 331
 332 - **`FINDDATARULES`:**
 333   Rules to use with `find` when searching for data files.
 334   By default, these rules are derived from `DATAEXT`.
 335
 336 - **`EXTRAFINDDATARULES`:**
 337   The value of this variable is appended to `FINDDATARULES` by
 338     default, to enable additional rules without overriding the existing
 339     ones (default: empty).
 340
 341 - **`FINDFILTERONLY`:**
 342   A semicolon‐separated list of regular expressions, at least one of
 343     which the paths for sources and includes are required to match,
 344     unless empty (default: empty).
 345
 346 - **`FINDFILTEROUT`:**
 347   A semicolon‐separated list of regular expressions, each of which
 348     matches paths that should _not_ be considered sources or includes
 349     (default: empty).
 350
 351 - **`FINDINCLUDEFILTERONLY`:**
 352   A semicolon‐separated list of regular expressions, at least one of
 353     which the paths for includes are required to match, unless empty
 354     (default: empty).
 355
 356   Note that only paths which already match `FINDFILTERONLY` are
 357     considered.
 358
 359 - **`FINDINCLUDEFILTEROUT`:**
 360   A semicolon‐separated list of regular expressions, each of which
 361     matches paths that should _not_ be considered includes, but may
 362     still be considered sources (default: empty).
 363
 364 - **`FINDFILTERONLYEXTENDED`:**
 365   If non·empty, `FINDFILTERONLY` is an extended regular expression;
 366     otherwise, it is basic (default: empty).
 367
 368 - **`FINDFILTEROUTEXTENDED`:**
 369   If non·empty, `FINDFILTEROUT` is an extended regular expression;
 370     otherwise, it is basic (default: matches `FINDFILTERONLYEXTENDED`).
 371
 372 - **`FINDINCLUDEFILTERONLYEXTENDED`:**
 373   If non·empty, `FINDINCLUDEFILTERONLY` is an extended regular
 374     expression; otherwise, it is basic (default: matches
 375     `FINDFILTERONLYEXTENDED`).
 376
 377 - **`FINDINCLUDEFILTEROUTEXTENDED`:**
 378   If non·empty, `FINDINCLUDEFILTEROUT` is an extended regular
 379     expression; otherwise, it is basic (default: `1` if either
 380     `FINDFILTEROUTEXTENDED` or `FINDINCLUDEFILTERONLYEXTENDED` is
 381     non·empty).
 382
 383 - **`PARSERS`:**
 384   A white·space‐separated list of parsers to use (default:
 385     `$(THISDIR)/parsers/*.xslt`).
 386
 387 - **`EXTRAPARSERS`:**
 388   The value of this variable is appended to `PARSERS` by default, to
 389     enable additional parsers without overriding the existing ones
 390     (default: empty).
 391
 392 - **`PARSERLIBS`:**
 393   A white·space‐separated list of parser dependencies (default:
 394     `$(THISDIR)/lib/split.xslt`).
 395
 396 - **`EXTRAPARSERLIBS`:**
 397   The value of this variable is appended to `PARSERLIBS` by default, to
 398     enable additional parser dependencies without overriding the
 399     existing ones (default: empty).
 400
 401 - **`TRANSFORMS`:**
 402   A white·space‐separated list of transforms to use (default:
 403     `$(THISDIR)/transforms/*.xslt`).
 404
 405 - **`EXTRATRANSFORMS`:**
 406   The value of this variable is appended to `TRANSFORMS` by default, to
 407     enable additional transforms without overriding the existing ones
 408     (default: empty).
 409
 410 - **`TRANSFORMLIBS`:**
 411   A white·space‐separated list of transform dependencies (default:
 412     `$(THISDIR)/lib/serialize.xslt`).
 413
 414 - **`EXTRATRANSFORMLIBS`:**
 415   The value of this variable is appended to `TRANSFORMLIBS` by default,
 416     to enable additional transform dependencies without overriding the
 417     existing ones (default: empty).
 418
 419 - **`XMLTYPES`:**
 420   A white·space‐separated list of media types or media type suffixes to
 421     consider X·M·L (default: `application/xml text/xml +xml`).
 422
 423 - **`FINALIZE`:**
 424   A program to run on (unspecial) X·M·L files after they are
 425     transformed (default: `xmllint --nonet --nsclean`).
 426   This variable can be used for postprocessing.
 427
 428 - **`THISREV`:**
 429   The current version of ⛩📰 书社 (default: derived from the current
 430     git tag/branch/commit).
 431
 432 - **`SRCREV`:**
 433   The current version of the source files (default: derived from the
 434     current git tag/branch/commit).
 435
 436 - **`QUIET`:**
 437   If this variable has a value, informative messages will not be
 438     printed (default: empty).
 439   Informative messages print to stderr, not stdout, so disabling them
 440     usually shouldn’t be necessary.
 441   This does not (cannot) disable messages from Make itself, for which
 442     the `-s`, `--silent` ∕ `--quiet` Make option is more likely to be
 443     useful.
 444
 445 - **`VERBOSE`:**
 446   If this variable has a value, every recipe instruction will be
 447     printed when it runs (default: empty).
 448   This is helpful for debugging, but typically too noisy for general
 449     usage.
 450
 451 ## Source Files
 452
 453 Source files may be placed in `SRCDIR` in any manner; the file
 454   structure used there will match the output.
 455 The type of source files is _not_ determined by file extension, but
 456   rather by magic number; this means that files **must** begin with
 457   something recognizable.
 458 Supported magic numbers include :⁠—
 459
 460 - `<?xml` for `application/xml` files
 461 - `#!js` for `text/javascript` files
 462 - `@charset "` for `text/css` files
 463 - `#!tsv` for `text/tab-separated-values` files
 464 - `%%` for `text/record-jar` files (unregistered; see
 465     [[draft-phillips-record-jar-01][]])
 466
 467 Text formats with associated X·S·L·T parsers are wrapped in a H·T·M·L
 468   `<script>` element whose `@type` gives its media type, and then
 469   passed to the parser to process.
 470 Source files whose media type does not have an associated X·S·L·T
 471   parser are considered “assets” and will not be transformed.
 472
 473 **☡ For compatibility with this program, source file·names must not
 474   contain Ascii white·space, colons (`:`), semis (`;`), pipes (`|`),
 475   bucks (`$`), percents (`%`), hashes (`#`), asterisks (`*`), brackets
 476   (`[` or `]`), erotemes (`?`), backslashes (`\`), or control
 477   characters, must not begin with a hyphen‐minus (`-`), and must not end
 478   with a cloparen (`)`).**
 479 The former characters have the potential to conflict with make syntax,
 480   a leading hyphen‐minus is confusable for a commandline argument, and a
 481   trailing cloparen [activates a bug in G·N·U Make
 482   3.81][so-17148468-comment].
 483
 484 ## Parsers
 485
 486 Parsers are used to convert plaintext files into X·M·L trees, as well
 487   as convert plaintext formats which are already included inline in
 488   existing source X·M·L documents.
 489 ⛩📰 书社 comes with some parsers; namely :⁠—
 490
 491 - **`parsers/plain.xslt`:**
 492   Wraps `text/plain` contents in a `<html:pre>` element.
 493
 494 - **`parsers/psj.xslt`:**
 495   Converts `text/pipe-separated-jar` contents into an `<html:table>`
 496     element.
 497
 498 - **`parsers/record-jar.xslt`:**
 499   Converts `text/record-jar` contents into a `<html:div>` of
 500     `<html:dl>` elements (one for each record).
 501
 502 - **`parsers/tsv.xslt`:**
 503   Converts `text/tab-separated-values` contents into an `<html:table>`
 504     element.
 505
 506 New ⛩📰 书社 parsers which target plaintext formats should have an
 507   `<xslt:template>` element with no `@name` or `@mode` and whose
 508   `@match` attribute…
 509
 510 - Starts with an appropriately‐name·spaced qualified name for a
 511     `<html:script>` element.
 512
 513 - Follows this with the string `[@type=`.
 514
 515 - Follows this with a quoted string giving a media type supported by
 516     the parser.
 517   Media type parameters are _not_ supported.
 518
 519 - Follows this with the string `]`.
 520
 521 For example, the trivial `text/plain` parser is defined as follows :⁠—
 522
 523 ```xml
 524 <?xml version="1.0"?>
 525 <transform
 526   xmlns="http://www.w3.org/1999/XSL/Transform"
 527   xmlns:html="http://www.w3.org/1999/xhtml"
 528   xmlns:书社="urn:fdc:ladys.computer:20231231:Shu1She4"
 529   version="1.0"
 530 >
 531   <书社:id>example:text/plain</书社:id>
 532   <template match="html:script[@type='text/plain']">
 533     <html:pre><value-of select="."/></html:pre>
 534   </template>
 535 </transform>
 536 ```
 537
 538 ⛩📰 书社 will scan the provided parsers for this pattern to determine
 539   the set of allowed plaintext file types.
 540 Multiple such `<xslt:template>` elements may be provided in a single
 541   parser, for example if the parser supports multiple media types.
 542 Alternatively, you can set the `@书社:supported-media-types` attribute
 543   on the root element of the parser to override media type support
 544   detection.
 545
 546 Even when `@书社:supported-media-types` is set, it is a requirement
 547   that each parser transform any `<html:script>` elements with a
 548   `@type` which matches their registered types into something else.
 549 Otherwise the parser will be stuck in an endless loop.
 550 The result tree of applying the transform to the `<html:script>`
 551   element will be reparsed (in case any new `<html:script>` elements
 552   were added in its subtree), and a `@书社:parsed-by` attribute will be
 553   added to each toplevel element in the result.
 554 The value of this attribute will be the value of the `<书社:id>`
 555   toplevel element in the parser.
 556
 557 Parsers **should** have an `<书社:id>` and, if present, it **must** be
 558   unique.
 559
 560 It is possible for parsers to support zero plaintext types.
 561 This is useful when targeting specific dialects of X·M·L; parsers in
 562   this sense operate on the same basic principles as transforms
 563   (described below).
 564 The major distinction between X·M·L parsers and transforms is where in
 565   the process the transformation happens:
 566 Parsers are applied _prior_ to embedding (and can be used to generate
 567   embeds); transforms are applied _after_.
 568
 569 It is **strongly recommended** that auxiliary templates in parsers be
 570   name·spaced (by `@name` or `@mode`) whenever possible, to avoid
 571   conflicts between parsers.
 572
 573 ### Attributes added during parsing
 574
 575 ⛩📰 书社 will add a few attributes to elements which result from
 576   parsing plaintext `<html:script>` elements.
 577 These include :⁠—
 578
 579 - A `@书社:parsed-by` attribute, giving a space‐separated list of
 580     parsers which parsed the node.
 581   (Generally, this will be a list of one, but it is possible for the
 582     result of a parse to be another plaintext node, which may be parsed
 583     by a different parser.)
 584
 585 - A `@书社:media-type` attribute, giving the identified media type of
 586     the plaintext node.
 587
 588 ### Parsed metadata
 589
 590 It is possible to extract metadata from a document at the same time as
 591   it is being parsed.
 592 This is done by creating result elements in the `书社:about` mode;
 593   these should be R·D·F property elements which apply to the conceptual
 594   entity that is the document being parsed.
 595
 596 During transformation, metadata for the file with identifier `$FILE`
 597   can be read from the children of
 598   `$书社:about//*[@rdf:about=$FILE]/nie:interpretedAs/*`.
 599
 600 ## Output Redirection
 601
 602 By default, ⛩📰 书社 installs files to the same location in `DESTDIR`
 603   as they were placed in their `SRCDIR`.
 604 This behaviour can be customized by setting the `@书社:destination`
 605   attribute on the root element, whose value can give a different path.
 606 This attribute is read after parsing, but before transformation (where
 607   it is silently dropped).
 608
 609 Multiple destinations can be provided if the same file should be output to multiple places.
 610 The file is retransformed each time, with the value of the `DESTINATION` global param set appropriately.
 611
 612 ## Embedding
 613
 614 Documents can be embedded in other documents using a `<书社:link>`
 615   element with `@xlink:show="embed"` and an `@xlink:actuate` which is
 616   absent or `"none"`.
 617 The `@xlink:href`s of these elements should have the format
 618   `about:shushe?source=<path>`, where `<path>` provides the path to the
 619   file within `SRCDIR`.
 620 Includes, which do not generate outputs of their own but may still be
 621   freely embedded, instead use the format
 622   `about:shushe?include=<path>`, where `<path>` provides the path
 623   within `INCLUDEDIR`.
 624 If `<path>` indicates a directory and ends with a slash (`/`),
 625   everything within that directory will be embedded.
 626
 627 Embeds are replaced with the parsed contents of a file, unless the file
 628   is an asset, in which case an `<html:object>` element is produced
 629   instead (with the contents of the asset file provided as a base64
 630   `data:` u·r·i).
 631 Embed replacements will be given a `@书社:identifier` attribute whose
 632   value will match the `@xlink:href` of the embed.
 633
 634 Embedding takes place after parsing but before transformation, so
 635   parsers are able to generate their own embeds.
 636 ⛩📰 书社 is able to detect the transitive embed dependencies of files
 637   and update them accordingly; it will signal an error if the
 638   dependencies are recursive.
 639
 640 ### Attributes added during expansion
 641
 642 ⛩📰 书社 will add a few attributes to toplevel result elements, both
 643   in the main document and any embedded documents, during the expansion
 644   phase prior to the main transformation.
 645 These include :⁠—
 646
 647 - A `@书社:cksum` attribute giving the `cksum` checksum of the
 648     corresponding source file.
 649
 650 - A `@书社:mtime` attribute giving the last modified time of the
 651     corresponding source file.
 652
 653 - A `@书社:identifier` attribute giving the ⛩📰 书社 identifier
 654     (i·e, starting with `about:shushe?`) of the corresponding source
 655     file.
 656
 657 - For elements in the `html` namespace, an `itemscope` attribute and an
 658     `itemtype` attribute with a value of
 659     `urn:fdc:ladys.computer:20231231:Shu1She4:document` (for the main
 660     document) or `urn:fdc:ladys.computer:20231231:Shu1She4:embed` (for
 661     embedded documents).
 662   These attributes are used to scope any nested `<html:meta>` elements
 663     with `@itemprop` attributes to their containing documents.
 664
 665 ## Soft Dependencies
 666
 667 When a file depends only on the metadata of another file, and not its
 668   contents, it can be added as a soft dependency rather than an embed.
 669 Soft dependencies are indicated using a `<书社:link>` element with an
 670   `@xlink:show` of `"other"`, `"none"`, or absent, and an
 671   `@xlink:actuate` which is absent or `"none"`.
 672 A change to a soft dependency requires a file to be rebuilt, but no
 673   embedding occurs automatically.
 674 Because there is no automatic embedding, soft dependencies are allowed
 675   to be recursive.
 676
 677 The `@xlink:href`s of soft dependency `<书社:link>`s are processed in
 678   exactly the same fashion as embeds, described above.
 679
 680 If the value of `@xlink:show` is `"other"`, the soft dependency is
 681   transitive.
 682 Any dependencies of the indicated file which have a `@name` which
 683   matches that of the referencing `<书社:link>` element will also be
 684   treated as soft dependencies.
 685 If no `@name` is given, it is treated as the empty string.
 686
 687 When a document is embedded directly, all of its soft dependencies are
 688   also treated as soft dependencies of the embedding object.
 689 However, a document is embedded in a transitive soft dependency, the
 690   embed is treated exactly as tho it were itself a transitive soft
 691   dependency.
 692 That means it must have a matching `@name` to be included, and
 693   like·wise for any embeds or soft dependencies it contains.
 694
 695 If the value of `@xlink:show` is `"none"` or absent, the soft
 696   dependency is not transitive and its own dependencies are not
 697   checked.
 698
 699 ## Transforms
 700
 701 Transforms are used to convert X·M·L files into their final output,
 702   after all necessary parsing and embedding has taken place.
 703 ⛩📰 书社 comes with some transforms; namely :⁠—
 704
 705 - **`transforms/asset.xslt`:**
 706   Converts `<html:object>` elements which correspond to recognized
 707     media types into the appropriate H·T·M·L elements, and deletes
 708     `<html:style>` elements from the body of the document while
 709     transferring them to the head.
 710   This conversion happens during the finalization phase, after the main
 711     transformation.
 712
 713 - **`transforms/expansion.xslt`:**
 714   Performs embedding, as described above.
 715
 716 - **`transforms/metadata.xslt`:**
 717   Provides basic `<html:head>` metadata.
 718   This metadata is generated from `<html:meta>` elements with one of
 719     the following `@itemprop` attributes :⁠—
 720
 721   - **`urn:fdc:ladys.computer:20231231:Shu1She4:title`:**
 722     Provides the title of the page.
 723
 724   ⛩📰 书社 automatically encapsulates H·T·M·L embeds so that their
 725     metadata does not propagate up to the embedding document.
 726   To undo this behaviour, remove the `@itemscope` and `@itemtype`
 727     attributes from the embed during the transformation phase.
 728
 729 - **`transforms/serialization.xslt`:**
 730   Replaces `<书社:serialize-xml>` elements with the (escaped)
 731     serialized X·M·L of their contents.
 732   This replacement happens during the finalization phase, after most
 733     other transformations have taken place.
 734
 735   If a `@with-namespaces` attribute is provided, any name·space nodes
 736     on the toplevel serialized elements whose U·R·I’s correspond to the
 737     definitions of the provided prefixes, as defined for the
 738     `<书社:serialize-xml>` element, will be declared using name·space
 739     attributes on the serialized elements.
 740   Otherwise, only name·space nodes which _differ_ from the definitions
 741     on the `<书社:serialize-xml>` element will be declared.
 742   The string `#default` may be used to represent the default
 743     name·space.
 744   Multiple prefixes may be provided, separated by white·space.
 745
 746   When it comes to name·spaces used internally by ⛩📰 书社, the
 747     prefix used by ⛩📰 书社 may be declared _in addition to_ the
 748     prefix(es) used in the source document(s).
 749   It is not possible to selectively only declare one prefix for a
 750     name·space to the exclusion of others.
 751
 752   `<书社:raw-output>` elements may be used inside of
 753     `<书社:serialize-xml>` elements to inject raw output into the
 754     serialized X·M·L.
 755
 756 The following are recommendations on effective creation of
 757   transforms :⁠—
 758
 759 - Make template matchers as specific as possible.
 760   It is likely an error if two transforms have templates which match
 761     the same element (unless the templates have different priority).
 762
 763 - Name·space templates (with `@name` or `@mode`) whenever possible.
 764
 765 - Set `@exclude-result-prefixes` on the root `xslt:transform` element
 766     to reduce the number of declared name·spaces in the final result.
 767
 768 ## Global Params
 769
 770 The following params are made available globally in parsers and
 771   transforms :⁠—
 772
 773 - **`BUILDTIME`:**
 774   The current time.
 775
 776 - **`IDENTIFIER`:**
 777   The ⛩📰 书社 identifier of the source file (a u·r·i beginning with
 778     `about:shushe`).
 779
 780 - **`SRCREV`:**
 781   The value of the `SRCREV` variable (if present).
 782
 783 - **`THISREV`:**
 784   The value of the `THISREV` variable (if present).
 785
 786 In transforms, the following params are additionally available :⁠—
 787
 788 - **`DESTINATION`:**
 789   The destination being targeted by this transform.
 790
 791 - **`书社:about`:**
 792   R·D·F metadata about all of the documents ⛩📰 书社 knows about.
 793   Use `$书社:about//*[@rdf:about=$IDENTIFIER]` to get the metadata for
 794     the current document.
 795
 796 - **`书社:source`:**
 797   The parsed source document being transformed, prior to any expansion.
 798
 799 - **`书社:expansion`:**
 800   The document after the all embeds have been expanded.
 801   Unavailable during the `书社:expand` stage.
 802
 803 - **`书社:result`:**
 804   The document after the main set of transformations have been applied.
 805   Only available during the `书社:finalize` stage, where it is used to
 806     apply output wrapping and other clean·up.
 807
 808 ## Output Wrapping
 809
 810 Provided at least one toplevel result element belongs to the H·T·M·L
 811   namespace, ⛩📰 书社 will wrap the final output of the transforms in
 812   appropriate `<html:html>` and `<html:body>` elements, so it is not
 813   necessary for transforms to do this explicitly.
 814 If a toplevel result element _is_ a `<html:html>` and `<html:body>`
 815   element, it will be merged with the one that ⛩📰 书社 creates.
 816 Consequently, wrapping the result in a `<html:body>` element can be
 817   used to enable wrapping for non‐H·T·M·L content, when desired.
 818
 819 As a part of this process, after performing the initial transform
 820   ⛩📰 书社 will match in the following modes to fill in areas of the
 821   wrapper :⁠—
 822
 823 - **`书社:header`:**
 824   The result of matching in this mode is prepended into the
 825     `<html:body>` of the output (before the transformation result).
 826
 827 - **`书社:footer`:**
 828   The result of matching in this mode is appended into the
 829     `<html:body>` of the output (after the transformation result).
 830
 831 - **`书社:metadata`:**
 832   The result of matching in this mode is inserted into the
 833     `<html:head>` of the output.
 834
 835 The document being matched will contain the full transform result
 836   prior to wrapping as well as an `<书社:id>` element for each
 837   transform.
 838 The latter elements can be matched to enable transforms to provide
 839   content _without_ matching any elements in the result; for example,
 840   the following transform adds a link to a stylesheet to the
 841   `<html:head>` of every page :⁠—
 842
 843 ```xml
 844 <?xml version="1.0"?>
 845 <transform
 846   xmlns="http://www.w3.org/1999/XSL/Transform"
 847   xmlns:html="http://www.w3.org/1999/xhtml"
 848   xmlns:xslt="http://www.w3.org/1999/XSL/Transform"
 849   xmlns:书社="urn:fdc:ladys.computer:20231231:Shu1She4"
 850   exclude-result-prefixes="书社"
 851   version="1.0"
 852 >
 853   <书社:id>example:add-stylesheet-links.xslt</书社:id>
 854   <template match="书社:id[string(.)='example:add-stylesheet-links.xslt']" mode="书社:metadata">
 855     <html:link rel="stylesheet" type="text/css" href="/style.css"/>
 856   </template>
 857 </transform>
 858 ```
 859
 860 Output wrapping can be entirely disabled by adding a
 861   `@书社:disable-output-wrapping` attribute to the top‐level element in
 862   the result tree.
 863 It will not be performed on outputs whose root elements are
 864   `<书社:archive>`, `<书社:base64-binary>`, or `<书社:raw-text>`
 865   (described below), or on result trees which do not contain a toplevel
 866   element in the H·T·M·L namespace.
 867
 868 ## Applying Attributes
 869
 870 The `<书社:apply-attributes>` element will apply any attributes on the
 871   element to the element(s) it wraps.
 872 It is especially useful in combination with embeds.
 873
 874 The `<书社:apply-attributes-to-root>` element will apply any attributes
 875   on the element to the root node of the final transformation result.
 876 It is especially useful in combination with output wrapping.
 877
 878 In both cases, attributes from various sources are combined with
 879   white·space between them.
 880 Attribute application takes place after each stage of the
 881   transformation, including after the initial embedding phase.
 882
 883 Both elements ignore attributes in the `xml:` name·space, except for
 884   `@xml:lang`, which ignores all but the first definition (including
 885   any already present on the root element).
 886 On H·T·M·L and S·V·G elements, `@lang` has the same behaviour as
 887   `@xml:lang`.
 888
 889 ## Other Kinds of Output
 890
 891 There are a few special elements in the `书社:` name·space which, if
 892   they appear as the toplevel element in a transformation result, cause
 893   ⛩📰 书社 to produce something other than an X·M·L file.
 894 They are :⁠—
 895
 896 - **`<书社:archive>`:**
 897   Each child element with a `@书社:archived-as` attribute will be
 898     archived as a separate file in a resulting tarball (this attribute
 899     gives the file name).
 900   These elements will be processed the same as the root elements of any
 901     other file (e·g, they will be wrapped; they can themselves specify
 902     non X·M·L output types, ⁊·c).
 903   Other child elements will be ignored.
 904
 905   If the `<书社:archive>` element is given an `@书社:expanded`
 906     attribute, rather than producing a tarball ⛩📰 书社 will output
 907     the directory which expanding the tarball would produce.
 908   This mechanism can be used to generate multiple files from a single
 909     source, provided all of the files are contained with·in the same
 910     directory.
 911
 912 - **`<书社:base64-binary>`:**
 913   The text nodes in the transformation result will, after removing all
 914     Ascii whitespace, be treated as a Base·64 string, which is then
 915     decoded.
 916
 917 - **`<书社:raw-text>`:**
 918   A plaintext (U·T·F‐8) file will be produced from the text nodes in
 919     the transformation result.
 920
 921 ## License
 922
 923 This repository conforms to [REUSE][].
 924
 925 Most source files are licensed under the terms of the <cite>Mozilla
 926   Public License, version 2.0</cite>.
 927
 928 [F3C]: <https://darwinsys.com/file/>
 929 [REUSE]: <https://reuse.software/spec/>
 930 [Sharutils]: <https://www.gnu.org/software/sharutils/>
 931 [draft-phillips-record-jar-01]: <https://datatracker.ietf.org/doc/html/draft-phillips-record-jar-01>
 932 [rdar-92753335]: <https://github.com/apple-oss-distributions/patch_cmds/blob/5084833f90df1b0e0924ea56f94c0199b3b8bbc6/diff/diffreg.c#L1800-L1808>
 933 [so-17148468-comment]: <https://stackoverflow.com/questions/17148468/capturing-filenames-including-parentheses-with-gnu-makes-wildcard-function#comment24825307_17148894>