The TEI is dead

Long live the TEI

C. M. Sperberg-McQueen

Pisa, 16 November 2001

TOC | First


Overview

previous table of contents next
1 of 23

Some things we did right

previous table of contents next
2 of 23

What to do better now

previous table of contents next
3 of 23

Political / organizational

previous table of contents next
4 of 23
(Why are you listening to me?)

Hopes

previous table of contents next
5 of 23
  • retain globalist / universalist aims
  • retain and expand international base
  • wider adoption and awareness

Technical program

previous table of contents next
6 of 23
Always and foremost: maintaining and improving the tag sets.
Some overarching challenges:
  • extensibility
  • modularity
  • integration with practice

Extensibility

previous table of contents next
7 of 23
  • The key: partial understanding.
  • The closed-world hypothesis: just say No.
  • Knowing how to extend in advance.
(How will we know what we think until we see what we do?)

Extensibility

previous table of contents next
8 of 23
Consider a precedent:
  • HTML's extensibility rule.
  • Two kinds of extension
    • ignore the tags
    • ignore the elements
    but only one kind of support.
  • This cup is half full.
Bottom line: err boldly.

Modularity

previous table of contents next
9 of 23
TEI pioneered modularity in DTDs: Pizza model.
We can do even better:
  • make the pizza model generically usable (Chinese menu!)
  • integrate better the work done elsewhere
    • MathML, SVG, etc.
    • CALS or HTML tables
    • Docbook
  • easier extension of TEI

Modularity how?

previous table of contents next
10 of 23
Consequences:
  • cleaner definition of extension
  • easier for developers to recognize and support standard TEI
  • easier to support single tag sets

Namespaces: half empty

previous table of contents next
11 of 23
The W3C Namespaces Recommendation does a lot less than meets the eye.
  • no magic (or non-magic) information about meaning
  • no universally unique names
  • no information about markup vocabularies or document types
This glass is mostly empty.

Namespaces: half full

previous table of contents next
12 of 23
The W3C Namespaces Recommendation does just one thing:
  • It defines one* way to tell your stuff from everybody else's stuff.
Little things can go a long way.

Namespaces: some issues

previous table of contents next
13 of 23
Namespace names are uniform resource identifiers (URIs). E.g. http://www.tei-c.org/ns/teip4.
What goes at that URL?
Namespace mechanism supports direct reuse.

Namespaces: some issues

previous table of contents next
14 of 23
Namespace names are uniform resource identifiers (URIs). E.g. http://www.tei-c.org/ns/teip4.
What goes at that URL?
Namespace mechanism supports direct reuse.
It does not support naturalization of predefined semantics.
Revive architectural forms?

Integration of theory and practice I

previous table of contents next
15 of 23
Integrate and cohabit with work done elsewhere:
  • Unicode
  • XPath
  • XML Schema, Relax NG, ...

Unicode

previous table of contents next
16 of 23

XPath and EPN

previous table of contents next
17 of 23
The TEI Extended Pointer Notation
  • implemented, but not really widely
  • numerous newer specs, interrelated:
    • XPointer
    • XSLT
    • XPath
    • various subsets of XPath (XSLT match, XML Schema, XForms, ...)
  • What is to be done?

XPointer -- issues

previous table of contents next
18 of 23
  • simplicity
  • expressive power
  • interoperability
  • semantic consistency (intertranslatability)
  • syntactic consistency (interoperability)
Wait and see?
Alternate EPN and XPointer tag sets?

Schema languages

previous table of contents next
19 of 23
There are lots and lots of XML schema languages:
  • Sox, Biztalk (XML Data Reduced), DDML, DSD, ...
  • Relax, Trex, Relax NG
  • XML Schema (W3C)

Other markup languages

previous table of contents next
20 of 23
Integrate and cohabit with work done elsewhere:
  • MathML
  • parts of XHTML
  • SVG
  • SMIL?
  • CALS/SGML Open/Oasis tables, HTML tables, ...

Theory and practice II

previous table of contents next
21 of 23
The community needs continuing work on `markup theory':
  • Thesis: markup represents an interpretation of the text
  • perfect markup is thus markup which perfectly captures our theory of the text;
  • a perfect markup language captures our theory of what is possible in texts.

Long live the TEI

previous table of contents next
22 of 23
The value of the TEI lies not any particular organizational form,
nor even in a particular SGML or XML DTD,
but in its role as an expression of our will to ensure ...

Long live the TEI

previous table of contents next
23 of 23
  • that the resources we create are reusable, because not tied exclusively to a particular application
  • that we work cooperatively to produce shared semantics for as broad and deep a universe of texts as possible
  • ... for behold, if they speak a single language, nothing they wish to do will be impossible for them.