Why TEI Stand-off Markup Authoring Needs Simplification
2016, Journal of the Text Encoding Initiative
https://doi.org/10.4000/JTEI.1838
Abstract
It is now commonly acknowledged that a text can be encoded according to multiple and competing modeling principles, despite the longstanding influence of the OHCO model (Ordered Hierarchy of Content Objects: see DeRose et al. 1990), which identifies nesting units of writing (such as chapters, sections, and paragraphs) as the representation of "what text really is." The TEI Guidelines generally follow this principle (with some important recent changes in stance), but provide mechanisms for encoding other textual features as secondary hierarchies, such as syntactic or rhetorical structures, or structures pertaining to physical text-bearing objects such as pages, openings (e.g., a pair of facing pages in a book), or other surfaces.
Secondary hierarchies can be encoded in TEI in a few ways, covered in Chapter 20 of the Guidelines "Non-hierarchical Structures." A commonplace solution is to use empty elements through the text to indicate the beginning of textual units ("milestones" in TEI jargon, such as <pb> for page breaks). When milestones are not sufficient, some inline elements can use pointing mechanisms to connect the starting and ending points of a textual feature. The elements <addSpan> and <delSpan>, for
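As an illustration of the two mechanisms named in the abstract, the following is a minimal sketch, not taken from the article, of how a milestone (<pb>) and a pointing element (<delSpan> with a spanTo pointer) might be resolved by a program. The sample fragment, the identifier del1-end, the use of <anchor> as the end point, and the function name resolve_del_spans are all assumptions made for the example.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_ID = "{http://www.w3.org/XML/1998/namespace}id"

# Hypothetical TEI fragment: a deletion that starts in one paragraph,
# crosses a page break, and ends in the next paragraph.
SAMPLE = """<body xmlns="http://www.tei-c.org/ns/1.0">
  <p>First paragraph, kept. <delSpan spanTo="#del1-end"/>Struck through here</p>
  <pb n="2"/>
  <p>and still struck on the next page.<anchor xml:id="del1-end"/> Kept again.</p>
</body>"""


def tei(name):
    """Return the namespace-qualified tag name used by ElementTree."""
    return f"{{{TEI_NS}}}{name}"


def stream(el):
    """Yield elements and text chunks of a tree in document order."""
    yield el
    if el.text:
        yield el.text
    for child in el:
        yield from stream(child)
        if child.tail:
            yield child.tail


def resolve_del_spans(root):
    """Collect the text and page breaks covered by each <delSpan>.

    Assumes spans do not nest and that each spanTo value is a local
    pointer of the form "#id" to an <anchor> later in the document.
    """
    spans = []
    target = None      # xml:id we are waiting for, or None outside a span
    text, pages = [], []
    for item in stream(root):
        if isinstance(item, str):
            if target is not None:
                text.append(item)
        elif item.tag == tei("delSpan"):
            target = item.get("spanTo", "").lstrip("#")
            text, pages = [], []
        elif target is not None and item.tag == tei("pb"):
            pages.append(item.get("n"))   # milestone crossed inside the span
        elif item.tag == tei("anchor") and item.get(XML_ID) == target:
            # Normalize whitespace left over from element boundaries.
            spans.append({"text": " ".join("".join(text).split()),
                          "page_breaks": pages})
            target = None
    return spans


if __name__ == "__main__":
    for span in resolve_del_spans(ET.fromstring(SAMPLE)):
        print(span)
```

The sketch walks the document as a flat stream rather than as a tree, which is why the deleted passage can cross the paragraph and page boundaries that the primary hierarchy imposes.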
References (8)
- Piez, Wendell. 2015. "TEI in LMNL: Implications for Modeling." Journal of the Text Encoding Initiative 8. http://journals.openedition.org/jtei/1337; doi:10.4000/jtei.1337.
- Robinson, Peter. 2013. "Towards a Theory of Digital Editions." Variants 10:105-31.
- Sahle, Patrick. 2013. Digitale Editionsformen. Zum Umgang mit der Überlieferung unter den Bedingungen des Medienwandels. Teil 3: Textbegriffe und Recodierung. Schriften des Instituts für Dokumentologie und Editorik, 9. Norderstedt: Books on Demand. https://kups.ub.uni-koeln.de/5353/1/DigEditionen_3.pdf.
- Schmidt, Desmond, and Robert Colomb. 2009. "A Data Structure for Representing Multi-version Texts Online." International Journal of Human-Computer Studies 67(6): 497-514. doi:10.1016/j.ijhcs.2009.02.001.
- TEI Consortium. 2019. TEI P5: Guidelines for Electronic Text Encoding and Interchange. Version 3.5.0. Last updated January 29. N.p.: TEI Consortium. http://www.tei-c.org/Vault/P5/3.5.0/doc/tei-p5-doc/en/html/.
- Viglianti, Raffaele. 2016. "Music and Words: Reconciling Libretto and Score Editions in the Digital Medium." In Ei, dem alten Herrn zoll' ich Achtung gern': Festschrift für Joachim Veit zum 60. Geburtstag, edited by Kristina Richts and Peter Stadler, 727-46. München: Allitera Verlag. doi:10.25366/2018.46.
- Weber, Carl Maria von. 2007. Der Freischütz: Romantische Oper in drei Aufzügen: Kritische Textbuch-Edition. Text by Friedrich Kind; edited by Solveig Schreiter in collaboration with the Carl-Maria-von-Weber-Gesamtausgabe. Munich: Allitera Verlag.
- 1 There are non-XML solutions to deal with multiple encoding hierarchies, such as LMNL (Layered Markup and Annotation Language), a data model that can be expressed with a syntax that, unlike XML, allows overlapping elements. Seemingly, stand-off would not be necessary in a TEI document expressed using LMNL: "to be unburdened of the chore of assigning and maintaining pointers or references … comes as a considerable relief: even in a plain text editor, one can simply tag, focusing on the tasks of tagging" (Piez 2015). The main obstacle to adopting this kind of approach is ultimately the lack of software support, as XML is supported by a large number of tools and languages. The relief of "simply tagging" offered by LMNL is also likely proportional to the number of hierarchies one wants or needs to represent: the proliferation of overlapping elements may burden the readability of the text enough to prompt encoders to move one or more hierarchies "away" from the text, in some stand-off form.