Rose Tools: A Medieval Manuscript Text-Image Annotation Project
In Digitizing Medieval and Early Modern Material Culture, eds. Brent Nelson and Melissa Terras (ACMRS, 2011)
Abstract
With the now widespread availability of digitized data, researchers in the humanities are learning to go about their work in new ways, and they require new research tools to do so.
Related papers
2022
This essay focuses on the development of new digital paleographical methods to support the study of medieval and modern manuscripts. The potential of image manipulation software such as Adobe Photoshop has not yet been fully explored, especially given the constant improvements and new features offered by continual software updates. Compared to traditional manuscript reading (the human eye on the page), digital image processing offers many tools for analyzing script from several different paleographical perspectives: for instance, close reading of the external shape of words or single letters, as well as overlaid letter comparison to identify commonalities and differences through accurate pixel computation. In addition, features of this software can be used to enhance readings and build knowledge by restoring effaced, discolored, or corrupted parchment without affecting the original document. The poetic corpus written by the Italian poet Francesco Petrarca (Petrarch) during the fourteenth century, entitled Rerum vulgarium fragmenta and also known as the Canzoniere, serves as a case study here. This corpus is fascinating in part because it is extremely rare in medieval literature to be able to trace the process of authorial revision from first draft (MS Vat. Lat. 3196) through definitive final copy (MS Vat. Lat. 3195). Petrarch is in fact the only medieval poet for whom we have the original autograph manuscript. In composing the Canzoniere, Petrarch transcribed poems previously written in Vat. Lat. 3196, also known as the Codice degli abbozzi, into Vat. Lat. 3195. Since an autograph always provides invaluable insight into a text's genesis and creation, these two codices remain at the center of a long hermeneutical and philological debate. My thanks to Dr. Amyrose McCue Gill of TextFormations for her assistance in editing the text and revising translations.
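By way of illustration, the following is a minimal Python sketch of the kind of overlay comparison the essay describes, not the author's own workflow: it assumes two roughly aligned grayscale crops of the same letterform, and the filenames, working size, and binarization threshold are arbitrary choices introduced here.

```python
# Hedged sketch: compare two letterform crops by overlaying their binarized "ink".
# Assumes two roughly aligned grayscale images; filenames are hypothetical.
from PIL import Image
import numpy as np

def letter_overlap(path_a, path_b, size=(100, 100), threshold=128):
    """Binarize two letter crops and report how much ink they share."""
    a = np.array(Image.open(path_a).convert("L").resize(size)) < threshold
    b = np.array(Image.open(path_b).convert("L").resize(size)) < threshold
    shared = np.logical_and(a, b).sum()   # pixels inked in both letters
    union = np.logical_or(a, b).sum()     # pixels inked in either letter
    return shared / union if union else 0.0  # Jaccard-style overlap score

if __name__ == "__main__":
    # Hypothetical crops of the same letter taken from two different folios.
    print(f"Shared ink ratio: {letter_overlap('letter_a.png', 'letter_b.png'):.2%}")
```

A Jaccard-style overlap of binarized ink is only one possible measure; in practice, image registration and threshold choice matter at least as much as the pixel arithmetic itself.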
CLARIN
This chapter presents the Austrian experience of building CLARIN-related infrastructures and services and describes its impact on the wider humanities research community. We focus on the activities of the Austrian Centre for Digital Humanities and Cultural Heritage at the Austrian Academy of Sciences (ACDH-CH), a centre of expertise which now supports projects in a broad range of humanities disciplines. Some of the ACDH-CH's services are concerned with research data preservation in the long-term repository ARCHE, which is elaborated on here, as is a set of text-technological and semantic services. Furthermore, the crucial role of knowledge-sharing measures in the increased adoption of DH methods is described, and Austrian contributions and cooperation in the context of building European research infrastructures for the humanities are highlighted.
The merging of corpus-linguistic methods and digital technology can provide new ways of representing medieval texts in digital form. In this paper, we introduce a multi-layered parallel Old Occitan-English corpus. We show how parallel alignment can help overcome some of the challenges associated with historical manuscripts. Furthermore, we apply a resource-light method of building an emotion annotation layer via parallel alignment. Finally, using visualization tools such as ANNIS and GoogleViz, we demonstrate how our parallel corpus can be queried and visualized dynamically through the modern-language layer.
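As a rough illustration of how label projection across a parallel alignment can work (the tokens, alignment pairs, and emotion lexicon below are invented examples, not the paper's own data or pipeline):

```python
# Hedged sketch: project emotion labels from an English translation onto aligned
# Old Occitan tokens. All data here is a toy example for illustration only.
occitan = ["amors", "mi", "fai", "chantar"]
english = ["love", "makes", "me", "sing"]
alignment = [(0, 0), (1, 2), (2, 1), (3, 3)]  # (occitan_index, english_index)

# Toy English emotion lexicon standing in for a resource-rich modern-language tool.
english_emotion = {"love": "joy", "sing": "joy"}

# Each English label is carried over to its aligned Old Occitan token.
occitan_emotion = {
    occitan[i]: english_emotion[english[j]]
    for i, j in alignment
    if english[j] in english_emotion
}
print(occitan_emotion)  # {'amors': 'joy', 'chantar': 'joy'}
```

The point of such a resource-light approach is that annotation resources exist for the modern language but not for the historical one, so the alignment does the transfer work.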
In the spring semester of 2011 I signed up for a PhD-level seminar at Northern Illinois University entitled "Paleography," taught by Dr. Nicole Clifton. The majority of the coursework consisted of learning various styles of handwriting scripts dating from 100 BCE to 1700 CE, as well as transcribing, dating, and identifying the origin of a manuscript housed in the Rare Books section of the university library. It was in this class that I was first introduced to the extensive work conducted by the British Library's Manuscript Studies division and housed on its website. The BL has digitized a large assortment of texts from across its holdings, especially medieval manuscripts dating as early as the 10th century. While I had previously involved myself in technological discussions in venues such as Kairos and Computers and Composition, I had never spent time working at the intersection of ancient text and modern technology. The availability of ancient manuscripts and the ability to work with programs like those from Adobe allowed me to shift my thinking about writing from a focus exclusively on hypertextual writing to seeing the need for writing to become more accessible, especially works that are normally housed in archives hidden away from public view. Is there a digital medieval humanities? Multimodality within the medieval community is nothing new, as demonstrated by projects such as the CANTUS database, Project Gutenberg, various linguistic tutorials for medieval Latin, Old High German, Old English, and Old French, and annotated hypertext websites covering the works of Geoffrey Chaucer, Thomas Malory, and Wolfram von Eschenbach.
2012
XML
Gayoso-Cabada, Joaquin (gayoxo@gmail.com); Ruiz, Cesar (cruiz85@gmail.com); Pablo-Nuñez, Luis (lpnunez@filol.ucm.es); Sarasa-Cabezuelo, Antonio (asarasa@fdi.ucm.es); Goicoechea-de-Jorge, Maria (mgoico@filol.ucm.es); Sanz-Cabrerizo, Amelia (amsanz@filol.ucm.es); Sierra-Rodriguez, Jose-Luis (jlsierra@fdi.ucm.es), all of the Universidad Complutense de Madrid, Spain.
In Tara Andrews and Caroline Macé (eds.), Analysis of Ancient and Medieval Texts and Manuscripts: Digital Approaches, 2014
2002
Users need more sophisticated tools to handle the growing number of image-based documents available in databases. In this paper, we present a system devoted to the editing and browsing of complex literary hypermedia including original manuscript documents and other handwritten sources. Editing capabilities allow the user to transcribe manuscript images in an interactive way and to encode the resulting textual representation by means of a logical markup language (based on the XML/TEI specification). Both representations (image and structured text) are tightly linked to facilitate the reading and the interpretation of documents. This text/image coupling scheme is an attempt to unify several layers of information in order to provide the user with a global vision of the work. Our system also supplies tools capable of processing and relating information stored both in images and structured texts. Finally, application-specific visualization techniques have been developed in order to provide users with a way to identify relationships between source documents and help them to navigate.
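The text/image coupling described here can be pictured with a small TEI-style fragment. The snippet below is a hedged sketch only: the identifiers, pixel coordinates, image filename, and transcribed line are invented for illustration, and it is printed from Python simply for consistency with the other examples in this section.

```python
# Hedged sketch of a TEI-style text/image link: a facsimile zone giving the pixel
# box of one manuscript line, and a transcription line pointing back to that zone.
tei_fragment = """\
<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <facsimile>
    <surface xml:id="f1r">
      <graphic url="folio_1r.jpg"/>
      <!-- bounding box of one manuscript line on the page image -->
      <zone xml:id="f1r_l1" ulx="120" uly="340" lrx="980" lry="395"/>
    </surface>
  </facsimile>
  <text>
    <body>
      <!-- the transcribed line references its image region via @facs -->
      <l facs="#f1r_l1">Maintes genz dient que en songes</l>
    </body>
  </text>
</TEI>
"""
print(tei_fragment)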
De Gruyter eBooks, 2019
Cataloging and Citing Greek and Latin Authors and Works illustrates not only how Classicists have built upon larger standards and data models such as the Functional Requirements for Bibliographic Records (FRBR, allowing us to represent different versions of a text) and the Text Encoding Initiative (TEI) Guidelines for XML encoding of source texts (representing the logical structure of sources), but also highlights some major contributions from Classics. Alison Babeu, Digital Librarian at Perseus, describes a new form of catalog for Greek and Latin works that exploits the FRBR data model to represent the many versions of our sources, including translations. Christopher Blackwell and Neel Smith built on FRBR to develop the Canonical Text Services (CTS) data model as part of the CITE Architecture. CTS provides an explicit framework within which we can address any substring in any version of a text, allowing us to create annotations that can be maintained for years and even for generations. This addresses, at least within the limited space of textual data, a problem that has plagued hypertext systems since the 1970s and that still afflicts the World Wide Web. Those who read these papers years from now will surely find that many of the URLs in the citations no longer function, but all of the CTS citations should be usable, whether we remain with this data model or replace it with something more expressive. Computer scientists Jochen Tiepmar and Gerhard Heyer show how they were able to develop a CTS server that could scale to more than a billion words, thus establishing the practical nature of the CTS protocol.

If there were a Nobel Prize for Classics, my nominations would go to Blackwell and Smith for CITE/CTS and to Bruce Robertson, whose paper on Optical Character Recognition opens the section on Data Entry, Collection, and Analysis for Classical Philology. Robertson has worked for a decade, with funding and without, on the absolutely essential problem of converting images of printed Greek into machine-readable text. In this effort, he has mastered a wide range of techniques drawn from areas such as human-computer interaction, statistical analysis, and machine learning. We can now acquire billions of words of Ancient Greek from printed sources, and not just from multiple editions of individual works: this allows us not only to trace the development of our texts over time but also to identify quotations of Greek texts in articles and books, and thus to see which passages are studied by different scholarly communities at different times. He has enabled fundamental new work on Greek. Meanwhile, the papers by Tauber, Burns, and Coffee address, respectively, the representation of characters, a pipeline for textual analysis of Classical languages, and a system that detects where one text alludes to, without extensively quoting, another text. At its base, philology depends upon the editions which provide information about our source texts, including variant readings, a proposed reconstruction of the original, and the reasoning behind decisions made in analyzing the text.
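To make the CTS idea concrete, here is a small, hedged sketch of how a canonical citation breaks down into its components. The URN follows the common Perseus pattern (urn:cts:<namespace>:<textgroup>.<work>.<version>:<passage>), and the parser is deliberately simplified: it is not a resolver and does not talk to a CTS server.

```python
# Hedged sketch: split a CTS URN into its parts. Simplified (assumes all five
# colon-separated components are present); not an implementation of the protocol.
def parse_cts_urn(urn):
    prefix, scheme, namespace, work_part, passage = urn.split(":")
    assert (prefix, scheme) == ("urn", "cts")
    parts = work_part.split(".")              # textgroup, work, optional version
    return {
        "namespace": namespace,               # e.g. a corpus of Greek literature
        "textgroup": parts[0],                # e.g. an author identifier
        "work": parts[1] if len(parts) > 1 else None,
        "version": parts[2] if len(parts) > 2 else None,  # a specific edition
        "passage": passage,                   # e.g. book.line, stable across editions
    }

print(parse_cts_urn("urn:cts:greekLit:tlg0012.tlg001.perseus-grc2:1.1"))
```

Because the passage reference is defined against the work rather than against any one page or URL, the same citation remains addressable even as editions and hosting locations change, which is the durability claim made above.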
This paper presents a digital model and software created in the context of the VEdition project to provide a critical digital edition of Goethe's Venetian Epigrams. The paper proposes an innovative textological approach, focusing on a generic and reusable model of autographs to represent the dynamic nature of the creative process. While keeping the "objective" reproduction of documents separate from subjective scholarly interpretations, the model centers on a single structured, computable, and compact graph-based data structure, making it possible to generate multiple text versions, annotated at any level of granularity, for both textual and visual content. A full-fledged web UI (and an alternative, complementary DSL) facilitates the creation of content, allowing scholars to focus on reconstructing the creative process at a higher level of abstraction while providing virtually unlimited export formats for integration with TEI-based production flows.
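The general idea of a single graph-based structure generating several text versions can be sketched as follows. This toy class, the stage names, and the example wording are all invented for illustration; they are not the VEdition model itself, which handles annotation layers and visual content that this sketch omits.

```python
# Hedged sketch: one graph, several text versions. Tokens are nodes and each edge
# is labeled with the revision stages in which it is valid, so walking the graph
# for a given stage yields that stage's wording. Toy example data only.
from collections import defaultdict

class VersionGraph:
    def __init__(self):
        self.edges = defaultdict(list)   # node -> [(next_node, {stages})]

    def add_path(self, tokens, stages):
        """Register a token sequence as valid for the given revision stages."""
        nodes = ["START", *tokens, "END"]
        for a, b in zip(nodes, nodes[1:]):
            self.edges[a].append((b, set(stages)))

    def render(self, stage):
        """Walk the graph following only edges valid for one revision stage."""
        out, node = [], "START"
        while node != "END":
            node = next(n for n, s in self.edges[node] if stage in s)
            if node != "END":
                out.append(node)
        return " ".join(out)

g = VersionGraph()
g.add_path(["a", "hasty", "first", "reading"], {"draft"})       # invented wording
g.add_path(["a", "revised", "reading"], {"fair_copy"})          # invented wording
print(g.render("draft"))      # a hasty first reading
print(g.render("fair_copy"))  # a revised reading
```

The design choice this illustrates is the one the abstract emphasizes: the versions are not stored as separate texts but generated on demand from one compact, computable structure.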
