Automated Alignment of Medieval Text Versions based on Word Embeddings
Title | Automated Alignment of Medieval Text Versions based on Word Embeddings |
Publication Type | Conference Paper |
Year of Publication | 2019 |
Authors | Meinecke, Christofer, Wrisley, David Joseph, and Jänicke, Stefan |
Conference Name | LEVIA’19: Leipzig Symposium on Visualization in Applications 2019 |
Conference Location | Leipzig |
Keywords | Digital Humanities, Sentence Alignment, Visualization, Word Embedding |
Abstract | Medieval textuality is characterized by instability in text structure and length that varies according to the text tradition. This instability in the versions, otherwise known as “mouvance”, is characterized by dialectal difference, traces of orality, the modification of wording and even the rewriting and rearrangement of large parts of the text. To help humanities scholars in the exploratory analysis of such complex text collections, the visual analytic system iteal was initially proposed. The system aligns similar phrases at the line level on the basis of string similarity and word n-grams. We propose an extension of this system that replaces the parameter-based approach with an automatic one using word embeddings, thereby adding a semantic component. The benefit of the new visualization system is shown through a comparison of different versions of medieval French texts. Additionally, a domain expert compared the parameter-based approach with the approach based on word embeddings to outline the similarities and differences in the alignments. |
URL | https://levia19.vizcovery.de/papers/LEVIA19_paper_6.pdf |
DOI | 10.31219/osf.io/tah3y |