Challenges in Annotating Medieval Latin Charters

Timo Korkiakangas, Marco Passarotti

Research output: Contribution to journalArticleScientificpeer-review

Abstract

No annotation guidelines concerning substandard Latin are presently available. This paper describes an annotation style of substandard Latin that supplements the method designed for standard Latin by the Perseus Latin Dependency Treebank and the Index Thomisticus Treebank. Each word of the corpus can be assigned only one morphological analysis. In our system, the analysis can be either functional or formal. Functional analysis is applied when a form is language-evolutionarily deducible from the corresponding standard Latin form used in the same (semantico )syntactic function (e.g. solidus pro solidos ‘gold coins’ as a direct object: analysis “accusative”). Formal analysis applies when no connection to the functionally required classical form exists (e.g. heredibus pro heredes ‘heirs’ as a subject: analysis “ablative” or “dative”). When running queries on the corpus, the formally analysed forms can be isolated, and percentages of standard and substandard forms can be counted. In addition, further principles concerning syntax and specific morphological issues are introduced.
Original languageEnglish
JournalJournal for Language Technology and Computational Linguistics
Volume26
Issue number2
Pages (from-to)103-114
Number of pages12
Publication statusPublished - Feb 2012
MoE publication typeA1 Journal article-refereed

Fields of Science

  • 6121 Languages
  • 113 Computer and information sciences

Cite this

@article{9b4cb48c450349c5b8eab9ef64432bb2,
title = "Challenges in Annotating Medieval Latin Charters",
abstract = "No annotation guidelines concerning substandard Latin are presently available. This paper describes an annotation style of substandard Latin that supplements the method designed for standard Latin by the Perseus Latin Dependency Treebank and the Index Thomisticus Treebank. Each word of the corpus can be assigned only one morphological analysis. In our system, the analysis can be either functional or formal. Functional analysis is applied when a form is language-evolutionarily deducible from the corresponding standard Latin form used in the same (semantico )syntactic function (e.g. solidus pro solidos ‘gold coins’ as a direct object: analysis “accusative”). Formal analysis applies when no connection to the functionally required classical form exists (e.g. heredibus pro heredes ‘heirs’ as a subject: analysis “ablative” or “dative”). When running queries on the corpus, the formally analysed forms can be isolated, and percentages of standard and substandard forms can be counted. In addition, further principles concerning syntax and specific morphological issues are introduced.",
keywords = "6121 Languages, 113 Computer and information sciences",
author = "Timo Korkiakangas and Marco Passarotti",
year = "2012",
month = "2",
language = "English",
volume = "26",
pages = "103--114",
journal = "Journal for Language Technology and Computational Linguistics",
issn = "2190-6858",
publisher = "Gesellschaft f{\"u}r Sprachtechnologie & Computerlinguistik",
number = "2",

}

Challenges in Annotating Medieval Latin Charters. / Korkiakangas, Timo; Passarotti, Marco.

In: Journal for Language Technology and Computational Linguistics, Vol. 26, No. 2, 02.2012, p. 103-114.

Research output: Contribution to journalArticleScientificpeer-review

TY - JOUR

T1 - Challenges in Annotating Medieval Latin Charters

AU - Korkiakangas, Timo

AU - Passarotti, Marco

PY - 2012/2

Y1 - 2012/2

N2 - No annotation guidelines concerning substandard Latin are presently available. This paper describes an annotation style of substandard Latin that supplements the method designed for standard Latin by the Perseus Latin Dependency Treebank and the Index Thomisticus Treebank. Each word of the corpus can be assigned only one morphological analysis. In our system, the analysis can be either functional or formal. Functional analysis is applied when a form is language-evolutionarily deducible from the corresponding standard Latin form used in the same (semantico )syntactic function (e.g. solidus pro solidos ‘gold coins’ as a direct object: analysis “accusative”). Formal analysis applies when no connection to the functionally required classical form exists (e.g. heredibus pro heredes ‘heirs’ as a subject: analysis “ablative” or “dative”). When running queries on the corpus, the formally analysed forms can be isolated, and percentages of standard and substandard forms can be counted. In addition, further principles concerning syntax and specific morphological issues are introduced.

AB - No annotation guidelines concerning substandard Latin are presently available. This paper describes an annotation style of substandard Latin that supplements the method designed for standard Latin by the Perseus Latin Dependency Treebank and the Index Thomisticus Treebank. Each word of the corpus can be assigned only one morphological analysis. In our system, the analysis can be either functional or formal. Functional analysis is applied when a form is language-evolutionarily deducible from the corresponding standard Latin form used in the same (semantico )syntactic function (e.g. solidus pro solidos ‘gold coins’ as a direct object: analysis “accusative”). Formal analysis applies when no connection to the functionally required classical form exists (e.g. heredibus pro heredes ‘heirs’ as a subject: analysis “ablative” or “dative”). When running queries on the corpus, the formally analysed forms can be isolated, and percentages of standard and substandard forms can be counted. In addition, further principles concerning syntax and specific morphological issues are introduced.

KW - 6121 Languages

KW - 113 Computer and information sciences

M3 - Article

VL - 26

SP - 103

EP - 114

JO - Journal for Language Technology and Computational Linguistics

JF - Journal for Language Technology and Computational Linguistics

SN - 2190-6858

IS - 2

ER -