The Menotec project (2010–2012)

Menotec was an infrastructure project funded by the Norwegian Research Council (2010–2012) with the aim of transcribing and annotating a corpus of Old Norwegian texts. (Announcement by NFR 17 Sept. 2010.) The transcribed and morphologically annotated texts have been, and will continue to be, published in the Medieval Nordic Text Archive, while the syntactically annotated texts have been published in the treebank of the PROIEL project and made accessible through the INESS portal. Funding covered the three years 2010–2012, but the work continues in new contexts.

Transcriptions were made of eight central Old Norwegian law manuscripts (such as Holm perg 34 4to and Upps DG 8 I), and a full linguistic annotation was made of four major Old Norwegian manuscripts: the Old Norwegian Homily Book in AM 619 4to (ca. 1200–1225), the legendary saga of St Olaf in Upps DG 8 II (ca. 1225–1250), Strengleikar in Upps DG 4–7 4to (ca. 1270) and the Law of Magnus the Lawmender in Holm perg 34 4to (ca. 1275–1300). These texts have been annotated morphologically (adding the lemma and the grammatical form of each word) as well as syntactically. The syntactic annotation is based on dependency analysis as developed in the PROIEL project.
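To make the two annotation layers concrete, the following is a minimal sketch in Python of how a single annotated sentence could be represented: each word carries a lemma and a grammatical form (the morphological layer) plus a pointer to its governing word and a relation label (the dependency layer). The class name, field names, relation labels and the example sentence are all illustrative assumptions. The sentence is constructed in normalised spelling and is not taken from the corpus, and the labels are plain English descriptions rather than the tag set actually used in the PROIEL treebank.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Token:
    """One annotated word (hypothetical structure, for illustration only)."""
    idx: int              # 1-based position in the sentence
    form: str             # word form as it appears in the text
    lemma: str            # dictionary headword (morphological layer)
    morph: str            # grammatical form, given here as a free-text label
    head: Optional[int]   # idx of the governing word; None marks the root
    relation: str         # dependency relation to the head (illustrative label)


def root(sentence: List[Token]) -> Token:
    """Return the single root of the dependency tree."""
    roots = [t for t in sentence if t.head is None]
    if len(roots) != 1:
        raise ValueError("a dependency tree needs exactly one root")
    return roots[0]


def dependents(sentence: List[Token], head_idx: int) -> List[Token]:
    """Return the words directly governed by the word at head_idx."""
    return [t for t in sentence if t.head == head_idx]


# Constructed example: "konungr gaf honum sverð" ("the king gave him a sword").
sentence = [
    Token(1, "konungr", "konungr", "noun, nominative singular", 2, "subject"),
    Token(2, "gaf", "gefa", "verb, 3rd person singular past", None, "predicate"),
    Token(3, "honum", "hann", "pronoun, dative singular", 2, "indirect object"),
    Token(4, "sverð", "sverð", "noun, accusative singular", 2, "object"),
]

verb = root(sentence)
for t in dependents(sentence, verb.idx):
    print(f"{t.form} ({t.lemma}) --{t.relation}--> {verb.form}")
```

In a dependency analysis the finite verb is typically the root and the other words attach to it directly or indirectly, which is why the sketch stores only one head per word rather than nested phrases.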

The annotated corpus comprises approximately 200,000 words and will be extended by about 50,000 words or more in the coming years. This is the first project offering a syntactic annotation of Old Norwegian. On the PROIEL site, the Old Norwegian texts will join a central Old Icelandic work, the Poetic Edda in GKS 2365 4to (a manuscript often referred to as Codex Regius). The Eddic poems have been annotated along the same lines as the texts in Menotec. Several other Early Germanic and Romance texts will also be found on the PROIEL site.

The project was led by Christian-Emil Ore at the University of Oslo, and included several other participants at this university: Karl G. Johansson, Anna C. Horn, Signe Laake, Kari Kinn, Dag Haug and Hanne Eckhoff. The University of Bergen was a partner in the project, and at this university Odd Einar Haugen led the work of linguistic annotation. Also from Bergen, Fartein Th. Øverland participated in the project, and from Iceland, Haraldur Bernharðsson and Eirikur Kristjánsson. The name Menotec is not actually an acronym, but is derived from the project name Menota (which in turn is an acronym for the Medieval Nordic Text Archive).

Guidelines for the annotation have been published by Haugen and Øverland and are available in parallel versions in Norwegian and English, Retningslinjer and Guidelines. These guidelines explain the conventions of the morphological and syntactic annotation and also give a pragmatic introduction to dependency analysis for Old Norwegian, covering a wide range of annotation problems which arose during the project.

Further information on the encoding and annotation of Medieval Nordic sources will be found on the site of the Medieval Nordic Text Archive. Since the texts have been transcribed on a diplomatic level, some special characters are needed for the display of the texts. The encoding of these characters follows the recommendations of the Medieval Unicode Font Initiative, and several suitable fonts can be downloaded free of charge from the website of this project.
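As a practical illustration of why a suitable font matters, the sketch below lists the non-ASCII characters in a piece of transcribed text and flags those that fall in the Unicode Private Use Area, which have no standard glyph and therefore display correctly only with a font that assigns them one. It uses Python's standard `unicodedata` module. The function name and the sample string are assumptions made for this example, and the private-use code point U+E000 is an arbitrary stand-in, not a specific character from the recommendation.

```python
import unicodedata


def special_characters(text):
    """List the non-ASCII characters in text, in code point order.

    Each entry is (character, code point, Unicode name, is_private_use).
    Private-use characters have no Unicode name, so a placeholder is used.
    """
    report = []
    for ch in sorted(set(text)):
        if ord(ch) < 128:
            continue  # plain ASCII needs no special font support
        is_private_use = unicodedata.category(ch) == "Co"
        name = unicodedata.name(ch, "<no standard name>")
        report.append((ch, f"U+{ord(ch):04X}", name, is_private_use))
    return report


# Hypothetical sample: ordinary letters, two standard special letters,
# and one arbitrary private-use code point standing in for a font-specific glyph.
sample = "oc þæir" + "\uE000"

for ch, code_point, name, is_private_use in special_characters(sample):
    note = "needs a font that defines this code point" if is_private_use else "standard Unicode"
    print(code_point, name, "-", note)
```

A check of this kind can help decide whether a given transcription will render with an ordinary system font or whether one of the freely available fonts mentioned above is required.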

After the completion of the Menotec project, work on the morphological and syntactic annotation of texts has continued in other contexts. Another important law manuscript, Upps DG 8 I (ca. 1300–1350), is being syntactically annotated by Robert K. Paulsen at the University of Bergen, and the annotation will probably be completed in 2016. In 2015, a fully annotated version of Pamphilus saga in Upps DG 4–7 was published as a student contribution at the University of Bergen, and other texts are scheduled to follow.

 



Created 25.08.2015. Last updated 28.12.2015.