Skip to main navigation Skip to main content Skip to page footer

A CORPUS-BASED VALENCY LEXICON FOR A CONTRASTIVE AND DIACHRONIC STUDY OF ANCIENT & MEDIEVAL LANGUAGES

The research project “A Corpus-based Valency Lexicon for a Contrastive and Diachronic Study of Ancient & Medieval Languages” was hosted at the GlossaContact Laboratory (2024–2025) . 

Objectives:

1. Digitization of diachronic texts

Systematic digitization of texts in Greek (Ancient, Medieval, Modern Greek) and English (Old, Middle, Modern English), as well as other Indo-European languages (e.g., Latin, Sanskrit, Germanic, Romance, and Slavic languages), using the TEI XML standard with an emphasis on encoding aligned with PROIEL-style annotation. The digitization process includes the integration of retranslations (intra- and interlingual) to study language contact and diachronic evolution.

2. PROIEL-based annotation and argument structure analysis

Automatic and semi-automatic annotation of texts using PROIEL and the Penn-Helsinki Parsed Corpora, focusing on argument structure (valency) and lemmatization. The analysis prioritizes the extraction and standardization of argument relations, aiming to document the evolution of verbal valency and syntactic structures, with a particular emphasis on English and Greek.

3. Creation of a diachronic corpus with emphasis on argument structure and retranslations

Development of a diachronic corpus that incorporates retranslations (intra- and interlingual) and focuses on argument structure (valency), facilitating comparative analysis of language change and contact, including the diachrony of Greek, English, and other Indo-European languages. The corpus is supported by metadata that enhances the study of the development of verbal structures and argument relations.

4. Digitization of diachronic texts

Systematic digitization of texts in Greek (Ancient, Medieval, Modern Greek) and English (Old, Middle, Modern English), as well as other Indo-European languages (e.g., Latin, Sanskrit, Germanic, Romance, and Slavic languages), using the TEI XML standard with an emphasis on encoding aligned with PROIEL-style annotation. The digitization process includes the integration of retranslations (intra- and interlingual) to study language contact and diachronic evolution.

5. PROIEL-based annotation and argument structure analysis

Automatic and semi-automatic annotation of texts using PROIEL and the Penn-Helsinki Parsed Corpora, focusing on argument structure (valency) and lemmatization. The analysis prioritizes the extraction and standardization of argument relations, aiming to document the evolution of verbal valency and syntactic structures, with a particular emphasis on English and Greek.

6. Creation of a diachronic corpus with emphasis on argument structure and retranslations

Development of a diachronic corpus that incorporates retranslations (intra- and interlingual) and focuses on argument structure (valency), facilitating comparative analysis of language change and contact, including the diachrony of Greek, English, and other Indo-European languages. The corpus is supported by metadata that enhances the study of the development of verbal structures and argument relations.

Visit our website for news and updates.