Школа 2009
Школа
Организационный комитет
Programos komitetas
Mokyklos veiklos kryptys ir temos
Pagrindinės datos
Registracija ir taikymas
Конкурс
Конкурсные работы
Dalyvio mokestis
Программа школы
Mokyklos medžiaga
Участники школы
Organizacinė Infromacija
Kultūrinė programa
Фотогалерея





Lost Password?
No account yet? Register
We have 3 guests online
RSS-ленты новостей
rss20.gif

Portalo kūrimą rėmė Rusijos humanitarinių mokslų fondas, projektas Nr. 07-04-12140в.

(c) "Informacinės technologijos ir rašytinis palikimas", 2008-2020

The PROIEL corpus’s architecture and annotation schemes. How can we make the most out of scanty data? / Архитектура корпуса PROIEL и схемы аннотирования. Как мы можем оптимально использовать недостаточное количество данных? PDF Print E-mail
Written by: Hanne Martine Eckhoff, Dag Haug   
Воскресенье, 04 Октябрь 2009

 

Материалы к лекции (презентация)

In this lecture we explain the architecture of the PROIEL parallel corpus of New Testament texts. The corpus consists of the original Greek NT and its earliest translations into Indo-European languages: Latin, Gothic, Classical Armenian and Old Church Slavic (OCS). We will use examples mostly from OCS. We will look at file formatting for import, text processing and the many layers of annotation employed in the corpus: for morphology, syntax and information structure. We will also describe our system of free tags that in principle can be used by any scholar for any purpose: we will exemplify this by discussing our animacy tagging.

Practice session. In this practice session we import two chapters from the Gospel of Matthew from the Codex Zographensis and let the participants try out the PROIEL application for annotation in the various layers. 

 
< Prev   Next >