El'Manuscript-09-WS
Workshop
Organizational committee
Program comittee
Topic areas
Basic dates
Registration and request
Competition
Contestworks
Organizational payment
Workshop program
Workshop materials
Workshop participants
Organizational information
Cultural program
Photos





Lost Password?
No account yet? Register
We have 3 guests online
RSS-feed
rss20.gif

The project is supported by the Russian Foundation for Basic Research, project #07-04-12140в

(c) "Information Technologies and Textual Heritage", 2008-2020

Automatic alignment and aspects of using parallel corpora / Автоматическое выравнивание и перспективы использования параллельных корпусов PDF Print E-mail
Written by: Hanne Martine Eckhoff, Dag Haug   
Воскресенье, 04 Октябрь 2009

 

Материалы к лекции (презентация)

For the study of languages such as OCS, that are mainly extant in translation, an aligned parallel corpus is invaluable. In the PROIEL corpus, all the translations are automatically aligned with the Greek original at token level, with a success rate of about 97 %. In this lecture we discuss the automatic token aligner and demonstrate how the token alignments can be used in combination with multiple layers of annotation to do sophisticated contrastive work on the translation languages.

 
< Prev   Next >