|
Проблемы лингвистической разметки и анализа электронных критических изданий текстов письменного письменного наследия в стандарте XML-TEI |
|
|
|
Written by: Алексей Михайлович Лаврентьев
|
Воскресенье, 05 Август 2012 |
Summary. In this paper we consider some problems of automatic linguistic annotation and analysis of textual heritage documents encoded according to the TEI XML guidelines. TEI XML is a popular standard for encoding electronic editions of textual heritage documents as it allows highly customizable semantically-oriented markup independent of a particular platform or software. TEI is aimed at facilitating data exchange and interoperability. However, rich editorial markup including various readings and interpretations at various levels of linguistic hierarchy may be a serious challenge if one wants to apply NLP (natural language processing) tools to such an edition. Based on the example of the Base de Français Médiéval Old French corpus and on the electronic edition of the Queste del saint Graal, we will discuss the solutions to these problems that are implemented in the TXM platform import modules.
|
|
Извлечение и анализ дат произведений в корпусе цитат онлайн-словаря |
|
|
|
Written by: Андрей Анатольевич Крижановский, Луговая Наталья Борисовна, Круглов Василий Михайлович
|
Воскресенье, 05 Август 2012 |
Текст в формате PDF (262.18 kB)
Текст в формате PDF_En (2.07 MB)
Summary. Quantitative evaluation of
quotations in the Russian Wiktionary was performed with the use of the
developed Wiktionary parser. It was found that the number of quotes in the
dictionary grows fast (51.5 thousands in 2011, 62 thousands in 2012). These
quotes were extracted and stored to the database of the machine-readable dictionary.
The tables of the relational database of the machine-readable dictionary
related to the quotations were designed. The histogram of distribution of
quotations of literary works created in different years was built.
|
|
Пермские газеты колчаковского периода: источниковедческий анализ и моделирование информационной системы |
|
|
|
Written by: Динара Амировна Гагарина, Корниенко Сергей Иванович, Масленников Николай Николаевич, Пигалева Светлана Валерьевна
|
Суббота, 04 Август 2012 |
Summary. The
article presents the project devoted to preservation, documentation and
analysis of Perm newspapers of the «Kolchak period» by means of information
technologies. Results of the first stage are given. Conclusions of the source
study are drawn. The information model of the collection, its main objects and
attributes are described and substantiated.
|
|
Проект создания учебного пособия по текстологии и комментированию лингвистических источников для студентов-палеославистов |
|
|
|
Written by: Татьяна Игоревна Афанасьева, Е.А.Кузьмонова, Т.В.Пентковская
|
Суббота, 04 Август 2012 |
Материалы в формате PDF (197.05 kB)
Summary. The present manual on textual criticism is part of the
educational complex “Palaeoslavistica”. It includes a brief theoretical
information on the main sections of the Slavic textual criticism, as well as a
complex set of tasks, aimed at the practical skills of linguistic text-critical
analysis Church Slavonic texts of various subjects. Part of the tasks of this
kind is presented in an interactive form on the website of the Philological
faculty of Lomonosov Moscow State University.
|
|
|
|
<< Start < Prev 1 2 3 4 5 6 7 8 9 10 Next > End >>
|
Results 73 - 81 of 83 |