NLP Resources for a Rare Language Morphological Analyzer: Danish Case
dc.contributor.author | Котов, М.В. | |
dc.date.accessioned | 2018-07-16T23:29:16Z | |
dc.date.available | 2018-07-16T23:29:16Z | |
dc.date.issued | 2017 | |
dc.description | ORCID ID: http://orcid.org/0000-0001-8327-5197 | ru_RU |
dc.description.abstract | The paper discusses the characteristics and practical aspects of application of the natural language processing resources available for developing a rare language morphological analysis solution. The case under consideration reveals the pipeline design needed to prepare the grammatical resources for Danish. Being rare not only in terms of distribution, but also in the amount of natural language resources available, the Danish language represents a significant problem in terms of application of third-party tools to help solve various NLP-related issues. The paper focuses on part-of-speech tagging and lemmatization, typical but indispensable tasks at the pre-processing stage within the framework of developing a morphological analyzer as a custom NLP solution. | ru_RU |
dc.identifier.citation | Kotov M. NLP resources for a rare language morphological analyzer: danish case / Mykhailo Kotov // Computational linguistics andintelligent systems (COLINS 2017) : proceedings of the 1st International conference, Kharkiv, Ukraine, 21 April 2017 / National Technical University «KhPI», Lviv Polytechnic National University. – Kharkiv, 2017. – P. 31–36. – Bibliography: 12 titles. | ru_RU |
dc.identifier.uri | https://ekhnuir.karazin.ua/handle/123456789/14264 | |
dc.language.iso | en | ru_RU |
dc.publisher | National Technical University «KhPI», Lviv Polytechnic National University | ru_RU |
dc.subject | Research Subject Categories::TECHNOLOGY::Information technology | ru_RU |
dc.subject | Research Subject Categories::HUMANITIES and RELIGION::Languages and linguistics | ru_RU |
dc.subject | morphological analyzer, lemmatization, part-of-speech tagging, Hunspell, OpenNLP, Snowball stemmer, SyntaxNet, word-list | ru_RU |
dc.title | NLP Resources for a Rare Language Morphological Analyzer: Danish Case | ru_RU |
dc.type | Article | ru_RU |
Файли
Контейнер файлів
1 - 1 з 1
Вантажиться...
- Назва:
- nlp-resources-for-morph-analyzer_2017.pdf
- Розмір:
- 374.55 KB
- Формат:
- Adobe Portable Document Format
Ліцензійна угода
1 - 1 з 1
Вантажиться...
- Назва:
- license.txt
- Розмір:
- 7.8 KB
- Формат:
- Item-specific license agreed upon to submission
- Опис: