DSpace Repository

Towards Building a Modern Written Tamil Treebank

Show simple item record

dc.contributor.author Parameswari, K.
dc.contributor.author Sarveswaran, K.
dc.date.accessioned 2023-02-07T06:04:55Z
dc.date.available 2023-02-07T06:04:55Z
dc.date.issued 2022
dc.identifier.uri http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/9036
dc.description.abstract In this paper, we describe the creation of a morphosyntactically annotated treebank for modern written Tamil following the Universal Dependencies (UD) framework to support the implementation and evaluation of Tamil dependency parsers. At present, this treebank consists of 534 sentences. This paper discusses unique constructions found in Tamil and explains sub-relations and language-specific relations introduced, apart from outlining the methodology. This carefully annotated treebank can also serve as the benchmark dataset to evaluate Tamil Natural Language Processing (NLP) tools. The treebank will be extended further to cover more complex constructions in Tamil, and annotations will be enriched by incorporating the Enhanced Universal Dependencies scheme. en_US
dc.language.iso en en_US
dc.publisher TLT, SyntaxFest en_US
dc.title Towards Building a Modern Written Tamil Treebank en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record