Abstract:
Sinhalese was formerly called Sinhalese and is the language of Sinhalese. Sinhala has two
forms such as spoken language and written language. Sinhala is very gentle in grammatical
form and rigidity. Sinhala has its own writing system; it is descendants of Brahmi's book in
India. There are many applications for Sinhala typing, Sinhala translation and also although
there are many more tutorials for learning Sinhala. It is really difficult task to check the
grammatical errors of a sentence. Detecting grammatical error is really important while doing
documentations in Sinhala. The aim of this work is to detect grammatical mistakes, if any, of a
given text. Parts of speech tagging is not a new concept. For Sinhala Language also many
researches have been carried out to detect words and tag. In those works, statistical approaches
such as Hidden Markov Model and stochastic models were used. But, in this study, tagging is
used with the use of database built within the system. That type of approach has been used for
Tamil language also. Sinhala Grammar Checker was developed with the influence of that work. in this work, the grammatical errors are found based on the agreement of "Subject" and the "finite verb" with the help of sub tags. as the language consists of many sub tags, we have
identified most important ten sub-tags with the guidance of a Sinhala Language Expert. These
sub-tags play the main role in detecting grammatical mistakes which are due to the
disagreement between subjects and finite verbs with respect to person, number and/or gender,
and tense disagreement between finite verbs and words that indicate tense aspects in the
sentences. In this work, grammar checking for simple sentences is successfully done. But for
complex sentences, some additional tags and methods are to be used. Further, a web based real
time system is intended to be developed to check the grammatical correctness of a context
written in Sinhala language.