Please use this identifier to cite or link to this item: http://repository.kln.ac.lk/handle/123456789/23067
Title: Grammatical error detection and correction model for Sinhala language sentences
Authors: Pabasara, H.M.U.
Jayalal, S.
Keywords: Grammar checking, Hybrid approach, Part-of- speech-tagger
Issue Date: 2020
Publisher: Department of Industrial Management, Faculty of Science, University of Kelaniya, Sri Lanka
Citation: Pabasara, H.M.U., Jayalal, S. (2020). Grammatical error detection and correction model for Sinhala language sentences. In : International Research Conference on Smart Computing and Systems Engineering, 2020. Department of Industrial Management, Faculty of Science, University of Kelaniya, Sri Lanka, p.17.
Abstract: As the national language of Sri Lanka, the greater part of the exercises at most of all the services are completed in Sinhala whereas it is imperative to guarantee the spelling and syntactic accuracy to convey the ideal significance from the perspective of automated materials with the unavailability of resources even though there are enough amount of available materials as hard copy and books. With the high multifaceted nature of the language, it sets aside extensive effort to physically edit the substance of a composed setting. The necessity to overcome this problem has risen numerous years back. But with the complexity of grammar rules in morphologically lavish Sinhala language, the accuracy of the grammar checkers developed so far has been contrastingly lower and thus, to overcome the issue a novel hybrid approach has been introduced. Spell checked Sinhala active sentences being pre-processed, separated nouns and verbs were analyzed with the help of a resourceful part-of- speech-tagger and a morphological analyzer and alongside the sentences were sent through a pattern recognition mechanism to identify its sentence pattern. Then a decision tree-based algorithm has been used to evaluate the verb with the “subject” and output feedback about the correctness of the sentence. To train this decision tree, a dataset consisting of 800 records which included information about 25 predefined grammar rules in Sinhala was used. Finally, the error correction was provided using a machine learning algorithm-based sentence guessing model for the three possible tenses. Conducted research results paved the way to identify the sentence pattern, grammar rules and finally, suggest corrections for identified incorrect grammatical sentences with an acceptable accuracy rate of 88.6 percent which concluded that the proposed hybrid approach was an accurate approach for detecting and correcting grammatical mistakes in Sinhala text.
URI: http://repository.kln.ac.lk/handle/123456789/23067
Appears in Collections:Smart Computing and Systems Engineering - 2020 (SCSE 2020)

Files in This Item:
File Description SizeFormat 
3.pdf894.75 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.