A Study of Sinhala Spelling Error Patterns for Spelling Error Correction

No Thumbnail Available

Date

2015

Journal Title

Journal ISSN

Volume Title

Publisher

Staff Development Center, University of Kelaniya

Abstract

Spelling error detection & correction techniques are used widely by word processing, machine translation, information retrieval and natural language processing systems. Even though it is straight forward to verify a misspelled word by looking up in a word dictionary, it is very hard to suggest the best correction. For a morphologically rich and a complex Indic language like Sinhala a probabilistic method is the best approach for qualifying the best correction for a detected misspelled word over the other existing methods. This research intends to identify & analyze non-word spelling error patterns in Sinhala. A word dictionary will be used to identify the errors and a special software tool will be developed in order to record statistical data regarding the spelling errors of Sinhala documents. This tool will be used by a Sinhala language expert to record data related to spelling errors in a selected sample of documents. Errors will be categorized into different types along with statistical results and will be analyzed. The reasons of language specific error patterns will be discussed and a weight based decision tree format will be derived as an outcome which can be used to find the best correction from a word dictionary to replace a misspelled word.

Description

Keywords

Sinhala, spelling error patterns, spell checking, non-word errors

Citation

Wijekoon, Himesha (2015). A Study of Sinhala Spelling Error Patterns for Spelling Error Correction. In: Research Forum E Proceeding, Staff Development Centre Research Forum, Cycle 14-2015, University of Kelaniya, Kelaniya, pp 11.

Collections

Endorsement

Review

Supplemented By

Referenced By