Text and non-text classification from doctor writing prescription images

Date

2021

Publisher

Faculty of Science, University of Kelaniya, Sri Lanka

Abstract

The classification of text and non-text blocks is an important problem in document analysis. This paper focuses on text and non-text classification in medical prescription images, a step that plays a major role in the subsequent stages of Optical Character Recognition (OCR). The system consists of binarization using Otsu’s method, noise removal using a median filter, skew detection and correction using the Radon transform, segmentation, feature extraction, and text/non-text classification. The prescription image is first segmented into blocks using a run-length smearing algorithm and projection techniques. Each block is then classified by a combination of two techniques: a decision rule based on density features, and binary Support Vector Machines (SVMs) with Histogram of Oriented Gradients (HOG) features. Experiments on a dataset of 50 medical prescription images achieved a classification rate of 92.47% using the decision rule with density features together with SVMs with HOG features.
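
As an illustration of three of the steps named in the abstract (Otsu binarization, run-length smearing, and a density-based decision rule), the following is a minimal sketch in plain Python. It is not the authors' implementation. The function names, the horizontal-only smearing, the `max_gap` parameter, and the density bounds `low` and `high` are all assumptions made for this example; the abstract does not state the actual rule or its thresholds. The median filter, Radon-based skew correction, projection step, and the HOG/SVM classifier are not covered here.

```python
from typing import List

Image = List[List[int]]  # rows of pixel values


def otsu_threshold(gray: Image) -> int:
    """Return the grey level (0-255) that maximizes between-class variance."""
    hist = [0] * 256
    for row in gray:
        for value in row:
            hist[value] += 1
    total = sum(hist)
    sum_all = sum(level * count for level, count in enumerate(hist))
    best_t, best_var = 0, -1.0
    w0 = 0    # number of pixels with value <= t
    sum0 = 0  # sum of their grey levels
    for t in range(256):
        w0 += hist[t]
        if w0 == 0:
            continue
        w1 = total - w0
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        between = w0 * w1 * (m0 - m1) ** 2
        if between > best_var:
            best_var, best_t = between, t
    return best_t


def binarize(gray: Image, threshold: int) -> Image:
    """Map dark pixels (value <= threshold) to 1 (ink), others to 0."""
    return [[1 if v <= threshold else 0 for v in row] for row in gray]


def rlsa_horizontal(binary: Image, max_gap: int) -> Image:
    """Horizontal run-length smearing: fill background runs of length
    <= max_gap that lie between two ink pixels on the same row."""
    out = [row[:] for row in binary]  # work on a copy
    for row in out:
        last_ink = None
        for x, v in enumerate(row):
            if v == 1:
                if last_ink is not None and 0 < x - last_ink - 1 <= max_gap:
                    for k in range(last_ink + 1, x):
                        row[k] = 1
                last_ink = x
    return out


def ink_density(binary: Image, top: int, left: int,
                bottom: int, right: int) -> float:
    """Fraction of ink pixels in the block [top, bottom) x [left, right)."""
    area = (bottom - top) * (right - left)
    if area <= 0:
        return 0.0
    ink = sum(binary[y][x]
              for y in range(top, bottom)
              for x in range(left, right))
    return ink / area


def is_text_block(density: float, low: float = 0.1, high: float = 0.6) -> bool:
    """Hypothetical decision rule: a block is text if its ink density
    falls inside [low, high]. The bounds are illustrative only."""
    return low <= density <= high
```

In such a sketch, a greyscale page would be passed through `otsu_threshold` and `binarize`, smeared with `rlsa_horizontal` so that neighbouring strokes merge into candidate blocks, and each block's `ink_density` would then be checked with `is_text_block`. Blocks that the density rule cannot decide could be handed to a separate classifier, such as the SVM with HOG features that the abstract describes.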

Keywords

SVMs, HOG, Medical prescription image, Text and non-text classification

Citation

Subasinghe, M., Ramanan, M. (2021). Text and non-text classification from doctor writing prescription images. Proceedings of the International Conference on Applied and Pure Sciences (ICAPS 2021-Kelaniya), Volume 1, Faculty of Science, University of Kelaniya, Sri Lanka, p. 74.
