Digital Repository

Implementation of a hardware system to assist illegible using a hidden Markov model based speaker independent, continuous speech recognition system for Sinhala language

Show simple item record

dc.contributor.author Samankula, W.G.D.M.
dc.date.accessioned 2017-03-28T06:00:05Z
dc.date.available 2017-03-28T06:00:05Z
dc.date.issued 2016
dc.identifier.citation Samankula, W.G.D.M. (2016). Implementation of a hardware system to assist illegible using a hidden Markov model based speaker independent, continuous speech recognition system for Sinhala language. M.Phil. Thesis, University of Kelaniya. en_US
dc.identifier.uri http://repository.kln.ac.lk/handle/123456789/16886
dc.description.abstract In this thesis, a speaker independent speech recognition system was built to recognize the continuous Sinhala speech sentences using the toolkit, HTK-3.4. I based on the statistical approach, Hidden Markov Model (HMM). Three hundred sentences were considered for recording. Data recordings were done with 50 males and 50 females and testing was performed by 10 speakers who had and had not participated for the training. The recognized sequence of words are the commands to automate home appliances such as light, television and radio etc., to help people with differently-able to operate equipment. The different feature extraction methods such as Mel Frequency Cepstral Coefficient (MFCC), Perceptual Linear Prediction (PLP), Linear Predictive Coding (LPC), Bark Frequency Cepstral Coefficients (BFCC), Linear Prediction Reflection Coefficients (LPREFC), LPC Cepstral Coefficients (LPCEPSTRA), log mel-filter bank channel outputs (FBANK) and linear mel-filter bank channel outputs (MELSPEC) were used with different number of feature parameters varied between 4 to 12 by adding log energy coefficients, and their first, second and third derivatives in order to find the optimal number of parameters for each method. The context-independent and contextdependent acoustic models: word-internal and cross-word triphones and tied state triphones were developed. Decision tree state clustering method was applied for creating tied state triphones and the optimal threshold values for the outlier threshold (RO) and the threshold controlling clustering termination (TB) were determined to create the phonetic decision tree in order to get the optimal result. Finally, tied state triphone based multiple mixture models were applied with 2 mixture, 4 mixture, 8 mixture, 16 mixture and 32 mixture systems. The comparison of above mentioned approaches is discussed in detail. The speech recognition system was physically implemented to provide access from a PC or laptop, based on Arduino UNO board (ATmega328 microcontroller). The identified command is transferred to the Arduino UNO board through serial communication and then a signal is transmitted using Radio Frequency (RF) to operate an electrical home appliance using a wireless transceiver module. en_US
dc.language.iso en en_US
dc.relation.ispartofseries TH;1307
dc.subject Sinhala speech recognition en_US
dc.subject Hidden Markov Model en_US
dc.subject Feature Extraction en_US
dc.subject Operate electrical appliances en_US
dc.subject Microcontroller en_US
dc.title Implementation of a hardware system to assist illegible using a hidden Markov model based speaker independent, continuous speech recognition system for Sinhala language en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account