Implementation of a hardware system to assist illegible using a hidden Markov model based speaker independent, continuous speech recognition system for Sinhala language

dc.contributor.authorSamankula, W.G.D.M.
dc.date.accessioned2017-03-28T06:00:05Z
dc.date.available2017-03-28T06:00:05Z
dc.date.issued2016
dc.description.abstractIn this thesis, a speaker independent speech recognition system was built to recognize the continuous Sinhala speech sentences using the toolkit, HTK-3.4. I based on the statistical approach, Hidden Markov Model (HMM). Three hundred sentences were considered for recording. Data recordings were done with 50 males and 50 females and testing was performed by 10 speakers who had and had not participated for the training. The recognized sequence of words are the commands to automate home appliances such as light, television and radio etc., to help people with differently-able to operate equipment. The different feature extraction methods such as Mel Frequency Cepstral Coefficient (MFCC), Perceptual Linear Prediction (PLP), Linear Predictive Coding (LPC), Bark Frequency Cepstral Coefficients (BFCC), Linear Prediction Reflection Coefficients (LPREFC), LPC Cepstral Coefficients (LPCEPSTRA), log mel-filter bank channel outputs (FBANK) and linear mel-filter bank channel outputs (MELSPEC) were used with different number of feature parameters varied between 4 to 12 by adding log energy coefficients, and their first, second and third derivatives in order to find the optimal number of parameters for each method. The context-independent and contextdependent acoustic models: word-internal and cross-word triphones and tied state triphones were developed. Decision tree state clustering method was applied for creating tied state triphones and the optimal threshold values for the outlier threshold (RO) and the threshold controlling clustering termination (TB) were determined to create the phonetic decision tree in order to get the optimal result. Finally, tied state triphone based multiple mixture models were applied with 2 mixture, 4 mixture, 8 mixture, 16 mixture and 32 mixture systems. The comparison of above mentioned approaches is discussed in detail. The speech recognition system was physically implemented to provide access from a PC or laptop, based on Arduino UNO board (ATmega328 microcontroller). The identified command is transferred to the Arduino UNO board through serial communication and then a signal is transmitted using Radio Frequency (RF) to operate an electrical home appliance using a wireless transceiver module.en_US
dc.identifier.citationSamankula, W.G.D.M. (2016). Implementation of a hardware system to assist illegible using a hidden Markov model based speaker independent, continuous speech recognition system for Sinhala language. M.Phil. Thesis, University of Kelaniya.en_US
dc.identifier.urihttp://repository.kln.ac.lk/handle/123456789/16886
dc.language.isoenen_US
dc.relation.ispartofseriesTH;1307
dc.subjectSinhala speech recognitionen_US
dc.subjectHidden Markov Modelen_US
dc.subjectFeature Extractionen_US
dc.subjectOperate electrical appliancesen_US
dc.subjectMicrocontrolleren_US
dc.titleImplementation of a hardware system to assist illegible using a hidden Markov model based speaker independent, continuous speech recognition system for Sinhala languageen_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
1307.pdf
Size:
173.22 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: