Browsing by Author "Kumari, H.M.N.S."
Now showing 1 - 2 of 2
- Results Per Page
- Sort Options
Item A Bayesian Approach for Raisin Data Classification(Department of Industrial Management, Faculty of Science, University of Kelaniya Sri Lanka, 2023) Kumari, H.M.N.S.; Nawarathne, U.M.M.P.K.Raisin performs a decisive role in the commodity economy. Recently, low-quality raisin products have been introduced to agricultural markets worldwide. Therefore, it is crucial to identify a suitable classification method to distinguish between varieties of raisins. Previous research has employed various traditional machine learning methods to classify commodities. However, it is challenging to quantify uncertainties through traditional machine learning models. Therefore, this study employed a Bayesian Logistic Regression (BLR) model using seven morphological features of two varieties of raisins grown in Turkey. Initially, different machine learning techniques were employed on data. After that, four priors, such as Jefferys, Laplace, Cauchy, and Gaussian, were considered, and hyperparameters were tuned using the empirical Bayes method. Marginal posterior distributions of the model parameters were estimated, and the convergence of the models was checked. Then, evaluation metrics of the BLR model with different priors were compared to those of machine learning models. According to the results, the BLR model with Gaussian prior produced the highest accuracy of 93%. Finally, it can be concluded that the BLR model with Gaussian prior provides substantially better results when classifying raisin data.Item A Sentiment Analysis of COVID-19 Tweets Data Using Different Word Embedding Techniques(Department of Industrial Management, Faculty of Science, University of Kelaniya Sri Lanka, 2023) Nawarathne, U.M.M.P.K.; Kumari, H.M.N.S.The COVID-19 virus that invaded the world in 2019 caused many casualties while creating enormous mental turmoil among humans. During this pandemic period, humans were confined to prevent the virus from spreading. Due to the isolation, people used social media platforms like Twitter to express their ideas. Therefore, this study analyzed tweets related to COVID-19. Initially, text data processing techniques were employed, and sentiment labels were assigned. Then the data were trained using different machine learning (ML) models such as Multinomial Naïve Bayes (MNB), Random Forest (RF), Support Vector Machine (SVM), Decision Tree (DT), k-Nearest Neighbours (KNN), Logistic Regression (LR), Extreme Gradient Boosting (XGB), and CatBoost (CB). During the training phase, word embedding techniques such as Bag-of-Words (BoW), Term Frequency-Inverse Document Frequency (TF-IDF), Word2Vec, Global Vectors for Word Representation (Glove), Bidirectional Encoder Representations from Transformers (BERT), and Robustly Optimized BERT-Pretraining Approach (RoBERTa) were used, and evaluation metrics such as accuracy, macro average precision, macro average recall, and macro average f1-score were calculated to evaluate these models. According to the results, the CB model, which used the RoBERTa technique, achieved an accuracy of 97%. Therefore, it can be concluded that CB with RoBERTa provides better results when classifying tweet data.