Question paper analysis with Natural Language Processing

Jayakody, J.R.K.C.; Perera, P.L.M.

Please use this identifier to cite or link to this item: http://repository.kln.ac.lk/handle/123456789/13988

Title:	Question paper analysis with Natural Language Processing
Authors:	Jayakody, J.R.K.C.; Perera, P.L.M.
Keywords:	Bloom’s Taxonomy; Natural Language Processing; data mining
Issue Date:	2016
Publisher:	Department of Zoology and Environmental Management, University of Kelaniya, Kelaniya, Sri Lanka.
Citation:	Jayakody, J.R.K.C. and P.L.M. Perera 2016. Question paper analysis with Natural Language Processing. In: Proceedings of the International Symposium on Information and Communication Technology for Sustainable Development, 10-12 August 2016, V.P.A. Weerasinghe and W.M.D.N. Wijeyaratne (Eds.), p. 34, Department of Zoology and Environmental Management, University of Kelaniya, Kelaniya, Sri Lanka. 57 pp.
Abstract:	“Art of Paper Setting” is a popular term in the educational examination process. Because it is an “art”, teachers should be passionate enough to prepare a good question paper that reflects the educational objectives. Paper setting involves several steps, and analysis of the paper is the most important of them, as it is the only indicator of how well the questions align with the intended objectives. Human intelligence can analyze questions easily, but implementing a comparable intelligent system on a computer is a real challenge. The purpose of this research is therefore to build an intelligent computer system that can analyze and classify questions. Bloom’s Taxonomy is a globally recognized standard for classifying cognitive skills, so it was used as the guide for categorizing the questions in question papers. In the analysis phase, natural language processing techniques were used to analyze the raw text. With these techniques, the raw text was first processed, and then meaningful features of the questions, such as verb similarity, stem pattern similarity and stem meaning similarity, were extracted. Next, using machine learning techniques, a model (the brain of the system) was trained on the extracted question features. For model training, several classification algorithms were used: Multinomial Naive Bayes Classifier, Bernoulli Naive Bayes Classifier, Logistic Regression Classifier, Stochastic Gradient Descent Classifier, C-Support Vector Classifier and Linear Support Vector Classifier. The accuracy of each classification algorithm was measured while varying the size of the training data set, and the optimum algorithm was selected for model training. Finally, the model was trained with the optimum algorithm and used to classify unseen questions.
The final model was fine-tuned to reach 80% classification accuracy.
URI:	http://repository.kln.ac.lk/handle/123456789/13988
ISBN:	978-955-4563-83-4
Appears in Collections:	International Symposium on ICT for Sustainable Development (ICTSD 2016)
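
The classification step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is a pure-Python Multinomial Naive Bayes classifier over plain bag-of-words counts, whereas the paper extracts verb similarity, stem pattern similarity and stem meaning similarity. The training questions, the three Bloom's Taxonomy labels shown, and the names `TRAIN`, `tokenize`, `train` and `classify` are all hypothetical and chosen only for illustration.

```python
import math
import re
from collections import Counter, defaultdict

# Hypothetical labelled questions (NOT the paper's data set).
# Labels are three levels of the revised Bloom's Taxonomy.
TRAIN = [
    ("Define the term operating system.", "Remember"),
    ("List the layers of the OSI model.", "Remember"),
    ("State the first law of thermodynamics.", "Remember"),
    ("Explain how a hash table resolves collisions.", "Understand"),
    ("Describe the role of the kernel.", "Understand"),
    ("Summarise the main idea of the passage.", "Understand"),
    ("Calculate the area of the given triangle.", "Apply"),
    ("Solve the equation for x.", "Apply"),
    ("Use the formula to compute the interest.", "Apply"),
]


def tokenize(text):
    """Lower-case the question and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def train(examples):
    """Count documents per class and word occurrences per class."""
    class_docs = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        class_docs[label] += 1
        for tok in tokenize(text):
            word_counts[label][tok] += 1
            vocab.add(tok)
    return class_docs, word_counts, vocab


def classify(model, text):
    """Return the class with the highest log-posterior for the question."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in sorted(class_docs):
        # Log prior: share of training questions carrying this label.
        score = math.log(class_docs[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for tok in tokenize(text):
            if tok not in vocab:
                continue  # words never seen in training are ignored
            # Laplace (add-one) smoothed word likelihood.
            count = word_counts[label][tok] + 1
            score += math.log(count / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TRAIN)
print(classify(model, "Define the term algorithm."))
```

In a fuller experiment of the kind the abstract describes, the same train/classify loop would be repeated for each candidate algorithm and for several training-set sizes, and the accuracy on held-out questions compared.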

Files in This Item:

File	Description	Size	Format
Jayakody, J.R.K.C. and P.L.M. Perera.pdf		208.12 kB	Adobe PDF
