Natural Language Processing (NLP) Techniques to Measure the Syllabus Coverage with the Final Exam Paper

Dharmapriya, A.G.K.S.; Jayakody, J.R.K.C.

Please use this identifier to cite or link to this item: http://repository.kln.ac.lk/handle/123456789/19000

Title:	Natural Language Processing (NLP) Techniques to Measure the Syllabus Coverage with the Final Exam Paper
Authors:	Dharmapriya, A.G.K.S. Jayakody, J.R.K.C.
Keywords:	Natural language processing Cosine similarity with TF-IDF TF-IDF variations Semantic similarit
Issue Date:	2018
Publisher:	3rd International Conference on Advances in Computing and Technology (ICACT ‒ 2018), Faculty of Computing and Technology, University of Kelaniya, Sri Lanka.
Citation:	Dharmapriya, A.G.K.S. and Jayakody, J.R.K.C. (2018). Natural Language Processing (NLP) Techniques to Measure the Syllabus Coverage with the Final Exam Paper. 3rd International Conference on Advances in Computing and Technology (ICACT ‒ 2018), Faculty of Computing and Technology, University of Kelaniya, Sri Lanka. p22.
Abstract:	Exam questions usually play a pivotal role in the education and it is the main assessment technique to evaluate the Intended learning Outcome (ILO). The main Intention of the subject syllabus is to cover the ILO. If the syllabus is not covered effectively through the exam questions, it is a challenging task to identify whether students have acquired and enhance the necessary skills and the knowledge which are specified in the given ILO. Therefore the preparation of the exam papers and its evaluation plays an important role in education to improve the performance of the students. Having appropriate level of exam questions which covers the entire syllabus is a time consuming, tedious and challenging work for the instructors. Therefore this research work was done with the view of setting up effective question paper to measure the depth of the syllabus coverage. Further, Identification of the students’ knowledge level was an output. Natural Language Processing (NLP) techniques such as tokenization, stop word removal, non-alpha numeric word removals and tagging were used to process the contents and questions. NLP with NLTK and cosine similarity with TF-IDF (term frequency inverse document frequency), TF-IDF variations and semantic similarity algorithms were developed to generate a unique set of rules to identify the best syllabus coverage contents for exam questions. Rule based set of logics were developed to classify the exam questions under different syllabus topic. Based on the experiment output, evaluators and instructors can redesign their exam paper. 72 exam questions were used as the initial dataset. Final evaluation was done based on the total value generated from TF-IDF, TF-IDF variations, TF-IDF with cosine similarity and semantic similarity.
URI:	http://repository.kln.ac.lk/handle/123456789/19000
Appears in Collections:	ICACT 2018

Files in This Item:

File	Description	Size	Format
22.pdf		280.14 kB	Adobe PDF	View/Open

Show full item record

DSpace JSPUI

DSpace preserves and enables easy and open access to all types of digital content including text, images, moving images, mpegs and data sets