Digital Repository

A Comparative Evaluation of PDF-to-HTML Conversion Tools

dc.contributor.author Pathirana, Pramodya
dc.contributor.author Silva, Asini
dc.contributor.author Lawrence, Thenuka
dc.contributor.author Weerasinghe, Thushani
dc.contributor.author Abeyweera, Roshan
dc.date.accessioned 2024-01-16T04:51:59Z
dc.date.available 2024-01-16T04:51:59Z
dc.date.issued 2023
dc.identifier.citation Pathirana Pramodya; Silva Asini; Lawrence Thenuka; Weerasinghe Thushani; Abeyweera Roshan (2023), A Comparative Evaluation of PDF-to-HTML Conversion Tools, International Research Conference on Smart Computing and Systems Engineering (SCSE 2023), Department of Industrial Management, Faculty of Science, University of Kelaniya, Sri Lanka. Page 24 en_US
dc.identifier.uri http://repository.kln.ac.lk/handle/123456789/27362
dc.description.abstract PDF (Portable Document Format) is a popular file format used for sharing and storing documents across different platforms. However, there are occasions when the content of a PDF document needs to be repurposed for online use. PDF-to-HTML conversion is a common method used to achieve this goal. This research paper presents a comparative evaluation of existing PDF-to-HTML conversion tools for their suitability in extracting text and images. The tools were tested using school textbooks in Sri Lanka, which contain complex text formatting and non-textual elements. The evaluation was based on criteria such as the accuracy of the output and the handling of complex text formatting and non-textual elements. Comparisons were drawn based on the performance of each tool with respect to these criteria. The study provides useful insights for individuals and organizations looking to repurpose PDF content for online use in the HTML format, particularly in the education sector. en_US
dc.publisher Department of Industrial Management, Faculty of Science, University of Kelaniya, Sri Lanka en_US
dc.subject e-learning, educational design research, text extraction, PDF to HTML conversion en_US
dc.title A Comparative Evaluation of PDF-to-HTML Conversion Tools en_US

