Digital Repository

Predicting box office success of movies using sentiment analysis and opinion mining

Show simple item record Basnayake, H. Jayalal, S. 2017-01-05T08:42:40Z 2017-01-05T08:42:40Z 2016
dc.identifier.citation Basnayake, H. and Jayalal, S. 2016. Predicting box office success of movies using sentiment analysis and opinion mining. In Proceedings of the International Research Symposium on Pure and Applied Sciences (IRSPAS 2016), Faculty of Science, University of Kelaniya, Sri Lanka. p 88. en_US
dc.identifier.isbn 978-955-704-008-0
dc.description.abstract Movies and social media come together as a result of people sharing their opinions on social media and movie makers using the same platforms for movie promotions. From movie makers to movie goers, many parties are interested in the success or failure of a movie. Forecasting the success of a movie before its release has been a difficult task for many industry analysts. Since film industry’s unpredictable nature, many analysts have come up with different algorithms and mechanisms to predict the success of a movie. One of the mechanisms to predict the box office success is hype analysis. Hype is one of the factors that drive people to the theatres to watch a new movie. Box office opening of a new movie depends on this hype and it will boost up the total box office collection. Hype can be estimated through social media platforms like Twitter. Twitter can be used as a corpus for sentiment analysis and opinion mining. A movie’s success cannot be predicted in a high accurate level solely based on social factors. Classical factors like movie’s brand name, cast, director, etc. are also important aspects in movie’s performance at box office and should be considered as well. However, a highly accurate method for movie box office prediction integrating both social and classical factors is yet to be introduced for this research area. In this study, tweets related to the particular movie before releasing are collected using an archiver tool and are used as input data. Then the collected data is preprocessed in order to get a clean dataset. As a part of sentiment analysis and opinion mining, feature selection is performed using N-gram method in order to filter out irrelevant data records and unlike Bag of words method, this does not require an extensive dictionary of words since it uses combinations of words and letters. Afterwards the data related to classical factors are integrated with the proposed formula in order to predict the opening box office collection of the movie. The proposed formula is an extension of a formula used in a previous research and the new extension represent the inclusion of classical factors. Finally, the results are compared with actual box office data and the previous formula results in order to compare and determine the level of accuracy. Based on initial results, the proposed formula showed of an accuracy level more than 85 percent when the results were compared with actual box office data. Even though it produced a higher accuracy level, the results produced were less than the actual box office values. Thus further testing is needed to determine the actual accuracy level. en_US
dc.language.iso en en_US
dc.publisher Faculty of Science, University of Kelaniya, Sri Lanka en_US
dc.subject Movies en_US
dc.subject Box office en_US
dc.subject Sentiment analysis en_US
dc.subject Opinion mining en_US
dc.subject Social media en_US
dc.title Predicting box office success of movies using sentiment analysis and opinion mining en_US
dc.type Article en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


My Account