A semantic similarity search engine for movies

Ahmad Mustafa, Hammam Mheidat, Adam Shatnawi

Abstract


Semantic similarity has been gaining traction in the field of natural language processing. It is a measure of how similar two pieces of text are in terms of their meaning. It can be used to improve search engine results. We propose a deep learning-based approach to build a semantic similarity search engine for movies based on a movie summary. Filmmakers can gain insight into audience preferences and trends, allowing them to create more engaging and successful films. The dataset used in this study was gathered from internet movie database (IMDb), it includes movie summaries along with their corresponding name movies. The test dataset was generated using ChatGPT to be very close to human input. The universal sentence encoder (USE) model presented promising results in semantic similarity, the model results show that for the top 5 similar movies, the model returned 176 out of 300 movies (58.6%). For the top 10 similar movies, the model returned 211 out of 300 movies (70.3%). Additionally, for the top 15 similar movies, the model returned 229 out of 300 movies (76.3%). And, for the top 20 similar movies, the model returned 249 of 300 movies (83%). This method can be applied to movie recommendation systems or to organize films in a collection automatically.

Keywords


Deep learning; Machine learning; Natural language processing; Recommender system; Search engine; Semantic similarity

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v14i6.pp7137-7144

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).