Arabic tweeps dialect prediction based on machine learning approach

Khaled Alrifai, Ghaida Rebdawi, Nada Ghneim

Abstract


In this paper, we present our approach for profiling Arabic authors on twitter, based on their tweets. We consider here the dialect of an Arabic author as an important trait to be predicted. For this purpose, many indicators, feature vectors and machine learning-based classifiers were implemented. The results of these classifiers were compared to find out the best dialect prediction model. The best dialect prediction model was obtained using random forest classifier with full forms and their stems as feature vector.

Keywords


Arabic dialects detection; author profiling; machine learning; social media analysis; text mining;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v11i2.pp1627-1633

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).