Amharic event text classification from social media using hybrid deep learning

Amogne Andualem Ayalew, Melaku Lake Tegegne, Bommy Manivannan, Tamilarasi Suresh, Napa Komal Kumar, Battula Krishna Prasad, Tsehay Admassu Assegie, Ayodeji Olalekan Salau

Abstract


This study aims to develop a hybrid deep-learning model for detecting and classifying Amharic text. Various natural language applications, such as information extraction, event extraction, conversation, text summarization, and require an automatic event classification. However, existing studies focused on classification, giving little attention to the preprocessing and feature extraction techniques. To address this problem, this work proposed a hybridized deep learning-based Amharic social media text event classification model. The model consists of word-to-vector (Word2vecv) word embedding techniques to capture the semantic and syntactic representation. Convolutional neural network (CNN) is used to extract short-length text features. Additionally, bidirectional long-short memory (Bi-LSTM) is used to extract features from long Amharic sentences and classify those events based on their classes. The dataset used for training and testing consists of 6,740 labeled Amharic text sentences, collected from social media. The result shows an accuracy of 94.8% in detecting and classifying Amharic text events.

Keywords


Convolutional neural network; Deep learning; Event classification; Long short-term memory; Word embedding technique

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v15i2.pp2264-2270

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).