A multi-modal framework for improving the accuracy of phishing email detection
Abstract
Phishing emails continue to pose a significant cybersecurity threat, particularly through the increasing use of malicious attachments to evade traditional text-based detection systems. Most existing approaches focus primarily on email content, creating a blind spot in attachment-aware phishing detection. This paper proposes a multi-modal phishing email classification model that integrates email header features, body text analysis, and attachment inspection within an ensemble learning framework. Independent machine learning classifiers are employed for each email component, and a majority voting mechanism is used to determine the final classification decision. The proposed model is evaluated using publicly available email and attachment datasets that are combined to simulate attachment-bearing phishing emails. Experimental results demonstrate strong detection performance across multiple evaluation metrics. Nevertheless, the study acknowledges the limitation of using synthetically paired email bodies and attachments, which may not fully capture real-world semantic relationships. The findings highlight the importance of incorporating attachment-aware analysis into phishing detection systems and provide a foundation for future research on semantic consistency modeling and transformer-based architectures.
Keywords
Attachment-based analysis; Email security; Machine learning; Multi-model detection; Neural network; Phishing email detection
Full Text:
PDFDOI: http://doi.org/10.11591/ijece.v16i3.pp1608-1625
Copyright (c) 2026 Lamees Mohamed Faraj, Sayed Abdel-Gaber, Hanan Fahmy

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578
This journal is published by theĀ Institute of Advanced Engineering and Science (IAES).