Gene set imputation method-based rule for recovering missing data using deep learning approach

Amer Al-Rahayfeh, Saleh Atiewi, Muder Almiani, Ala Mughaid, Abdul Razaque, Bilal Abu-Salih, Mohammed Alweshah, Alaa Alrawajfeh

Abstract


Data imputation enhances dataset completeness, enabling accurate analysis and informed decision-making across various domains. In this research, we propose a novel imputation method, a spectral clustering based on a gene set using adaptive weighted k-nearest neighbor (AWKNN), and an imputation of missing data using a convolutional neural network algorithm for accurate imputed data. In this research, we have considered the Kaggle water quality dataset for the imputation of missing values in water quality monitoring. Data cleaning detects inaccurate data from the dataset by using the median modified Weiner filter (MMWFILT). The normalization technique is based on the Z-score normalization (Z-SN) approach, which improves data organization and management for accurate imputation. Data reduction minimizes unwanted data and the amount of capacity required to store data using an improved kernel correlation filter (IKCF). The characteristics and patterns of data with specific columns are analyzed using enhanced principal component analysis (EPCA) to reduce overfitting. The dataset is classified into complete data and missing data using the light- DenseNet (LIGHT DN) approach. Results show the proposed outperforms traditional techniques in recovering missing data while preserving data distribution. Evaluation based on pH concentration, chloramine concentration, sulfate concentration, water level, and accuracy.

Keywords


Convolutional neural network; Data imputation; Kernel correlation filter; Spectral clustering; Water quality monitoring

Full Text:

PDF


DOI: http://doi.org/10.11591/ijece.v15i4.pp4296-4317

Copyright (c) 2025 Amer Al-Rahayfeh, Saleh Atiewi, Muder Almiani, Ala Mughaid, Abdul Razaque, Bilal Abu-Salih, Mohammed Alweshah, Alaa Alrawajfeh

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by theĀ Institute of Advanced Engineering and Science (IAES).