On Usable Speech Detection by Linear Multi-Scale Decomposition for Speaker Identification

Wajdi Ghezaiel, Amel Ben Slimane, Ezzedine Ben Braiek

Abstract


Usable speech is a novel concept for processing co-channel speech data. It aims to extract minimally corrupted speech segments that remain useful for various speech processing systems. In this paper, we focus on co-channel speaker identification (SID). We employ a newly proposed usable speech extraction method based on pitch information obtained from linear multi-scale decomposition by the discrete wavelet transform. The idea is to retain the speech segments in which only one pitch is detected and to remove the others. The detected usable speech is then used as input to the speaker identification system. The system is evaluated on co-channel speech, and the results show a significant improvement in speaker identification across various Target-to-Interferer Ratios (TIR).
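The selection rule described above (keep a segment only when exactly one pitch is detected) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the framing or the wavelet-based pitch detector, so `split_frames`, `select_usable_frames`, and the `count_pitches` callback are hypothetical names, and the detector is left as a placeholder supplied by the caller.

```python
from typing import Callable, List, Sequence

Frame = Sequence[float]


def split_frames(signal: Sequence[float], frame_len: int) -> List[Frame]:
    """Cut a signal into non-overlapping frames, dropping any short tail.

    Non-overlapping framing is an assumption made for brevity.
    """
    last_start = len(signal) - frame_len
    return [signal[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


def select_usable_frames(
    frames: List[Frame],
    count_pitches: Callable[[Frame], int],
) -> List[Frame]:
    """Keep only the frames in which exactly one pitch is detected.

    `count_pitches` is a hypothetical stand-in for a pitch detector, for
    example one operating on discrete wavelet transform coefficients.
    """
    return [frame for frame in frames if count_pitches(frame) == 1]


if __name__ == "__main__":
    # Toy signal and a stub detector, used only to exercise the selection rule.
    toy_signal = list(range(10))
    toy_frames = split_frames(toy_signal, frame_len=4)
    stub_detector = lambda frame: 1 if frame[0] == 0 else 2
    print(select_usable_frames(toy_frames, stub_detector))
```

In a full system, the retained frames would be concatenated and passed to the speaker identification stage, while frames with zero or several detected pitches would be discarded.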


Keywords


Co-channel speech; Usable speech; Multi-scale decomposition; Discrete wavelet transform; Speaker identification


DOI: http://doi.org/10.11591/ijece.v6i6.pp2766-2772

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).