Smart chatbot for surveys by convolutional networks speech recognition
Abstract
This paper details the development of an innovative voice chatbot interface specifically designed for evaluating user options using a Likert scale by color. The core of this interface is designing a convolutional neural network architecture, which has been trained with MEL spectrogram inputs from seven possible words for each answer. These spectrograms are crucial in capturing the audio features necessary for effective voice recognition and establishing the interactions that occur between the chatbot and the user, allowing the convolutional network to learn and distinguish between different types of user responses accurately. During the training phase, the convolutional neural network achieved an accuracy rate of 91.4%, indicating its robust performance in processing and interpreting voice commands. The interface was tested in a controlled environment, with a group of ten users and a survey of 5 questions, where it achieved a perfect detection accuracy of 100%. The results demonstrate the system's capacity for natural user interaction by voice and employing a free text to speech (TTS) algorithm for the chatbot voice.
Keywords
Chatbot; Convolutional network; Database; Deep learning; Voice selection
Full Text:
PDFDOI: http://doi.org/10.11591/ijece.v15i3.pp3410-3417
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578
This journal is published by the Institute of Advanced Engineering and Science (IAES).