Smart chatbot for surveys by convolutional networks speech recognition

Robinson Jimenez-Moreno, Javier Eduardo Martínez Baquero, Luis Alfredo Rodriguez Umaña

Abstract


This paper details the development of a voice chatbot interface designed for evaluating user options on a color-based Likert scale. The core of the interface is a convolutional neural network architecture trained on mel spectrograms of seven possible words for each answer. These spectrograms capture the audio features needed for voice recognition and for the interactions between the chatbot and the user, allowing the network to learn and distinguish accurately between different types of user responses. During training, the convolutional neural network achieved an accuracy of 91.4%. The interface was tested in a controlled environment with a group of ten users and a five-question survey, where it achieved a detection accuracy of 100%. The results demonstrate the system's capacity for natural voice interaction with the user. The chatbot's own voice is produced by a free text-to-speech (TTS) algorithm.
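As a rough illustration of the mel scale behind the spectrogram inputs mentioned in the abstract, the sketch below computes band-edge frequencies equally spaced in mels. It is only a sketch under stated assumptions: the abstract does not say which mel formula, number of bands, or sampling rate the authors used. The HTK-style conversion formula, the 40 bands, the 8 kHz upper limit, and all function names here are illustrative choices, not the paper's configuration.

```python
import math


def hz_to_mel(freq_hz):
    """Convert Hz to mels using the HTK-style formula (an assumed variant)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_edges(n_mels, f_min, f_max):
    """Return n_mels + 2 band-edge frequencies (Hz), equally spaced in mels.

    Consecutive triples of edges define the triangular filters that turn a
    linear-frequency spectrogram into a mel spectrogram.
    """
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_mels + 1)
    return [mel_to_hz(lo + i * step) for i in range(n_mels + 2)]


# Hypothetical parameters: 40 bands from 0 Hz to 8 kHz
# (the Nyquist frequency of 16 kHz audio); not taken from the paper.
edges = mel_band_edges(n_mels=40, f_min=0.0, f_max=8000.0)
```

The bands are narrow at low frequencies and widen toward high frequencies, which is why mel spectrograms give finer resolution in the range where most speech energy lies.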

Keywords


Chatbot; Convolutional network; Database; Deep learning; Voice selection


DOI: http://doi.org/10.11591/ijece.v15i3.pp3410-3417

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

International Journal of Electrical and Computer Engineering (IJECE)
p-ISSN 2088-8708, e-ISSN 2722-2578

This journal is published by the Institute of Advanced Engineering and Science (IAES).