A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification

Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly...

Full description

Saved in:
Bibliographic Details
Main Authors: Vinayak Ravi Joshi (Author), Kathiravan Srinivasan (Author), P. M. Durai Raj Vincent (Author), Venkatesan Rajinikanth (Author), Chuan-Yu Chang (Author)
Format: Book
Published: Frontiers Media S.A., 2022-03-01T00:00:00Z.
Subjects:
Online Access:Connect to this object online.
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 doaj_e9f3ba8b8a0547ab8682e85b240a8f70
042 |a dc 
100 1 0 |a Vinayak Ravi Joshi  |e author 
700 1 0 |a Kathiravan Srinivasan  |e author 
700 1 0 |a P. M. Durai Raj Vincent  |e author 
700 1 0 |a Venkatesan Rajinikanth  |e author 
700 1 0 |a Chuan-Yu Chang  |e author 
700 1 0 |a Chuan-Yu Chang  |e author 
245 0 0 |a A Multistage Heterogeneous Stacking Ensemble Model for Augmented Infant Cry Classification 
260 |b Frontiers Media S.A.,   |c 2022-03-01T00:00:00Z. 
500 |a 2296-2565 
500 |a 10.3389/fpubh.2022.819865 
520 |a Understanding the reason for an infant's cry is the most difficult thing for parents. There might be various reasons behind the baby's cry. It may be due to hunger, pain, sleep, or diaper-related problems. The key concept behind identifying the reason behind the infant's cry is mainly based on the varying patterns of the crying audio. The audio file comprises many features, which are highly important in classifying the results. It is important to convert the audio signals into the required spectrograms. In this article, we are trying to find efficient solutions to the problem of predicting the reason behind an infant's cry. In this article, we have used the Mel-frequency cepstral coefficients algorithm to generate the spectrograms and analyzed the varying feature vectors. We then came up with two approaches to obtain the experimental results. In the first approach, we used the Convolution Neural network (CNN) variants like VGG16 and YOLOv4 to classify the infant cry signals. In the second approach, a multistage heterogeneous stacking ensemble model was used for infant cry classification. Its major advantage was the inclusion of various advanced boosting algorithms at various levels. The proposed multistage heterogeneous stacking ensemble model had the edge over the other neural network models, especially in terms of overall performance and computing power. Finally, after many comparisons, the proposed model revealed the virtuoso performance and a mean classification accuracy of up to 93.7%. 
546 |a EN 
690 |a baby cry 
690 |a feature vectors 
690 |a MFCC 
690 |a spectrograms 
690 |a stack-based algorithms 
690 |a Public aspects of medicine 
690 |a RA1-1270 
655 7 |a article  |2 local 
786 0 |n Frontiers in Public Health, Vol 10 (2022) 
787 0 |n https://www.frontiersin.org/articles/10.3389/fpubh.2022.819865/full 
787 0 |n https://doaj.org/toc/2296-2565 
856 4 1 |u https://doaj.org/article/e9f3ba8b8a0547ab8682e85b240a8f70  |z Connect to this object online.