The optimal performance of multi-layer neural network for speaker-independent isolated spoken Malay parliamentary speech / Noraini Seman Zainab Abu Bakar ...[et al.]

This paper describes speech recognizer modeling techniques which are suited to high performance and robust isolated word recognition in speaker-independent manner. In this study, a speech recognition system is presented, specifically for an isolated spoken Malay word recognizer which uses spontaneou...

Full description

Saved in:
Bibliographic Details
Main Authors: Seman, Noraini (Author), Abu Bakar, Zainab (Author), Abu Bakar, Nordin (Author), Mohamed, Haslizatul Fairuz (Author), Abdullah, Nur Atiqah Sia (Author), Prasanna, Ramakrisnan (Author), Syed Ahmad, Sharifah Mumtazah (Author)
Format: Book
Published: Faculty of Computer and Mathematical Sciences, 2010.
Subjects:
Online Access:Link Metadata
Tags: Add Tag
No Tags, Be the first to tag this record!

MARC

LEADER 00000 am a22000003u 4500
001 repouitm_11106
042 |a dc 
100 1 0 |a Seman, Noraini  |e author 
700 1 0 |a Abu Bakar, Zainab  |e author 
700 1 0 |a Abu Bakar, Nordin  |e author 
700 1 0 |a Mohamed, Haslizatul Fairuz  |e author 
700 1 0 |a Abdullah, Nur Atiqah Sia  |e author 
700 1 0 |a Prasanna, Ramakrisnan  |e author 
700 1 0 |a Syed Ahmad, Sharifah Mumtazah  |e author 
245 0 0 |a The optimal performance of multi-layer neural network for speaker-independent isolated spoken Malay parliamentary speech / Noraini Seman Zainab Abu Bakar ...[et al.] 
260 |b Faculty of Computer and Mathematical Sciences,   |c 2010. 
500 |a https://ir.uitm.edu.my/id/eprint/11106/1/11106.pdf 
520 |a This paper describes speech recognizer modeling techniques which are suited to high performance and robust isolated word recognition in speaker-independent manner. In this study, a speech recognition system is presented, specifically for an isolated spoken Malay word recognizer which uses spontaneous and formal speeches collected from Parliament of Malaysia. Currently the vocabulary is limited to ten words that can be pronounced exactly as it written and control the distribution of the vocalic segments. The speech segmentation task is achieved by adopted energy based parameter and zero crossing rate measure with modification to better locates the beginning and ending points of speech from the spoken words. The training and recognition processes are realized by using Multi-layer Perceptron (MLP) Neural Networks with two-layer feedforward network configurations that are trained with stochastic error back-propagation to adjust its weights and biases after presentation of every training data. The Mel-frequency Cepstral Coefficients (MFCCs) has been chosen as speech extraction approach from each segmented utterance as characteristic features for the word recognizer. The MLP performance to determine the optimal cepstral orders and hidden neurons numbers are analyzed. Recognition results showed that the performance of the two-layer network increased as the numbers of hidden neurons increased. Experimental result also showed that the cepstral orders of 12 to 14 were appropriate for the speech feature extraction for the data in this study. 
546 |a en 
690 |a Neural networks (Computer science) 
655 7 |a Article  |2 local 
655 7 |a PeerReviewed  |2 local 
787 0 |n https://ir.uitm.edu.my/id/eprint/11106/ 
787 0 |n https://mjoc.uitm.edu.my/ 
856 4 1 |u https://ir.uitm.edu.my/id/eprint/11106/  |z Link Metadata