Automatic speech annotation based on Enhanced Wavelet Packets Best Tree Encoding (EWPBTE) feature
Date
2016
Type
Conference Paper
Publisher
Institute of Electrical and Electronics Engineers Inc.
Series Info
International Conference on Electrical, Electronics, and Optimization Techniques, ICEEOT 2016
Abstract
This paper introduces a fully automated Arabic phone recognition system based on the 15-point Enhanced Wavelet Packets Best Tree Encoding (EWPBTE) speech feature. WPBTE is enhanced by adding an energy component; the enhancement is implemented in Matlab and improves recognizer accuracy by about 65 percentage points, which is the main contribution of this paper. EWPBTE is used to find phoneme boundaries along a speech utterance. Hidden Markov Models (HMM) and Gaussian mixtures are used to build the statistical models, and the HMM Tool Kit (HTK) software is used to implement them. The system identifies spoken phones with a recognition rate of 57.01% using Mel Frequency Cepstral Coefficients (MFCC), 21.07% using WPBTE, and 86.23% using EWPBTE. The proposed EWPBTE vector has 15 components, compared with 39 for MFCC, which makes it a promising feature vector for further research and development. © 2016 IEEE.
Description
Scopus
Keywords
Accuracy, Components, Gaussian mixtures, Phone, Recognition rate, Character recognition, Encoding (symbols), Forestry, Hidden Markov models, Markov processes, MATLAB, Telephone sets, Trellis codes, Development phase, Energy components, Mel-frequency cepstral coefficients, Phone recognition, Speech recognition