A Modified Hilbert Analysis Method to Improve Voice Stress Analysis Systems

Loading...
Thumbnail Image

Date

2019-05

Journal Title

Journal ISSN

Volume Title

Type

Article

Publisher

International Journal of Computer Applications (0975 – 8887)

Series Info

International Journal of Computer Applications (0975 – 8887);Volume 178 – No. 12, May 2019

Doi

Abstract

Analyzing the cognitive load generated in the brain is the most important issue for specific applications such as voice stress analysis (VSA) systems in which the detection of stressed speech caused by an act of deception under law enforcement interview questioning or military interrogation. The most widely used algorithm for VSA systems is the empirical mode decomposition (EMD). Currently EMD that uses the cubic spline interpolation technique to find the envelopes of the non-periodic signal takes a long processing time, and to achieve accurate results the process is very time consuming and expensive otherwise some tests tend to produce inaccurate results. On the other hand EMD that uses Hilbert analysis method to speed up the process and provide more accurate results, suffer from finding the envelopes of the non-periodic signal. In this paper, a new algorithm is proposed for VSA, named fast Fourier transform (FFT) with a modified Hilbert analysis method (MH) for EMD algorithm, (FTT_MH_EMD), which provides a new technique that modifies the conventional Hilbert analysis method and combines it with the fast Fourier transform algorithm to overcome the previous limitations of using individually the FTT algorithm, the cubic spline interpolation technique or the conventional Hilbert analysis method and that can speed up the processing time and gives accurate results. Simulations and results witness that the proposed algorithm provides higher accuracy than the other attempts and also the processing time has dropped by 10 times faster than those in the products currently available in the market for VSA The advantages of using FFT for VSA are low cost when implemented in products and fast iterations method, and the disadvantage is the low resolution and therefore low accuracy (60%–70%) [1]. On the other hand the advantage of using Mcquiston Ford for VSA is the high accuracy results which are greater than 80%, and the disadvantage are complex algorithms and expensive implementation [5]. An important, barely audible, frequency is produced by the human vocal cords which could measure in a fairly accurate manner if the subject is being deceitful due to the effect the brain’s cognitive load has on the vocal cords [1 and 5]. This frequency is called the micro-tremor frequency generated by the involuntary micro-tremor muscle fibers in the vocal cords. When the subject is being deceitful, more work is induced on the brain which as a result causes tremor muscles to tense up in a non-voluntary manner [1 and 6]. Different non voluntary muscles would tense up including the micro-tremor muscle fibers found on the vocal cords. This induces a higher micro-tremor frequency range notably showing a large quantity of induced cognitive load and therefore a deceitful subject [7]. The micro-tremor frequency of all muscles in the body, including the vocal chords, vibrates in the 8 to 12 Hz range in the normal case without any stress, and that the VSA vendors claim to be the sole source of detecting if an individual is lying or is under stress [1, 4 and 8]. The paper is organized from seven sections EMD, EMD with Hilbert Analysis, previous work and limitations, the proposed algorithm, the proposed VSA system model, results and simulations and finally the conclusion

Description

Keywords

University for Cognitive Load., Empirical Mode Decomposition, Voice Stress Analysis

Citation

Full Text link