Speech stress recognition using semi-eager learning |
| |
Affiliation: | 1. Research Institute for Signals, Systems and Computational Intelligence, sinc(i), FICH-UNL/CONICET, Argentina;2. Department of Electronics and Communication Engineering, Indian Institute of Technology Guwahati, India;1. School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China;2. School of Precision Instruments and Optoelectronics Engineering, Tianjin University, Tianjin 300072, China |
| |
Abstract: | Homo-sapiens suffer from psychogenic pain due to current day lifestyle. According to psychologists, stress is the most destructive form of psychalgia and it is a vicious companion for this species. Immoderate levels of stress may lead to the death of many individuals. Normally, the presence of stress gives rise to certain emotions which can be detected to predict stress levels of a person. This paper proposes the development of mechanized and efficient Speech Emotion Recognition (SER) for stress level analysis. The paper investigates the performance of perceptual based speech features like Revised Perceptual Linear Prediction Coefficients, Bark Frequency Cepstral Coefficients, Perceptual Linear Predictive Cepstrum, Gammatone Frequency Cepstral coefficient, Mel Frequency Cepstral Coefficient, Gammatone Wavelet Cepstral Coefficient and Inverted Mel Frequency Cepstral Coefficients on SER. The novelty of this work involves application of a SemiEager (SemiE) learning algorithm for evaluating auditory cues. SemiE offers advantages over eager and lazy based learning by reducing the computational cost. Stress level recognition being the main objective, the Speech Under Simulated and Actual Stress (SUSAS) benchmark database is used for performance analysis. A comparative analysis is presented to demonstrate the improvement in the SED performance. An overall accuracy of 90.66% recognition of stress related emotions is achieved. |
| |
Keywords: | Speech emotion Revised Perceptual Linear Prediction Coefficient’s (RPLP) Perceptual Linear Predictive Cepstrum (PLPC) Gammatone Frequency Cepstral coefficient (GFCC) SemiEager |
本文献已被 ScienceDirect 等数据库收录! |
|