Advances in Non-Linear Modeling for Speech Processing includes advanced topics in non-linear estimation and modeling techniques along with their applications to speaker recognition. Non-linear aeroacoustic modeling approach is used to estimate the important fine-structure speech events, which are not revealed by the short time Fourier transform (STFT). This aeroacostic modeling approach provides the impetus for the high resolution Teager energy operator (TEO). This operator is characterized by a time resolution that can track rapid signal energy changes within a glottal cycle. The cepstral features like linear prediction cepstral coefficients (LPCC) and mel frequency cepstral coefficients (MFCC) are computed from the magnitude spectrum of the speech frame and the phase spectra is neglected. To overcome the problem of neglecting the phase spectra, the speech production system can be represented as an amplitude modulation-frequency modulation (AM-FM) model. To demodulate the speech signal, to estimation the amplitude envelope and instantaneous frequency components, the energy separation algorithm (ESA) and the Hilbert transform demodulation (HTD) algorithm are discussed. Different features derived using above non-linear modeling techniques are used to develop a speaker identification system. Finally, it is shown that, the fusion of speech production and speech perception mechanisms can lead to a robust feature set.
Gobl, C., Bennett, E., Chasaide, A.N.: Expressive synthesis: how crucial is voice quality? ... 100985, Chicago (2010) Moon, T.K.: Mathematical Methods and Algorithms for Signal Processing (1999) Nordstorm, K.I., Driessen, P.F.: Variable ...
Non-linear oscillators and predictors 3. Higher-order statistics 4. Independent component analysis 5. Nearest neighbors 6. Neural networks 7. Decision trees 8. Non-parametric models 9. Dynamics for non-linear systems 10. Fractal methods 11.
J. Acoustic. Soc. Amer. Vol. 75 (1984) 897-907. Esposito, A., Rampone, S., Stanzione, C., Tagliaferri R.: A Mathematical Model for Speech Processing. In Proceedings of IEEE on Neural Networks for Signal Processing (1992) 194-203 .
This intriguing book constitutes the thoroughly refereed postproceedings of the International Conference on Non-Linear Speech Processing, NOLISP 2007, held in Paris, France, in May 2007.
This book presents recent advances in nonlinear speech processing beyond nonlinear techniques.
This book constitutes the proceedings of the 6th International Conference on Nonlinear Speech Processing, NOLISP 2013, held in Mons, Belgium, in June 2013.
This book constitutes of the major results of the EU COST (European Cooperation in the field of Scientific and Technical Research) Action 277: NSP, Nonlinear Speech Processing, running from April 2001 to June 2005.
[4] M. Bouchard, “Multichannel affine and fast affine projection algorithms for active noise control and acoustic equalization systems,” IEEE Transactions on Speech and Audio Processing, vol. 11, no. 1, pp. 54–60,2003.
Description of broadcast news speech corpus used in studies Description Language Tamil Telugu Hindi Multilingual Number of bulletins 33 20 19 72 News readers (Male:Female) (10:23) (11:9) (6:13) (27:45) Number of bulletins used for ...
This book is a valuable resource for a. The academic research community b. The ICT market c. PhD students and early stage researchers d. Companies, research institutes e.