This book presents a systematic approach to the automatic recognition of simultaneous speech signals using computational auditory scene analysis. Inspired by human auditory perception, this book investigates a range of algorithms and techniques for decomposing multiple speech signals by integrating a spectro-temporal fragment decoder within a statistical search process. The outcome is a comprehensive insight into the mechanisms required if automatic speech recognition is to approach human levels of performance.
Representing speech signals such that specific characteristics of speech are included is essential in many Air Force and DoD signal processing applications.
Designed for beginning users of Dragon Naturally Speaking, this self-paced, self-instructional guide provides the user with all the instruction necessary to become proficient in the use of this popular speech recognition software.
This book is a comprehensive and authoritative guide to voice user interface (VUI) design.
Fisher, D., Soderland, S., McCarthy, J., Feng, F., and Lehnert, W. G. (1995). Description of the UMass system as used for MUC-6. In MUC-6, San Francisco, pp. 127–140. Fisher, W. (1996) tsylb2 software and documentation. Fitt, S. (2002).
Prentice Hall授权
Speech Recognition: The Complete Practical Reference Guide
The Application of Hidden Markov Models in Speech Recognition presents the core architecture of a HMM-based LVCSR system and proceeds to describe the various refinements which are needed to achieve state-of-the-art performance.
PMX() Pqxkn() Remarquons que l'architecture du réseau permet aisément d'introduire l'information concernant le ... Des études empiriques ont montré que, dans le cas de la reconnaissance de la parole, un contexte de 9 trames de 10 ms (c ...
Speaker Perception and Recognition. An Integrative Framework for Computational Speech Processing: An Integrative Framework for Computational Speech Processing