This book is a comprehensive and authoritative guide to voice user interface (VUI) design. The VUI is perhaps the most critical factor in the success of any automated speech recognition (ASR) system, determining whether the user experience will be satisfying or frustrating, or even whether the customer will remain one. This book describes a practical methodology for creating an effective VUI design. The methodology is scientifically based on principles in linguistics, psychology, and language technology, and is illustrated here by examples drawn from the authors' work at Nuance Communications, the market leader in ASR development and deployment. The book begins with an overview of VUI design issues and a description of the technology. The authors then introduce the major phases of their methodology. They first show how to specify requirements and make high-level design decisions during the definition phase. They next cover, in great detail, the design phase, with clear explanations and demonstrations of each design principle and its real-world applications. Finally, they examine problems unique to VUI design in system development, testing, and tuning. Key principles are illustrated with a running sample application. A companion Web site provides audio clips for each example: www.VUIDesign.org The cover photograph depicts the first ASR system, Radio Rex: a toy dog who sits in his house until the sound of his name calls him out. Produced in 1911, Rex was among the few commercial successes in earlier days of speech recognition. Voice User Interface Design reveals the design principles and practices that produce commercial success in an era when effective ASRs are not toys but competitive necessities.
Representing speech signals such that specific characteristics of speech are included is essential in many Air Force and DoD signal processing applications.
Designed for beginning users of Dragon Naturally Speaking, this self-paced, self-instructional guide provides the user with all the instruction necessary to become proficient in the use of this popular speech recognition software.
Fisher, D., Soderland, S., McCarthy, J., Feng, F., and Lehnert, W. G. (1995). Description of the UMass system as used for MUC-6. In MUC-6, San Francisco, pp. 127–140. Fisher, W. (1996) tsylb2 software and documentation. Fitt, S. (2002).
Prentice Hall授权
Speech Recognition: The Complete Practical Reference Guide
The Application of Hidden Markov Models in Speech Recognition presents the core architecture of a HMM-based LVCSR system and proceeds to describe the various refinements which are needed to achieve state-of-the-art performance.
PMX() Pqxkn() Remarquons que l'architecture du réseau permet aisément d'introduire l'information concernant le ... Des études empiriques ont montré que, dans le cas de la reconnaissance de la parole, un contexte de 9 trames de 10 ms (c ...
This book presents a systematic approach to the automatic recognition of simultaneous speech signals using computational auditory scene analysis.
Speaker Perception and Recognition. An Integrative Framework for Computational Speech Processing: An Integrative Framework for Computational Speech Processing
Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition