Lip Tracking for Audio-visual Speech Recognition

ISBN-10: 1423582446
ISBN-13: 9781423582441
Category: Artificial intelligence
Pages: 320
Language: English
Published: 1997
Author: Robert August Kaucic

Description

Human speech is conveyed through both acoustic and visual channels and is therefore inherently multi-modal. Further, the two channels are largely complementary in that the acoustic signal typically contains information about the manner of articulation while the visual signal embodies knowledge of the place of articulation. This orthogonal nature of the audio and visual components has enticed researchers to develop audio-visual speech recognition systems that have been shown to be robust to acoustic noise. A fundamental requirement of automatic audio-visual speech recognition is the need for real-time tracking; however, this necessity has been largely ignored by the lipreading community. This work presents a new approach for tracking unadorned lips in real time (50 fields/sec). The tracking framework presented combines comprehensive shape and motion models learnt from continuous speech sequences with focused image feature detection methods. Statistical models of the grey-level appearance of the mouth are shown to enable identification of the lip boundary in poorly contrasted grey-level images. The combined armory of these modeling approaches permits robust, real-time tracking of unadorned lips. Isolated-word recognition experiments using dynamic time warping and Hidden Markov Model-based recognizers demonstrate that real-time, contour-based lip tracking can be used to provide robust recognition of degraded speech. In noisy acoustic conditions, the performance of recognizers incorporating visual shape parameters is superior to that of the acoustic-only solutions, with error rate reductions of up to 44%.

Similar books

AI苏醒: 科幻电影的思想实验室
By 张鹏
Artificial Intelligence: Tools, Techniques, and Applications
By B. Petkoff, W. Bibel
A novel type of vector algebra can be constructed for these ordered pairs of vectors, which are then referred to as dual vectors (Brand, 1947). It is directly analogous to the usual vector algebra, but with a new kind of dot ...
Decision Support Systems for Ecosystem Management: An Evaluation of Existing Systems
By H. Todd Mowrer
Specifically, UTOOLS imports MOSS import/export files produced by MOSS, ARC-INFO, or other GIS systems, ERDAS GIS files, USGS DEMs, ASCII flat-files, and database files produced by Dbase, Rbase, and other DBMS. 30) User-designed inputs: ...
Artificial Intelligence
By Ian Pratt
See, for example, Laird, Rosenbloom and Newell [4] and Newell [8]. These researchers claim that certain empirical results on human acquisition of cognitive skills, such as the so-called power law of learning, can be explained by taking ...
After AI: Strategies to Survive & Thrive
By Derek William Pearson
"After AI: Strategies to Survive & Thrive" examines ways to actively promote the natural human attributes that give us that advantage: the stuff that makes us MORE THAN MACHINES.
Artificial Intelligence: A Modern Approach
By Stuart Russell, Peter Norvig
"Updated edition of popular textbook on Artificial Intelligence. This edition specifically looks at ways of keeping artificial intelligence under control"--
Intelligent Robots and Computer Vision XVI: Algorithms, Techniques, Active Vision, and Materials Handling : 15-17 October, 1997, Pittsburgh, Pennsylvania
By Society of Photo-optical Instrumentation Engineers, David Paul Casasent, National Institute of Standards and Technology
... Motorola Manufacturing Systems, Boynton Beach, Florida. "Good robotic systems can handle these tasks and help Motorola achieve Six Sigma quality." Micromechanical manipulators, molecular robotics, nanorobotics are names ...
Applications of Artificial Intelligence in Engineering V: Manufacture and planning
By John S. Gero
Step A must be first, followed by Step B. Steps C and D go last but they can be in either order. ...
Randomization and Approximation Techniques in Computer Science: Second International Workshop, RANDOM '98, Barcelona, Spain, October 8-10, 1998 : Proceedings
By Michael George Luby, José D. P. Rolim
Associative Engines: Connectionism, Concepts, and Representational Change
By Andy Clark
And what Davies fears is the inability of a system which (prima facie) lacks such recurrent abstractions to meet the demand of strict causal systematicity of inferential transitions. The upshot of it all, according to Davies (1991 ...