Human speech is conveyed through both acoustic and visual channels and is therefore inherently multi-modal. Further, the two channels are largely complementary in that the acoustic signal typically contains information about the manner of articulation while the visual signal embodies knowledge of the place of articulation. This orthogonal nature of the audio and visual components has enticed researchers to develop audio-visual speech recognition systems that have been shown to be robust to acoustic noise. A fundamental requirement of automatic audio-visual speech recognition is the need for real-time tracking; however, this necessity has been largely ignored by the lipreading community. This work presents a new approach for tracking unadorned lips in real time (50 fields/sec). The tracking framework presented combines comprehensive shape and motion models learnt from continuous speech sequences with focused image feature detection methods. Statistical models of the grey-level appearance of the mouth are shown to enable identification of the lip boundary in poorly contrasted grey-level images. The combined armory of the these modeling approaches permits robust, real-time tracking of unadorned lips. Isolated-word recognition experiments using dynamic time warping and Hidden Markov Model-based recognizers demonstrate that real-time, contour-based, lip tracking can be used to provide robust recognition of degraded speech. In noisy acoustic conditions, the performance of recognizers incorporating visual shape parameters are superior to the acoustic-only solutions, providing for error rate reductions up to 44%.
AI苏醒: 科幻电影的思想实验室
A novel type of vector algebra can be constructed for these ordered pairs of vectors , which are then referred to as dual vectors ( Brand , 1947 ) . It is directly analogous to the usual vector algebra , but with a new kind of dot ...
Specifically, UTOOLS imports MOSS import/export files produced by MOSS, ARC-INFO, or other GIS systems, ERDAS GIS files, USGS DEMs, ASCII flat-files, and database files produced by Dbase, Rbase, and other DMBS. 30) User-designed inputs: ...
See, for example, Laird, Rosenbloom and Newell [4] and Newell [8]. These researchers claim that certain empirical results on human acquisition of cognitive skills, such as the so-called power law of learning, can be explained by taking ...
"After AI: Strategies to Survive & Thrive" examines ways to actively promote the natural human attributes that give us that advantage: the stuff that makes us MORE THAN MACHINES.
"Updated edition of popular textbook on Artificial Intelligence. This edition specific looks at ways of keeping artificial intelligence under control"--
... Motorola Manufacturing Systems , Boynton Beach , Florida . “ Good robotic systems can handle these tasks and help Motorola achieve Six Sigma quality . " S 5 Micromechanical manipulators , molecular robotics , nanorobotics are names ...
Step A must be first , followed by Step B. Steps C and D go last but they can be in either order . ... but there Seton A mill sides Ride 6 Second B mull side 1 mill side 6 drill boles Selon drill , lap salplate boles MOODI part on ...
Randomization and Approximation Techniques in Computer Science: Second International Workshop, RANDOM '98, Barcelona, Spain, October 8-10, 1998 : Proceedings
And what Davies fears is the inability of a system which ( prima facie ) lacks such recurrent abstractions to meet the demand of strict causal systematicity of inferential transitions . The upshot of it all , according to Davies ( 1991 ...