Late Integration in Audio-visual Continuous Speech Recognition
A. Verma, T. Faruquie, C. Neti, S. Basu, A. Senior
In proceedings of Automatic Speech Recognition and Understanding,
Colorado, 12-15 December 1999.
Using visual information in speech recognition has been an area of interest because it can significantly improve the speech recognition efficiency in the conditions where audio only recognition suffers due to noisy environment. In this paper, we present a new approach to combine audio and video to improve the robustness of the speech recognition system in the noisy environments. We also compare the results of the new approach with the corresponding results of the approaches proposed earlier in the literature.