PROJECTS
IBM Research Homepage 
 Research Home  >> Audio Visual Speech Technology Group


Audio-Visual Group Publications


G. Potamianos, J. Luettin, C. Neti. Hierarchical discriminant features for audio-visual LVCSR, Submitted to ICASSP, Salt Lake City, May 2001.


J. Luettin, G. Potamianos, C. Neti. Asynchronous stream modeling for large-vocabulary audio-visual speech recognition, Submitted to ICASSP, Salt Lake City, May 2001.


H. Glotin, D. Vergyri, C. Neti, G. Potamianos, J. Luettin. Weighting schemes for audio-visual fusion in speech recognition, Submitted to ICASSP, Salt Lake City, May 2001.


A. Ghosh, A. Verma, and A. Sarkar. Using likelihood L-statistic as confidence measure in audio-visual speech recognition, Submitted to ICASSP, Salt Lake City, May 2001.


G. Potamianos, C. Neti. Stream confidence estimation for audio-visual speech recognition, ICSLP, vol III, pp. 746-749, Beijing, October 2000.


C. Neti, G. Iyengar, G. Potamianos, A. Senior, B. Maison. Perceptual interfaces for information interaction: Joint processing of audio and visual information for human-computer interaction, ICSLP, vol III, pp. 11-14, Beijing, October 2000.


C. Neti, G. Potamianos, J. Luettin, I. Matthews, D. Vergyri, J. Sison, A.Mashari, and J. Zhou, "Audio-Visual Speech Recognition", Final Workshop 2000 Report, Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, MD (Oct. 12, 2000).


G. Potamianos, A. Verma, C. Neti, G. Iyengar, S. Basu. "A cascade image transform for speaker independent automatic speechreading" International Conference on Multimedia and Expo, vol. II, pp. 1097-1100, New York, July-August 2000.


C.Neti, P.deCuetos A.Senior. Audio-visual intent-to-speak detection for human-computer interaction, ICASSP June 5-9 2000, Istanbul, Turkey.


G.Iyengar, C.Neti. Speaker change detection using joint audio-visual statistics, RIAO 12-14 April 2000, Paris, France, Dec. 20, 1999.


C.Neti, B.Maison, A.Senior, G.Iyengar, P.deCuetos, S.Basu, A.Verma. Joint proccessing of audio and visual information for multimedia indexing and human-computer interaction, RIAO April 12-14 2000, Paris, France.


Benoit Maison, Chalapathy Neti, Andrew Senior. Audio-Visual speaker recognition for video broadcast news: some fusion techniques, IEEE Multimedia Signal Processing (MMSP99), Denmark, Sept, 1999.


S. Basu, C. Neti, N. Rajput, A. Senior. L. Subramaniam, A. Verma. Audio-Visual large-vocabulary continous speech recognition in the broadcast news domain, IEEE Multimedia Signal Processing Conference (MMSP99), Denmark, Sept, 1999.


Andrew Senior, Chalapathy Neti, Benoit Masion. On the use of visual information for improving audio-based speaker recognition, Audio-Visual Speech processing conference (AVSP99), Santa Cruz, CA, Aug, 1999.


Chalapathy Neti, Stephane Maes, Mark Lucente and Dragutin Petkovic. Knowledge/Smart Spaces, 1999 DARPA/NSF/NIST Workshop on Research issues in Smart Computing Environments, July 1999.


Chalapathy Neti, Andrew Senior. Audio-Visual speaker recognition for video broadcast news, DARPA HUB4 Workshop, Washington D.C., March 1999.


Ashish Verma, Tanveer Faruquie, C. Neti, Sankar Basu, Andrew Senior. Late Integration in Continuous Audio-Visual Speech Recognition, ASRU, Colorado, 1999.


A.W.Senior. Recognizing faces in broadcast video. IEEE International Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in Real-Time Systems. ICCV 1999.


Jianbo Ma, Chalapathy Neti, Andrew Senior. Pose compensation for bimodal speech recognition. Automatic speech recognition and understanding workshop (ASRU99), Keystone Resort, Colarado, 1999.


S. Basu, E. E. Jan, Mark Lucente and Chalapathy Neti. Beyond Audio-based speech recognition, 1998 NIST/DARPA Workhop on SmartSpaces, Gaithersburg, MD, 1998.

Tanveer A. Faruquie, Chalapathy Neti, Nitendra Rajput, L. Venkata Subramaniam, Ashish Verma, Translingual Visual Speech Synthesis, IBM India Research Lab, India


 Privacy | Legal | Contact | IBM Home | Research Home | Project List | Research Sites | Page Contact