
 |
 |

Audio-Visual Group Publications
G. Potamianos, J. Luettin, C. Neti. Hierarchical discriminant features for audio-visual LVCSR,
Submitted to ICASSP, Salt Lake City, May 2001.
J. Luettin, G. Potamianos, C. Neti. Asynchronous stream modeling for large-vocabulary audio-visual speech
recognition, Submitted to ICASSP, Salt Lake City, May 2001.
H. Glotin, D. Vergyri, C. Neti, G. Potamianos, J. Luettin. Weighting schemes for audio-visual fusion in speech recognition,
Submitted to ICASSP, Salt Lake City, May 2001.
A. Ghosh, A. Verma, and A. Sarkar. Using likelihood L-statistic as confidence measure in audio-visual speech recognition,
Submitted to ICASSP, Salt Lake City, May 2001.
G. Potamianos, C. Neti. Stream confidence estimation for audio-visual speech recognition,
ICSLP, vol III, pp. 746-749, Beijing, October 2000.
C. Neti, G. Iyengar, G. Potamianos, A. Senior, B. Maison. Perceptual interfaces for information interaction: Joint processing of
audio and visual information for human-computer interaction, ICSLP, vol III, pp. 11-14, Beijing, October 2000.
C. Neti, G. Potamianos, J. Luettin, I. Matthews, D. Vergyri, J. Sison, A.Mashari,
and J. Zhou, "Audio-Visual Speech Recognition",
Final Workshop 2000 Report, Center for Language and Speech Processing,
The Johns Hopkins University, Baltimore, MD (Oct. 12, 2000).
G. Potamianos, A. Verma, C. Neti, G. Iyengar, S. Basu. "A cascade image transform
for speaker independent automatic speechreading" International Conference
on Multimedia and Expo, vol. II, pp. 1097-1100, New York, July-August 2000.
C.Neti, P.deCuetos A.Senior. Audio-visual intent-to-speak detection for
human-computer interaction, ICASSP June 5-9 2000, Istanbul, Turkey.
G.Iyengar, C.Neti. Speaker change detection using joint audio-visual
statistics, RIAO 12-14 April 2000, Paris, France, Dec. 20, 1999.
C.Neti, B.Maison, A.Senior, G.Iyengar, P.deCuetos, S.Basu, A.Verma. Joint
proccessing of audio and visual information for multimedia indexing and
human-computer interaction, RIAO April 12-14 2000, Paris, France.
Benoit Maison, Chalapathy Neti, Andrew Senior. Audio-Visual speaker recognition
for video broadcast news: some fusion techniques, IEEE Multimedia
Signal Processing (MMSP99), Denmark, Sept, 1999.
S. Basu, C. Neti, N. Rajput, A. Senior. L. Subramaniam, A. Verma.
Audio-Visual large-vocabulary continous speech recognition in the broadcast
news domain, IEEE Multimedia Signal Processing Conference (MMSP99),
Denmark, Sept, 1999.
Andrew Senior, Chalapathy Neti, Benoit Masion. On the use of visual information
for improving audio-based speaker recognition, Audio-Visual Speech
processing conference (AVSP99), Santa Cruz, CA, Aug, 1999.
Chalapathy Neti, Stephane Maes, Mark Lucente and Dragutin Petkovic. Knowledge/Smart
Spaces, 1999 DARPA/NSF/NIST Workshop on Research issues in Smart
Computing Environments, July 1999.
Chalapathy Neti, Andrew Senior. Audio-Visual speaker recognition for
video broadcast news, DARPA HUB4 Workshop, Washington D.C., March
1999.
Ashish Verma, Tanveer Faruquie, C. Neti, Sankar Basu, Andrew Senior. Late
Integration in Continuous Audio-Visual Speech Recognition, ASRU,
Colorado, 1999.
A.W.Senior. Recognizing faces in broadcast video. IEEE International
Workshop on Recognition, Analysis, and Tracking of Faces and Gestures in
Real-Time Systems. ICCV 1999.
Jianbo Ma, Chalapathy Neti, Andrew Senior. Pose compensation for bimodal speech recognition. Automatic speech recognition
and understanding workshop (ASRU99), Keystone Resort, Colarado, 1999.
S. Basu, E. E. Jan, Mark Lucente and Chalapathy Neti. Beyond Audio-based
speech recognition, 1998 NIST/DARPA Workhop on SmartSpaces, Gaithersburg,
MD, 1998.
Tanveer A. Faruquie, Chalapathy Neti, Nitendra Rajput, L. Venkata Subramaniam, Ashish Verma, Translingual
Visual Speech Synthesis, IBM India Research Lab, India
|
|