G. Iyengar Publications:
- B Ramabhadran, J Huang, U Chaudhari, G Iyengar, HJ Nock, Guess Who's Speaking: Audio Segmentation for the Automated Transcription of Large Spoken Archives
To appear in Eurospeech 2003
- HJ Nock, G Iyengar, C Neti, Speaker Localisation using Audio-Visual Synchrony: An Empirical Study
To appear in CIVR 2003
- G Iyengar, HJ Nock, C Neti, Audio-Visual Synchrony for Detection of Monologues in Video Archives, Proc ICASSP 2003 (Presented at ICME 2003)
- HJ Nock, G Iyengar, C Neti, Issues in Speech-based Retrieval of Video,
Proc ISCA Tutorial Workshop (Multilingual Spoken Document Retrieval) 2003
- HJ Nock, W Adams, G Iyengar, C-Y Lin, M Naphade, A Natsev, C Neti, JR Smith, B Tseng, User-trainable Video Annotation Using Multimodal Cues
To appear in Proc SIGIR 2003
- W.H. Adams, G. Iyengar, C-Y Lin, M.R. Naphade, C. Neti, H.J. Nock, J.R. Smith, Semantic Indexing of Multimedia Content Using Visual, Audio and Text Cues, EURASIP Journal on Applied Signal Processing,
Vol 2003, No 2, Feb 2003
- G. Iyengar, H. Nock, C. Neti, M. Franz, Semantic Indexing of Multimedia using Audio, Text and Visual Cues, Proceedings of ICME 2002, Lausanne, Switzerland, 2002.
- G. Potamianos, C. Neti, G. Iyengar, and E. Helmuth, Large-vocabulary
audio-visual speech recognition by machines and humans, Proc.
Eurospeech, Aalborg, 2001.
- G. Potamianos, C. Neti, G. Iyengar, A.W. Senior, and A. Verma, A cascade
visual front end for speaker independent automatic speechreading,
Int. J. Speech Technology, Vol. 4, pp. 193-208, 2001.
- C. Neti, G. Iyengar, G. Potamianos, A. Senior, B. Maison. Perceptual
interfaces for information interaction: Joint processing of audio and visual
information for human-computer interaction, ICSLP, vol III, pp.
11-14, Beijing, October 2000.
- C. Neti, B. Maison, A. Senior, G. Iyengar, P. deCuetos, S. Basu, A. Verma. Joint
processing of audio and visual information for multimedia indexing and
human-computer interaction, RIAO, April 12-14 2000, Paris,
France.
- G. Iyengar, C. Neti. Speaker change detection
using joint audio-visual statistics, RIAO, 12-14 April 2000,
Paris, France, Dec. 20, 1999.
MULTIMEDIA SIGNAL PROCESSING WORKSHOP, CANNES, OCTOBER 2001
- G. Iyengar, G. Potamianos, C. Neti, T. Faruquie, and A. Verma, Robust
detection of visual ROI for automatic speechreading, Proc. IEEE Work. Multimedia Signal Process., Cannes, 2001.
- G. Iyengar and C. Neti, Detection of faces under shadows and lighting variations, Cannes, 2001.
- G. Iyengar, C. Neti. A vision-based microphone switch for speech intent detection, Recognition, Analysis and Tracking of Face and Gestures in Real Time Systems (RATFG-RTS) Workshop at ICCV 2001 in Vancouver, 13th July 2001.