Photo
Audio Visual Speech Technologies

 

J. Huang Publications:


Audio-Visual Speech Recognition: (AVSR)
  1. Jing Huang and Daniel Povey, "Discriminatively Trained Features Using fMPE for Multi-Stream Audio-Visual Speech Recognition", 9th European Conference on Speech Communication and Technology, 2005.
  2. Jing Huang and Karthik Visweswariah, "Improving Lip-reading with Feature Space Transforms for Multi-Stream Audio-Visual Speech Recognition", 9th European Conference on Speech Communication and Technology, 2005.
  3. Jing Huang, Etienne Marcheret, Karthik Visweswariah, "Rapid Feature Space Speaker Adaptation for Multi-Stream HMM-based Audio-Visual Speech Recognition", IEEE International Conference on Multimedia & Expo, 2005
  4. Jing Huang, G. Potamianos et al, "Audio-visual speech recognition using an infrared headset", Journal of Speech Communication, 2004.
  5. G. Potamianos, C. Neti, J. Huang, J.H. Connell, S. Chu, V. Libal, E. Marcheret, N. Haas, J. Jiang, "Towards practical deployment of audio-visual speech recognition", IEEE International Conference on Acoustics, Speech, and Signal Processing, 2004.
  6. Jing Huang, G. Potamianos, C. Neti, "Improving audio-visual speech recognition with an infrared headset", ISCA Tutorial and Research Workshop on Audio Visual Speech Processing, 2003.

Large-Vocabulary Continuous Speech Recognition (LVCSR)
  1. B. Ramabhadran, J. Huang, U. Chaudrari, G. Iyengar, H. Nock, "Guessing who's speaking: audio segmentation for the automated transcription of large spoken archives", 8th European Conference on Speech Communication and Technology, 2003.
  2. B. Ramabhadran, J. Huang, M. Picheny, "Towards automatic transcription of large spoken archives --- English ASR for the MALACH project", IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003.
  3. J. Huang, V. Goel, R. Gopinath, B. Kingsbury, P. Olsen and K. Visweswariah. "Large vocabulary conversational speech recognition with the extended maximum likelihood linear transformation (EMLLT) model", International Conference on Speech and Language Processing, 2002.
  4. M. Padmanabhan, G. Saon, J. Huang, B. Kingsbury and L. Mangu. "Automatic speech recognition performance on a voicemail transcription task, IEEE Trans. on Speech and Audio Processing, 2002.
  5. J. Huang, B. Kingsbury, L. Mangu, M. Padmanabhan, G. Saon, and G. Zweig. "Recent Improvements in Speech Recognition Performance on large vocabulary conversational speech (Voicemail and SWITCHBOARD)", International Conference on Speech and Language Processing, 2000.
  6. Yuqing Gao, Raimo Bakis, Jing Huang, Bing Xiang. "Multistage Coarticulation Model Combining Articulatory, Formant and Cepstral Features", International Conference on Speech and Language Processing, 2000.
  7. Jing Huang and Mukund Padmanabhan. "A study of adaptation techniques on a voicemail transcription task," 6th European Conference on Speech Communication and Technology, 1999.
  8. Mukund Padmanabhan, Sankar Basu, Jing Huang, George Saon, and Geoffrey Gzweig. "Speech recognition performance on a new voicemail transcription task", 6th European Conference on Speech Communication and Technology, 1999.

Information Extraction:
  1. J. Huang, G. Zweig and M. Padmanabhan. "Extracting Caller Information from Voicemail", Lecture Notes in Computer Science, 2002.
  2. J. Huang and G. Zweig. "Maximum entropy model for punctuation annotation from speech", International Conference on Speech and Language Processing, 2002.
  3. J. Huang, G. Zweig and M. Padmanabhan. "Extracting caller information from voicemail", Annual International ACM SIGIR, Conference on Research and Development in Information Retrieval, 2001.
  4. Jing Huang, Geoffrey Zweig, Mukund Padmanabhan, "Information Extraction from Voicemail", Annual Meeting of the Association for Computational Linguistics, 2001.

Computer Vision:
  1. J. Huang, R. Kumar and R. Zabih. "Automatic hierarchical color image classification", Journal of Applied Signal Processing, EURASIP special issue, 2003.
  2. J. Huang and S. Ravikumar, "Boosting techniques for image classification", Fourth Asian Conference on Computer Vision, 2000.
  3. Jing Huang, S. Ravikumar, Mandar Mitra, Wei-Jing Zhu, and Ramin Zabih. "Spatial color indexing and applications", International Journal of Computer Vision, vol.35, no.3, 1999.
  4. Jing Huang, S. Ravikumar, and Ramin Zabih. "An automatic hierarchical image classification scheme", Proc. 6th ACM Conference on Multimedia, 1998.
  5. Jing Huang, S. Ravikumar, Mandar Mitra, and Wei-Jing Zhu. "Spatial color indexing and applications", Proc. 6th International Conference on Computer Vision, 1998.
  6. Jing Huang, S. Ravikumar, and Mandar Mitra. "Combining supervised learning with color correlograms for content-based image retrieval", Proc. 5th ACM Conference on Multimedia, 1997.
  7. Jing Huang, S. Ravikumar, Mandar Mitra, Wei-Jing Zhu, and Ramin Zabih. "Image indexing using color correlograms", Proc. 16th IEEE Conference on Computer Vision and Pattern Recognition, 1997.
Scientific Computing:
  1. Lizhen Gu and Jing Huang. "Several new finite-difference schemes for initial boundary problems of convection-diffusion equations", Proc. of Computational Mathematics (in Chinese), 1990.
  2. Da-Yong Cai and Jing Huang. "A new algorithm for the eigenvalue problem of matrices", Journal of Computational Mathematics, 7:3, 1989.
Patents:
  1. Jing Huang and Mukund Padmanabhan, "Methods and Apparatus for Fast Adaptation of Band-quantized Speech Decoding System", US patent 6,421641 B1.
  2. Jing Huang, S Ravi Kumar, Mandar Mitra, and Wei-Jing Zhu. "Image Sub-region Query by Using Color Correlograms", US patent 6,430,312 B1.
  3. Jing Huang, S Ravi Kumar, Mandar Mitra, and Wei-Jing Zhu. "Image indexing using color correlograms", US patent number 6,246,790.