Audio Visual Speech Technologies

 

H. Nock Publications:


Journal Papers
  1. HJ Nock, M Ostendorf, Parameter Reduction Schemes for Loosely Coupled HMMs. To appear in Computer Speech and Language 2003.


  2. W.H. Adams, G. Iyengar, C-Y Lin, M.R. Naphade, C. Neti, H.J. Nock, J.R. Smith, Semantic Indexing of Multimedia Content Using Visual, Audio and Text Cues. Eurasip Journal on Applied Signal Processing Vol 2003, No 2, Feb 2003.


  3. H.J. Nock and S.J. Young, Modelling Asynchrony in Automatic Speech Recognition Using Loosely-Coupled HMMs. Cognitive Science (Invited Paper). May-June 2002.


  4. M. Riley, W. Byrne, M. Finke, S. Khudanpur, A. Ljolje, J. McDonough, H. Nock, M. Saraclar, C. Wooters, G. Zavaliagkos, Stochastic Pronunciation Modelling from Hand-Labelled Phonetic Corpora. In Speech Communication. November 1999. 29(2-4). pp 209-224.


  5. M. Saraclar, H.J. Nock, S. Khudanpur, Pronunciation Modeling by Sharing Gaussian Densities Across Phonetic Models. In Computer Speech and Language. April 2000. 14(2). pp 137-160.


Conference Papers

  1. B Ramabhadran, J Huang, U Chaudhari, G Iyengar, HJ Nock, Guess Who's Speaking: Audio Segmentation for the Automated Transcription of Large Spoken Archives. To appear in Eurospeech 2003.


  2. HJ Nock, G Iyengar, C Neti, Speaker Localisation using Audio-Visual Synchrony: An Empirical Study. To appear in CIVR 2003.


  3. G Iyengar, HJ Nock, C Neti, Audio-Visual Synchrony for Detection of Monologues in Video Archives, Proc ICASSP 2003 (Presented at ICME 2003)


  4. HJ Nock, G Iyengar, C Neti, Issues in Speech-based Retrieval of Video, Proc ISCA Tutorial Workshop (Multilingual Spoken Document Retrieval) 2003


  5. HJ Nock, W Adams, G Iyengar, C-Y Lin, M Naphade, A Natsev, C Neti, JR Smith, B Tseng, User-trainable Video Annotation Using Multimodal Cues. To appear in Proc SIGIR 2003.


  6. W.H. Adams, G. Iyengar, C-Y Lin, M.R. Naphade, C. Neti, H.J. Nock, J.R. Smith, Semantic Indexing of Multimedia Content Using Visual, Audio and Text Cues. Eurasip Journal on Applied Signal Processing Vol 2003, No 2, Feb 2003


  7. Alejandro Jaimes, Milind Naphade, Harriet Nock, John R Smith and Belle L Tseng, Context Enhanced Video Understanding. To appear in Proc. SPIE 2003.


  8. H.J. Nock, G. Iyengar, C. Neti, Assessing Face and Speech Consistency for Monologue Detection in Video. In Proc. ACM Multimedia 2002, Juan-les-Pins, France.


  9. Ozgur Cetin, Harriet J. Nock, Katrin Kirchhoff, Jeff Bilmes, Mari Ostendorf, The 2001 GMTK-Based SPINE ASR System. In Proc. of ICSLP 2002, Denver, USA.


  10. G. Iyengar, H. Nock, C. Neti, M. Franz, Semantic Indexing of Multimedia using Audio, Text and Visual Cues. In Proc. of ICME 2002. Lausanne, Switzerland.


  11. H.J. Nock, S.J. Young, A Comparison of Exact and Approximate Algorithms for Decoding and Training Loosely-Coupled HMMs. In Proc. of WISP (Institute of Acoustics) 2001, Stratford-upon-Avon, UK.


  12. H.J. Nock, S.J. Young, Loosely Coupled HMMs for ASR. In Proc of ICSLP 2000, Beijing, China.


  13. M. Saraclar, H.J. Nock, S. Khudanpur, Modeling Pronunciation by Sharing Gaussian Densities Across Phonetic Models. In Proc. of Eurospeech 1999, Budapest, Hungary. pp 515-518.


  14. W. Byrne, M. Finke, S. Khudanpur, J. McDonough, H.J. Nock, M. Riley, M. Saraclar, C. Wooters, G. Zavaliagkos, Pronunciation Modelling Using A Hand-Labelled Corpus for Conversational Speech Recognition. In Proc. of ICASSP 1998, Seattle, USA. pp 313-316.


  15. M. Riley, W. Byrne, M. Finke, S. Khudanpur, A. Ljolje, J. McDonough, H. Nock, M. Saraclar, C. Wooters, G. Zavaliagkos, Stochastic pronunciation modelling from hand-labelled phonetic corpora. In Proc. of ESCA ETRW on Modeling Pronunciation for Automatic Speech Recognition 1998, Nijmegen, The Netherlands. pp 109-116.


  16. H.J. Nock and S.J. Young, Automatic Detection and Correction of Poor Multiword Pronunciations. In Proc. of ESCA ETRW on Modeling Pronunciation for Automatic Speech Recognition 1998, Nijmegen, The Netherlands. pp 85-90.


  17. B. Byrne, M. Finke, S. Khudanpur, J. McDonough, H. Nock, M. Riley, M. Saraclar, C. Wooters, G. Zavaliagkos, Pronunciation Modelling for Conversational Speech Recognition: A Status Report From WS97. In Proc. of IEEE Workshop on Automatic Speech Recognition and Understanding 1997, Santa Barbara, USA. pp 26-33.


  18. H.J. Nock, M.J.F. Gales and S.J. Young, A Comparative Study of Methods for Decision-Tree State Clustering. In Proc. of Eurospeech 1997, Rhodes, Greece. pp 111-114.


Dissertations

  1. H.J. Nock, Techniques for Modelling Phonological Processes in Automatic Speech Recognition, PhD Thesis, Cambridge University Engineering Department. August 2001.


  2. H.J. Nock, Contextual Clustering for Automatic Speech Recognition. MPhil Thesis, Cambridge University Engineering Department. September 1996.


  3. H.J. Nock, A Type Inference System for Interaction Nets. Undergraduate Final Year Project, Cambridge University Computer Lab. July 1994.