Superhuman Speech Recognition - Publications

Superhuman Publications

Acoustic Modeling     Decoding Technology  Bayesian Networks     Language Processing  


Acoustic Modeling
 

Lattice-Based Unsupervised MLLR for Speaker Adaptation
Mukund Padmanabhan, George Saon and Geoffrey Zweig
ASR 2000

Boosting Gaussian Mixtures in an LVCSR System
Geoffrey Zweig and Mukund Padmanabhan
ICASSP 2000

Eliminating Inter-Speaker Variability Prior To Discriminant Transforms
George Saon, Mukund Padmanabhan and Ramesh Gopinath
ASRU 2001

Maximum Likelihood Discriminant Feature Spaces
George Saon, Mukund Padmanabhan, Ramesh Gopinath and Scott Chen
ICASSP 2000

Linear Feature Space Transformations for Speaker Adaptation
George Saon, Geoffrey Zweig and Mukund Padmanabhan
ICASSP 2001

Digit Recognition in Noisy Environments via a Sequential GMM/HMM System
Shai Fine, George Saon and Ramesh Gopinath
ICASSP 2002

Minimum Bayes Error Feature Selection for Continuous Speech Recognition
George Saon and Mukund Padmanabhan
NIPS 2000

Robust Speech Recognition in Noisy Environments: The 2001 IBM SPINE Evaluation System
Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan and Ruhi Sarikaya
ICASSP 2002

A Hybrid HMM/TRAPS Model for Robust Voice Activity Detection
Brian Kingsbury, Pratibha Jain and Andre Adami
ICSLP 2002

Improvements to the IBM Aurora-2 Multi-Condition System
George Saon and Juan Huerta
ICSLP 2002


Decoding Technology 

Exact Alpha-Beta Computation in Logarithmic Space with Application to MAP Word Graph Construction
Geoffrey Zweig and Mukund Padmanabhan
ICSLP 2000

Arc-Minimization in Finite State Decoding Graphs with Cross-Word Context
Geoffrey Zweig, George Saon, and Francois Yvon
ICSLP 2002


Bayesian Networks 

Probabilistic Modeling with Bayesian Networks for Automatic Speech Recognition
Geoffrey Zweig and Stuart Russell
Australian Journal of Intelligent Information Processing 5(4) 1999

Dependency Modeling with Bayesian Networks in a Voicemail Transcription System
Geoffrey Zweig and Mukund Padmanabhan
Eurospeech 1999

The Graphical Models Toolkit: An Open Source Software System for Speech and Time Series Processing
Jeff Bilmes and Geoffrey Zweig
ICASSP 2002

Structurally Discriminative Graphical Models for Automatic Speech Recognition - Results from the 2001 Johns Hopkins Summer Workshop
Geoffrey Zweig, Jeff Bilmes, et al.
In Proceedings of the Fourth International Conference on Automatic Face and Gesture Recognition, Grenoble, France, March 2000


Language Processing 

Information Extraction from Voicemail
Jing Huang, Geoffrey Zweig, and Mukund Padmanabhan
ACL 2002

Extracting Caller Information from Voicemail
Geoffrey Zweig, Jing Huang, and Mukund Padmanabhan
Eurospeech 2001

Maximum Entropy Model for Punctuation Annotation from Speech
Jing Huang and Geoffrey Zweig
ICSLP 2002

Data Driven Approach to Lexicon Design in a Continuous Speech Recognition System
George Saon and Mukund Padmanabhan
IEEE Transactions on Speech and Audio Processing, 2000

 
Last updated: 10/29/02
 