Modeling Consistency in a Speaker Independent Continuous Speech Recognition System (2008)
Yochai Konig, Nelson Morgan, Chuck Wooters, Victor Abrash, Michael Cohen, Horacio Franco
We would like to incorporate speaker-dependent consistencies, such as gender, in an otherwise speaker-independent speech recognition system. In this paper we discuss a Gender Dependent Neural Network...
Integrating Neural Networks Into Computer Speech Recognition Systems (2007)
Michael Cohen, Horacio Franco, Nelson Morgan, David Rumelhart, Victor Abrash, Yochai Konig
In this paper we describe the initial baseline DECIPHER system and the approach for integrating MLP-based estimation techniques; present a number of new techniques that have been developed to allow...
Transition-Based Connectionist Speech Recognition (2007)
Yochai Konig, Hervé Bourlard, Nelson Morgan
In this paper, we introduce REMAP, an approach for the training and estimation of posterior probabilities using a recursive algorithm that is reminiscent of the EM-based Forward-Backward (Liporace...
Discriminative Mixture Weight Estimation For Large Gaussian Mixture Models (1999)
Françoise Beaufays, Mitchel Weintraub, Yochai Konig
This paper describes a new approach to acoustic modeling for large vocabulary continuous speech recognition (LVCSR) systems. Each phone is modeled with a large Gaussian mixture model (GMM) whose...
DYNAMO: An Algorithm for Dynamic Acoustic Modeling (1998)
Françoise Beaufays, Mitch Weintraub, Yochai Konig
This paper summarizes part of SRI's effort to improve acoustic modeling in the context of the Large Vocabulary Continuous Speech Recognition (LVCSR) project. It concentrates on two problems that...
Nonlinear Discriminant Feature Extraction For Robust Text-Independent Speaker Recognition (1998)
Yochai Konig, Larry Heck, Mitch Weintraub, Kemal Sonmez
We study a nonlinear discriminant analysis (NLDA) technique that extracts a speaker-discriminant feature set. Our approach is to train a multilayer perceptron (MLP) to maximize the separation between...
Discriminative Training Of Minimum Cost Speaker Verification Systems (1998)
Larry Heck, Yochai Konig
This paper presents a new training procedure for speaker verification systems. The procedure extends previous speaker verification work by (1) developing a new discriminative a posteriori-based...
Discriminative training of minimum cost speaker verification systems (1998)
This paper presents a new training method for speaker verification systems. The method improves on previous work in the field of verification of the...
Neural-Network Based Measures Of Confidence For Word Recognition (1997)
Mitch Weintraub, Françoise Beaufays, Ze'ev Rivlin, Yochai Konig, Andreas Stolcke
This paper proposes a probabilistic framework to define and evaluate confidence measures for word recognition. We describe a novel method to combine different knowledge sources and estimate the...
Thesis (Ph. D. in Computer Science)--University of California, Berkeley, May 1996.
Remap - Experiments With Speech Recognition (1996)
Yochai Konig, Hervé Bourlard, Nelson Morgan
In this report we present experimental and theoretical results using a framework for training and modeling continuous speech recognition systems based on the theoretically optimal Maximum a...
Transition-based statistical training for ASR (1995)
Nelson Morgan, Yochai Konig, Su-Lin Wu, Hervé Bourlard
It is known that in human speech recognition, the perceptually-dominant and information-rich portions of the speech signal, which may also be the parts with a better...
Transition-Based Statistical Training for ASR (1995)
Nelson Morgan, Yochai Konig, Su-Lin Wu, Hervé Bourlard
It is known that in human speech recognition, the perceptually-dominant and information-rich portions of the speech signal, which may also be the parts with a better chance to withstand...
Hervé Bourlard, Yochai Konig, Nelson Morgan
In this paper, we briefly describe REMAP, an approach for the training and estimation of posterior probabilities, and report its application to speech recognition. REMAP is a recursive algorithm that...
Yochai Konig, Hervé Bourlard, Nelson Morgan
In this paper, we introduce REMAP, an approach for the training and estimation of posterior probabilities using a recursive algorithm that is reminiscent of the EM-based Forward-Backward (Liporace...
Modeling Dynamics In Connectionist Speech Recognition - The Time Index Model (1994)
We are experimenting with an approach to connectionist speech recognition that models the dynamics within a speech segment using temporal position as an explicit variable. Currently, the most common...
"Eigenlips" for Robust Speech Recognition (1994)
Christoph Bregler, Yochai Konig
In this study we improve the performance of a hybrid connectionist speech recognition system by incorporating visual information about the corresponding lip movements. Specifically, we investigate...
GDNN: A Gender-Dependent Neural Network for Continuous Speech Recognition (1992)
Yochai Konig, Nelson Morgan, Claudia Chandra
Conventional speaker-independent speech recognition systems do not consider speaker-dependent parameters in the probability estimation of phonemes. These recognition systems are instead tuned to the...
Victor Abrash, Horacio Franco, Michael Cohen, Nelson Morgan, Yochai Konig
An approach to modeling long-term consistencies in a speech signal within the framework of a hybrid Hidden Markov Model (HMM) / Multilayer Perceptron (MLP) speaker-independent continuous-speech...
Combining Neural Networks And Hidden Markov Models For Continuous Speech Recognition (1992)
Michael Cohen, David Rumelhart, Nelson Morgan, Horacio Franco, Victor Abrash, ...
We present a speaker-independent, continuous-speech recognition system based on a hybrid multilayer perceptron (MLP)/hidden Markov model (HMM). The system combines the advantages of both...
A New Training Algorithm For Hybrid HMM/ANN Speech Recognition Systems
Hervé Bourlard, Yochai Konig, Nelson Morgan, Christophe Ris
In this paper, we briefly describe REMAP, an approach for the training and estimation of posterior probabilities, and report its application to speech recognition. REMAP is a recursive algorithm that...