Speech Recognition Research Papers - Academia.edu
This paper proposes a text-independent automatic speaker verification system using IMFCC (Inverse/Reverse Mel Frequency Coefficients) and IT-EM (Information Theoretic Expectation Maximization). To perform speaker verification, feature... more
    • Computer Science, Speech Recognition
This paper aims to build a Bangla speech sentence recognition system using a Hidden Markov Model (HMM). The system includes two phases: a feature extraction phase to generate speech features from the Bangla speech sentence and a... more
    • Speech Recognition, Hidden Markov Models
A new form of augmentative and alternative communication (AAC) device for people with severe speech impairment, the voice-input voice-output communication aid (VIVOCA), is described. The VIVOCA recognizes the disordered speech of the user... more
    • Speech Recognition, Speech Translation
Most elderly people monitoring systems include the detection of abnormal situations, in particular distress situations, as one of their main goals. In order to reach this objective, many solutions end up combining several modalities such... more
    • Computer Science, Artificial Intelligence, Geriatrics, Speaker Recognition
This paper describes a study on tone statistics of people's names in Mandarin Chinese. The problem arose when we tried to apply an English version of a speech recognizer to a Chinese voice tag dialing task. The questions were:... more
    • Cognitive Science, Computer Science, Statistical Analysis, Speech Recognition
For many audiovisual applications, the integration and synchronization of audio and video signals is essential. The objective of this paper is to develop a system that displays the active objects in the captured video signal, integrated... more
    • Computer Science, Signal Processing, Audio Signal Processing, Signal Integrity
This paper presents an application, "LentInfo", which is a system used to provide information about programmes for the Festival Lent in Slovenia. The Festival Lent consists of different open-air theatre and music performances and raws... more
    • Cognitive Science, Speech Synthesis, Speech Recognition, Linguistics
The estimation of initial language models for new applications of spoken dialogue systems without large task-specific training corpora is becoming an increasingly important issue. This paper investigates two different approaches in which... more
    • Educational Technology, Speech Recognition, Linguistics, Stochastic processes
Vocal communication is most effective when the listener is able to observe the mouth of the speaker. This is especially true for the hearing impaired, and dramatically true for the deaf, who rely on lip-reading for comprehending speech.
    • Animation, Computer Animation, Speech Recognition, Speech Communication
We present an improved system combination technique, iROVER. Our approach obtains significant improvements over ROVER, and is consistently better across varying numbers of component systems. A classifier is trained on features from the... more
    • Speech Recognition
The Sphinx-4 speech recognition system is the latest addition to Carnegie Mellon University's repository of Sphinx speech recognition systems. It has been jointly designed by Carnegie Mellon University, Sun Microsystems Laboratories... more
    • Speech Recognition, System Design, Information Sources, Language Model
This paper proposes several speech technology improvements for increasing robustness, reliability and ergonomics in speech interfaces for controlling aerial vehicles. These improvements consist of including a statistical language model... more
    • Mechanical Engineering, Aerospace Engineering, Ergonomics, Semantics
Life is a blessing from the graces of God that he granted to all His creatures. God has excelled in creating this life in its most beautiful form, making the difference of people and the difference of graces among them a cornerstone for... more
    • Natural Language Processing, Sign Languages, Assistive Technology, Speech Recognition
We present two real-time hidden Markov model-based systems for recognizing sentence-level continuous American Sign Language (ASL) using a single camera to track the user's unadorned hands. The first system observes the user from a desk... more
    • Information Systems, Computer Vision, Sign Language, Pattern Recognition
This paper describes a method to detect smiles and laughter sounds from the video of natural dialogue. A smile is the most common facial expression observed in a dialogue. Detecting a user's smiles and laughter sounds can be useful for... more
    • Face Recognition, User Interface, Speech Recognition, Facial expression
With the advancement of technology, we can implement a variety of ideas to serve mankind in numerous ways. Inspired by this, we have developed a smart hand glove system that will be able to help people with hearing and speech... more
    • Speech Recognition
Speech recognition and speaker identification are important for authentication and verification for security purposes, but they are difficult to achieve. Speaker identification methods can be divided into text-independent and text-dependent.... more
    • Speech Recognition, Neural Network, Speaker Identification, Speech Segmentation
In this paper, we present a mismatch-aware stochastic matching (MASM) algorithm to alleviate the performance degradation under mismatched training and testing conditions. MASM first computes a reliability measure of applying a set of... more
    • Automatic Speech Recognition, Speech Recognition, Stochastic processes, Hidden Markov Models
A methodology and environment for building adaptive speech recognition systems is presented. The development environment is designed for isolated word recognition systems. A small speech recognition system is developed for a home... more
    • Speech Recognition, Word Recognition, Environment and Development
Prosody has been widely used in many speech-related applications including speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech applications. An important application we... more
    • Speech Recognition, Decision Trees, Fundamental Frequency, Word Recognition
Growing needs for French closed-captioning of live TV broadcasts in Canada cannot be met only with stenography-based technology because of a chronic shortage of skilled stenographers. Using speech recognition for live closed-captioning,... more
    • Speech Recognition, Model Updating, Collaborative Work, Real Time
This paper deals with the introduction of an efficient speech front-end for automatic speech recognition. This front-end not only performs well, in comparison to the traditional and widely used MFCC, but is also efficiently implemented in... more
    • Circuits and Systems, Automatic Speech Recognition, Speech Recognition, Speech enhancement
Computer Methods and Programs in Biomedicine 86 (2007) 21-29. Abstract: Personal Digital Assistant devices are becoming frequently used devices for the bedside care of the patient. Ways of... more
    • Biomedical Engineering, Telecommunications, Speech Recognition, Orthopaedics
This paper describes a database of dysarthric speech produced by 19 speakers with cerebral palsy. Speech materials consist of 765 isolated words per speaker: 300 distinct uncommon words and 3 repetitions of digits, computer commands,... more
    • Speech Production, Automatic Speech Recognition, Speech Recognition, Cerebral Palsy
This work presents excerpts of research on applying modified acoustic signal processing methods to the problem of "understanding" selected pathologies of the vocal tract. The presented concept of the... more
    • Speech Recognition, Neural Network, Speech Processing, Signal Analysis
This paper focuses on microphone arrays to realize distant-talking speech recognition in real environments. In distant-talking situations, users can speak at arbitrary positions while moving. Therefore, it is very important for high... more
    • Engineering, Neural Networks, Speech Recognition, Hidden Markov Models
We present an approach to automatically recognize sign language and translate it into a spoken language. A system to address these tasks is created based on state-of-the-art techniques from statistical machine translation, speech... more
    • Technology, Image Processing, Sign Language, Speech Recognition
It is well known that the introduction of acoustic background distortion and the variability resulting from environmentally induced stress cause speech recognition algorithms to fail. In this paper, several causes for recognition... more
    • Cognitive Science, Computer Science, Statistical Analysis, Speech Recognition
We describe a system for model based speech separation which achieves superhuman recognition performance when two talkers speak at similar levels. The system can separate the speech of two speakers from a single channel recording with... more
    • Speech Recognition, Proceedings, Speaker Identification, Temporal Constraints
STRAIGHT, a speech analysis, modification, and synthesis system, is an extension of the classical channel VOCODER that exploits the advantages of progress in information processing technologies and a new conceptualization of the role of... more
    • Mechanical Engineering, Speech Synthesis, Speech perception, Speech Recognition
Extractive speech summarization, which purports to select an indicative set of sentences from a spoken document so as to succinctly represent the most important aspects of the document, has garnered much research over the years. In this... more
    • Information Retrieval, Natural Language Processing, Semantics, Speech Recognition
The main steps of document processing have been reviewed, especially those implemented on Arabic writing. The techniques used in this research, such as Vector Quantization (VQ), Hidden Markov Models (HMM), and Induction of Decision Trees... more
    • Pattern Recognition, Writing, Automatic Speech Recognition, Neural Networks
This paper presents an emerging application of multimodal interface research to distributed applications. We have developed the QuickSet prototype, a pen/voice system running on a hand-held PC, communicating via wireless LAN through an... more
    • Natural Language Processing, Multimodal Interaction, Speech Recognition, Gesture Recognition
This paper considers the problem of constructing an efficient inverted index for the spoken term detection (STD) task. More specifically, we construct a deterministic weighted finite-state transducer storing soft-hits in the form of... more
    • Engineering, Natural Language Processing, Automatic Speech Recognition, Speech Recognition
In this paper, we present a set of optimizations for a spoken language interface for mobile devices that can improve the recognition accuracy and user interaction experience. A comparison between a speech and a graphical interface, when... more
    • Speech Recognition, User eXperience, Mobility, Usability Evaluation
Acoustic signals recorded simultaneously in a reverberant environment can be described as sums of differently convolved sources. The task of source separation is to identify the multiple channels and possibly to invert those in order to... more
    • Engineering, Signal Processing, Automatic Speech Recognition, Speech Recognition
Since 1990 the DRA Speech Research Unit has conducted research into applications of speech recognition technology to speech and language development for young children. This has been done in collaboration with Hereford and Worcester... more
    • Computer Science Education, Animation, Literature, Computer Animation
This paper presents the design of an FPGA-based hardware co-processor, based on the SPHINX 3 speech recognition engine from CMU, capable of performing Acoustic Modeling (AM) for medium-sized vocabularies in real-time. By creating an... more
    • Computer Science, Speech Recognition, Field-Programmable Gate Arrays, Advanced Placement
This paper describes the development and validation of an Embedded Isolated Word Recognition System (IWR) for the Argentinian Spanish language, implemented on the STM32F4-Discovery platform. Its front-end extracts Mel Frequency Cepstral... more
    • Engineering, Embedded Systems, Automatic Speech Recognition, Speech Recognition
The hearing abilities of a group of 30 elderly (67–93 yr of age) subjects were compared with those of a group of 30 young (19–27 yr of age) normal-hearing volunteers with the aim of characterizing the changes in the peripheral and central... more
    • Speech Recognition, Temporal Resolution, Medicine, Presbycusis
There exists a large conceptual gap between symbolic models and emergent models for the mind. Many emergent models work on low-level sensory data, while many symbolic models deal with high-level abstract (i.e., action) symbols. There has... more
    • Robotics, Computer Architecture, Artificial Intelligence, Computer Vision
Current predictors of speech intelligibility are inadequate for understanding and predicting speech confusions caused by acoustic interference. We develop a model of auditory speech processing that includes a phenomenological... more
    • Cognitive Science, Speech Synthesis, Speech Recognition, Linguistics
Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multi-media collections in the near future. Considering the characteristics and monosyllabic structure of the... more
    • Cognitive Science, Speech Recognition, Linguistics, Speech Communication
This article discusses how advances in human language technology can help overcome some of the barriers that prevent community participation in cyberspace. Human language technology refers to the set of technologies, such as speech... more
    • Speech Recognition, Community Participation, Human Language Technology
Sound is essential to enhance visual experience and human robot interaction, but usually most research and development efforts are made mainly towards sound generation, speech synthesis and speech recognition. The reason why only a little... more
    • Speech Synthesis, Speech Recognition, Auditory Scene Analysis, Human behavior
In the case of a trigram language model, the probability of the next word conditioned on the previous two words is estimated from a large corpus of text. The resulting static trigram language model (STLM) has fixed probabilities that are... more
    • Speech Recognition, Multidisciplinary, Natural language, Bit Error Rate
An input device should be natural and convenient for a user to transmit information to a computer, and should be designed from an understanding of the task to be performed and the interrelationship between the task and the device from the... more
    • Human Factors, User Interface, Speech Recognition, Experimental Design