Made in Chicago Company Directory

This paper proposes text independent automatic speaker verification system using IMFCC (Inverse/ Reverse Mel Frequency Coefficients) and IT-EM (Information Theoretic Expectation Maximization). To perform speaker verification, feature... more

Bookmark
Download
- by Sania Bhatti
- •
- 2
  Computer Science, Speech Recognition

This paper aims to build a Bangla speech sentence recognition system by Hidden Markov Model (HMM). This system includes two phases; such as, a feature extraction phase to generate speech features from the Bangla speech sentence and a... more

Bookmark
Download
- by Md. Mijanur Rahman and +1
  Ashraful Kadir
- •
- 2
  Speech Recognition, Hidden Markov Models

A new form of augmentative and alternative communication (AAC) device for people with severe speech impairment-the voice-input voice-output communication aid (VIVOCA)-is described. The VIVOCA recognizes the disordered speech of the user... more

Bookmark
Download
- by Pamela Enderby
- •
- 2
  Speech Recognition, Speech Translation

Most elderly people monitoring systems include the detection of abnormal situations, in particular distress situations, as one of their main goals. In order to reach this objective, many solutions end up combining several modalities such... more

This paper describes a study on tone statistics of peoples' names in Mandarin Chinese. The problem was brought out when we tried to apply an English version of a speech recognizer to a Chinese voice tag dialing task. The questions were:... more

For many audiovisual applications, the integration and synchronization of audio and video signals is essential. The objective of this paper is to develop a system that displays the active objects in the captured video signal, integrated... more

This paper presents an application, "LentInfo", which is a system used to provide information about programmes for the Festival Lent in Slovenia. The Festival Lent consists of different open-air theatre and music performances and raws... more

Bookmark
Download
- by Matej Rojc
- •
- 12
  Cognitive Science, Speech Synthesis, Speech Recognition, Linguistics

The estimation of initial language models for new applications of spoken dialogue systems without large taskspecific training corpora is becoming an increasingly important issue. This paper investigates two different approaches in which... more

Vocal communication is most effective when the listener is able to observe the mouth of the speaker. This is especially true for the hearing impaired, and dramatically true for the deaf, who rely on lip-reading for comprehending speech.

Bookmark
Download
- by Robert Rodman
- •
- 11
  Animation, Computer Animation, Speech Recognition, Speech Communication

We present an improved system combination technique,iROVER. Our approach obtains significant improvements over ROVER, and is consistently better across varying numbers of component systems. A classifier is trained on features from the... more

Bookmark
Download
- by Dustin Hillard
- •
- Speech Recognition

The Sphinx-4 speech recognition system is the latest addition to Carnegie Mellon University's repository of Sphinx speech recognition systems. It has been jointly designed by Carnegie Mellon University, Sun Microsystems Laboratories... more

Bookmark
Download
- by Paul Lamere
- •
- 5
  Speech Recognition, System Design, Information Sources, Language Model

This paper proposes several speech technology improvements for increasing robustness, reliability and ergonomics in speech interfaces for controlling aerial vehicles. These improvements consist of including a statistical language model... more

Bookmark
Download
- by Victor Perez
- •
- 20
  Mechanical Engineering, Aerospace Engineering, Ergonomics, Semantics

Life is a blessing from the graces of God that he granted to all His creatures. God has excelled in creating this life in its most beautiful form, making the difference of people and the difference of graces among them a cornerstone for... more

We present two real-time hidden Markov model-based systems for recognizing sentence-level continuous American Sign Language (ASL) using a single camera to track the user's unadorned hands. The first system observes the user from a desk... more

This paper describes a method to detect smiles and laughter sounds from the video of natural dialogue. A smile is the most common facial expression observed in a dialogue. Detecting a user's smiles and laughter sounds can be useful for... more

Bookmark
Download
- by Akinori Ito
- •
- 8
  Face Recognition, User Interface, Speech Recognition, Facial expression

With the advancement of technology, we can implement a variety of ideas to serve mankind in numerous ways. Inspired by this, we have developed a smart hand glove system which will be able to help the people having hearing and speech... more

Bookmark
Download
- by IJRASET Publication
- •
- Speech Recognition

Speech recognition and speaker identification are important for authentication and verification in security purpose, but they are difficult to achieve. Speaker identification methods can be divided into textindependent and text-dependent.... more

In this paper, we present a mismatch-aware stochastic matching (MASM) algorithm to alleviate the performance degradation under mismatched training and testing conditions. MASM first computes a reliability measure of applying a set of... more

A methodology and environment for building adaptive speech recognition systems is presented. The development environment is designed for isolated word recognition systems. A small speech recognition system is developed for a home... more

Prosody has been widely used in many speech-related applications including speaker and word recognition, emotion and accent identification, topic and sentence segmentation, and text-to-speech applications. An important application we... more

Bookmark
Download
- by Omair Khan
- •
- 7
  Speech Recognition, Decision Trees, Fundamental Frequency, Word Recognition

Growing needs for French closed-captioning of live TV broadcasts in Canada cannot be met only with stenography-based technology because of a chronic shortage of skilled stenographers. Using speech recognition for live closed-captioning,... more

Bookmark
Download
- by claude chapdelaine
- •
- 5
  Speech Recognition, Model Updating, Collaborative Work, Real Time

This paper deals with the introduction of an efficient speech front-end for automatic speech recognition. This front-end not only performs well, in comparison to the traditional and widely used MFCC, but is also efficiently implemented in... more

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 8 6 ( 2 0 0 7 ) 21-29 a b s t r a c t Personal Digital Assistant devices are becoming a frequently used device for the bedside care of the patient. Ways of... more

Bookmark
Download
- by Simone Tassani and +1
  Cinzia Cacciari
- •
- 15
  Biomedical Engineering, Telecommunications, Speech Recognition, Orthopaedics

This paper describes a database of dysarthric speech produced by 19 speakers with cerebral palsy. Speech materials consist of 765 isolated words per speaker: 300 distinct uncommon words and 3 repetitions of digits, computer commands,... more

In the present work excerpts of research are presented, concerning the application of modified acoustic signal processing methods in the problem of "understanding" of selected pathologies of vocal tract. The presented concept of the... more

Bookmark
Download
- by Antoni Izworski and +2
  Ryszard Tadeusiewicz
  Wszolek Wszolek
- •
- 10
  Speech Recognition, Neural Network, Speech Processing, Signal Analysis

This paper focuses on microphone arrays to realize distant-talking speech recognition in real environments. In distant-talking situations, users can speak at arbitrary positions while moving. Therefore, it is very important for high... more

We present an approach to automatically recognize sign language and translate it into a spoken language. A system to address these tasks is created based on state-ofthe-art techniques from statistical machine translation, speech... more

Bookmark
Download
- by Philippe Dreuw
- •
- 8
  Technology, Image Processing, Sign Language, Speech Recognition

It is well known that the introduction of acoustic background distortion and the variability resulting from environmentally induced stress causes speech recognition algorithms to fail. In this paper, several causes for recognition... more

It is well known that the introduction of acoustic background distortion and the variability resulting from environmentally induced stress causes speech recognition algorithms to fail. In this paper, several causes for recognition performance degradation are explored. It is suggested that recent studies based on a Source Generator Framework can provide a viable foundation in which to establish robust speech recognition techniques. This research encompasses three inter-related issues: (i) analysis and modeling of speech characteristics brought on by workload task stress, speaker emotion/stress or speech produced in noise (Lombard effect), (ii) adaptive signal processing methods tailored to speech enhancement and stress equalization, and (iii) formulation of new recognition algorithms which are robust in adverse environments. An overview of a statistical analysis of a Speech Under Simulated and Actual Stress (SUSAS) database is presented. This study was conducted on over 200 parameters in the domains of pitch, duration, intensity, glottal source and vocal tract spectral variations. These studies motivate the development of a speech modeling approach entitled Source Generator Framework in which to represent the dynamics of speech under stress. This framework provides an attractive means for performing feature equalization of speech under stress. In the second half of this paper, three novel approaches for signal enhancement and stress equalization are considered to address the issue of recognition under noisy stressful conditions. The first method employs (Auto:I,LSP:T) constrained iterative speech enhancement to address background noise and maximum likelihood stress equalization across formant location and bandwidth. The second method uses a feature enhancing artificial neural network which transforms the input stressed speech feature set during parameterization for keyword recognition. The final method employs morphological constrained feature enhancement to address noise and an adaptive Mel-cepstral compensation algorithm to equalize the impact of stress. Recognition performance is demonstrated for speech under a range of stress conditions, signal-to-noise ratios and background noise types. Es ist wohlbekannt, dass die Einftihrung von Hintergrundger'riuschen und von VariabilitPt der Umgebung dazu ftihren, dass Spracherkennungsalgorithmen versagen. In diesem Paper werden verschiedene l%lle untersucht, die zu einer Minderung des Erkennungsgrades ftihren. Es wird vorgeschlagen, dass gegenw'%tige Untersuchungen, basierend auf Source Generafor Framework, eine variable Grundlage bilden, in der robuste Spracherkennungstechniken aufgebaut werden kannen. Diese

We describe a system for model based speech separation which achieves superhuman recognition performance when two talkers speak at similar levels. The system can separate the speech of two speakers from a single channel recording with... more

STRAIGHT, a speech analysis, modification synthesis system, is an extension of the classical channel VOCODER that exploits the advantages of progress in information processing technologies and a new conceptualization of the role of... more

Extractive speech summarization, which purports to select an indicative set of sentences from a spoken document so as to succinctly represent the most important aspects of the document, has garnered much research over the years. In this... more

The main steps of document processing have been reviewed, especially those implemented on Arabic writing. The techniques used in this research, such as Vector Quantization (VQ), Hidden Markov Models (HMM), and Induction of Decision Trees... more

This paper presents an emerging application of multimodal interface research to distributed applications. We have developed the QuickSet prototype, a pen/voice system running on a hand-held PC, communicating via wireless LAN through an... more

This paper considers the problem of constructing an efficient inverted index for the spoken term detection (STD) task. More specifically, we construct a deterministic weighted finite-state transducer storing soft-hits in the form of... more

In this paper, we present a set of optimizations for a spoken language interface for mobile devices that can improve the recognition accuracy and user interaction experience. A comparison between a speech and a graphical interface, when... more

Bookmark
Download
- by António Calado
- •
- 10
  Speech Recognition, User eXperience, Mobility, Usability Evaluation

Bookmark
Download
- by Deniz Başkent
- •
- 2
  Speech Recognition, Dynamic Range

Acoustic signals recorded simultaneously in a reverberant environment can be described as sums of differently convolved sources. The task of source separation is to identify the multiple channels and possibly to invert those in order to... more

Since 1990 the DRA Speech Research Unit has conducted research into applications of speech recognition technology to speech and language development for young children. This has been done in collaboration wirh Hereford and Worcester... more

Bookmark
Download
- by Martin Russell
- •
- 15
  Computer Science Education, Animation, Literature, Computer Animation

This paper presents the design of a FPGA-based hardware co-processor, based on the SPHINX 3 speech recognition engine from CMU; capable of performing Acoustic Modeling (AM) for medium sized vocabularies in real-time. By creating an... more

This paper describes the development and validation of an Embedded Isolated Word Recognition System (IWR) for the Argentinian Spanish language, implemented on the STM32F4-Discovery platform. Its front-end extracts Mel Frequency Cepstral... more

The hearing abilities of a group of 30 elderly (67–93yr of age) subjects were compared with those of a group of 30 young (19–27yr of age) normal hearing volunteers with the aim of characterizing the changes in the peripheral and central... more

Bookmark
Download
- by Josef Syka
- •
- 9
  Speech Recognition, Temporal Resolution, Medicine, Presbycusis

There exists a large conceptual gap between symbolic models and emergent models for the mind. Many emergent models work on low-level sensory data, while many symbolic models deal with high-level abstract (i.e., action) symbols. There has... more

Bookmark
Download
- by Juyang Weng
- •
- 18
  Robotics, Computer Architecture, Artificial Intelligence, Computer Vision

Current predictors of speech intelligibility are inadequate for understanding and predicting speech confusions caused by acoustic interference. We develop a model of auditory speech processing that includes a phenomenological... more

Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multi-media collections in the near future. Considering the characteristics and monosyllabic structure of the... more

Bookmark
Download
- by Hsin-min Wang
- •
- 7
  Cognitive Science, Speech Recognition, Linguistics, Speech Communication

This article talks about how advances in human language technology can help overcomesome of the barriers that prevent community participation in cyberspace. Human languagetechnology refers to the set of technologies, such as speech... more

Sound is essential to enhance visual experience and human robot interaction, but usually most research and development efforts are made mainly towards sound generation, speech synthesis and speech recognition. The reason why only a little... more

In the case of a trlgr~m language model, the probability of the next word conditioned on the previous two words is estimated from a large corpus of text. The resulting static trigram language model (STLM) has fixed probabilities that are... more

Bookmark
Download
- by Don Strong
- •
- 5
  Speech Recognition, Multidisciplinary, Natural language, Bit Error Rate

An input device should be natural and convenient for a user to transmit information to a computer, and should be designed from an understanding of the task to be performed and the interrelationship between the task and the device from the... more

Bookmark
Download
- by Alex Stedmon
- •
- 6
  Human Factors, User Interface, Speech Recognition, Experimental Design

Directory

Speech Recognition

Log In