Information about Speech Processing

Speech processing is the study of speech signals and the processing methods of these signals.

The signals are usually processed in a digital representation, so speech processing can be seen as the intersection of digital signal processing and natural language processing.

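Speech processing can be divided into the following categories: speech recognition, speaker recognition, enhancement of speech signals (for example, noise reduction), speech coding, voice analysis, and speech synthesis.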

Speech communication refers to the processes associated with the production and perception of sounds used in spoken language. A number of academic disciplines study speech and speech sounds, including acoustics, psychology, speech pathology, linguistics, and computer science.
A digital system is one that uses discrete values (often electrical voltages), representing numbers or non-numeric symbols such as letters or icons, for input, processing, transmission, storage, or display, rather than a continuous range of values (as in an analog system).
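The difference between a continuous range and discrete values can be illustrated with a minimal quantization sketch. The function names, the 8-level resolution, and the [-1, 1] range are assumptions chosen for illustration; they do not come from the text above.

```python
def quantize(value, levels=8, lo=-1.0, hi=1.0):
    """Map a continuous value in [lo, hi] to one of `levels` integer codes."""
    # Clamp to the representable range before mapping to a code.
    clamped = max(lo, min(hi, value))
    step = (hi - lo) / (levels - 1)
    return round((clamped - lo) / step)


def dequantize(code, levels=8, lo=-1.0, hi=1.0):
    """Map an integer code back to the value it represents."""
    step = (hi - lo) / (levels - 1)
    return lo + code * step
```

With these assumed settings, every input is represented by one of only eight codes, so the reconstructed value can differ from the original by up to half a step.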
Digital signal processing ('DSP') is the study of signals in a digital representation and the processing methods of these signals. DSP and analog signal processing are subfields of signal processing.
Natural language processing (NLP) is a subfield of artificial intelligence and computational linguistics. It studies the problems of automated generation and understanding of natural human languages.
Speech recognition (in many contexts also known as automatic speech recognition, computer speech recognition or erroneously as voice recognition) is the process of converting a speech signal to a sequence of words in the form of digital data, by means of an
Linguistics is the scientific study of language, which can be theoretical or applied. Someone who engages in this study is called a linguist.
Speaker recognition, or voice recognition, is the task of recognizing people from their voices. Such systems extract features from speech, model them, and use them to recognize a person from their voice.
Identity is an umbrella term used throughout the social sciences to describe an individual's comprehension of himself or herself as a discrete, separate entity. This term, though generic, can be further specified by the disciplines of psychology and sociology, including the two forms


Noise reduction is the process of removing noise from a signal. Noise reduction techniques are conceptually very similar regardless of the signal being processed; however, a priori knowledge of the characteristics of an
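As a rough illustration of removing noise from a signal, the sketch below applies a moving-average filter, one of the simplest smoothing techniques. It is an illustrative choice, not a method named in the text, and the function name and default window size are assumptions.

```python
def moving_average(signal, window=3):
    """Smooth a sequence by averaging each sample with its neighbours."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    smoothed = []
    for i in range(len(signal)):
        # Shrink the window at the edges instead of padding.
        start = max(0, i - half)
        end = min(len(signal), i + half + 1)
        smoothed.append(sum(signal[start:end]) / (end - start))
    return smoothed
```

A short spike is spread over its neighbours and reduced in height, which also shows the cost of this approach: genuine rapid changes in the signal are blurred along with the noise.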
Speech coding is the application of data compression to digital audio signals containing speech. Speech coding uses speech-specific parameter estimation using audio signal processing techniques to model the speech signal, combined with generic data compression algorithms to
Data compression, or source coding, is the process of encoding information using fewer bits (or other information-bearing units) than an un-encoded representation would use, through use of specific encoding schemes.
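The idea of using fewer bits can be tried out with Python's standard `zlib` module, which implements a generic lossless scheme (DEFLATE). This is only a sketch of data compression in general, not of a speech codec, and the repetitive sample text is an assumption picked so that the effect is easy to see.

```python
import zlib

# Highly repetitive input, chosen so that the size reduction is obvious.
text = ("speech " * 200).encode("utf-8")

packed = zlib.compress(text)        # encoded representation
restored = zlib.decompress(packed)  # lossless: identical to the input

print(len(text), "bytes before,", len(packed), "bytes after")
```

How much is saved depends entirely on the redundancy in the input; data that is already compressed or random may not shrink at all.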
Telecommunication is the transmission of signals over a distance for the purpose of communication. In modern times, this process typically involves the sending of electromagnetic waves by electronic transmitters, but in earlier times telecommunication may have involved the use of
Voice analysis is the study of speech sounds for purposes other than their linguistic content (which is the concern of speech recognition). Such studies are mostly medical analyses of the voice, i.e. phoniatrics, but also include speaker identification.
Vocal loading is the stress inflicted on the speech organs when speaking for long periods. About 15% of the working population have professions in which the voice is the primary tool.
The vocal folds, popularly known as vocal cords, are composed of twin infoldings of mucous membrane stretched horizontally across the larynx. They vibrate, modulating the flow of air being expelled from the lungs during phonation.
Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech synthesizer, and can be implemented in software or hardware.
Audio signal processing, sometimes referred to as audio processing, is the processing of a representation of auditory signals, or sound. The representation can be digital or analog.
Phonetics (from the Greek word φωνή, phone meaning 'sound, voice') is the study of the sounds of human speech. It is concerned with the actual properties of speech sounds (phones), and their production, audition and perception, while phonology, which
An utterance is a complete unit of speech in spoken language. It is generally but not always bounded by silence. It can be represented and delineated in written language in many ways.
Speech signal processing refers to the acquisition, manipulation, storage, transfer and output of human utterances by a computer. The main goals are the recognition, synthesis and compression of human speech.
Packet Loss Concealment (PLC) is a technique to mask the effects of packet loss in VoIP communications. Because the voice signal is sent as packets on a VoIP network, the packets may travel different routes to reach their destination.
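One very simple way to mask a lost packet is to repeat the last frame that did arrive. The sketch below assumes that strategy purely for illustration; the text does not say which concealment method is used, and the function name and the use of `None` to mark a lost frame are hypothetical.

```python
def conceal_lost_frames(frames):
    """Fill lost frames (marked None) with a copy of the last received frame.

    Frames lost before anything has arrived are replaced by silence (zeros).
    """
    # Length of a frame, taken from the first frame that actually arrived.
    frame_len = next((len(f) for f in frames if f is not None), 0)
    last_good = None
    output = []
    for frame in frames:
        if frame is None:
            frame = last_good if last_good is not None else [0] * frame_len
        else:
            last_good = frame
        output.append(list(frame))
    return output
```

Repetition works tolerably for isolated losses because speech changes slowly over a few tens of milliseconds, but it sounds increasingly artificial as more consecutive frames are lost.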
Estimation theory is a branch of statistics and signal processing that deals with estimating the values of parameters based on measured/empirical data. The parameters describe the physical scenario or object that answers a question posed by the estimator.
Detection theory, or signal detection theory, is a means to quantify the ability to discern between signal and noise. Much of the early work in detection theory was done by radar researchers. [1] Detection theory was used in 1966 by John A. Swets and David M.
Control engineering is the engineering discipline that focuses on mathematical modelling of systems of a diverse nature, analyzing their dynamic behavior, and using control theory to create a controller that will cause the systems to behave in a desired manner.
Digital image processing is the use of computer algorithms to perform image processing on digital images. Digital image processing has the same advantages over analog image processing as digital signal processing has over analog signal processing — it allows a much wider
Statistical signal processing is an area of signal processing that treats signals as stochastic processes, dealing with their statistical properties (e.g., mean, covariance, etc.).
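The two statistical properties mentioned, mean and covariance, can be estimated from observed samples. The following is a minimal sketch using the usual unbiased sample estimators; the function names are illustrative and not taken from the text.

```python
def sample_mean(x):
    """Arithmetic mean of a non-empty sequence of samples."""
    return sum(x) / len(x)


def sample_covariance(x, y):
    """Unbiased sample covariance of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x, mean_y = sample_mean(x), sample_mean(y)
    products = ((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return sum(products) / (len(x) - 1)
```

Calling `sample_covariance(x, x)` gives the sample variance of `x`.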
The discrete Fourier transform (DFT), occasionally called the finite Fourier transform, is a transform for Fourier analysis of finite-domain discrete-time signals. As with most Fourier analyses, it expresses an input function in terms of a sum of sinusoidal components by determining
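The decomposition into sinusoidal components can be written directly from the standard DFT definition, X[k] = sum over t of x[t] * exp(-2*pi*i*k*t/N). The sketch below is a deliberately naive O(N^2) version meant to show the idea; practical code would normally use a fast Fourier transform instead, and the function name is an assumption.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform of a finite sequence."""
    n = len(x)
    return [
        # Coefficient k measures how strongly the sinusoid that completes
        # k cycles over the n samples is present in x.
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]
```

For a real cosine that completes exactly one cycle over the N input samples, only coefficients 1 and N-1 are significantly non-zero, each with magnitude N/2.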


This article is copied from an article on Wikipedia.org, the free encyclopedia created and edited by an online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of Wikipedia articles provide accurate and timely information, please do not assume the accuracy of any particular article. This article is distributed under the terms of the GNU Free Documentation License.