Algorithms and Software for Predictive and Perceptual Modeling of Speech

Author : Venkatraman Atti
Publisher : Morgan & Claypool Publishers
Total Pages : 124
Release : 2010-05-05
ISBN-10 : 160845388X
ISBN-13 : 9781608453887

Book Synopsis: Algorithms and Software for Predictive and Perceptual Modeling of Speech by Venkatraman Atti

Algorithms and Software for Predictive and Perceptual Modeling of Speech was written by Venkatraman Atti and published by Morgan & Claypool Publishers. This book was released on 2010-05-05 and has 124 pages. Available in PDF, EPUB and Kindle.

Book excerpt: From the early pulse code modulation-based coders to some of the recent multi-rate wideband speech coding standards, the area of speech coding has made several significant strides with the objective of attaining high speech quality at the lowest possible bit rate. This book presents some of the recent advances in linear prediction (LP)-based speech analysis that employ perceptual models for narrow- and wide-band speech coding. The LP analysis-synthesis framework has been successful for speech coding because it fits the source-system paradigm for speech synthesis well. Limitations associated with conventional LP have been studied extensively, and several extensions to LP-based analysis-synthesis have been proposed, e.g., the discrete all-pole modeling, the perceptual LP, the warped LP, the LP with modified filter structures, the IIR-based pure LP, all-pole modeling using the weighted-sum of LSP polynomials, the LP for low frequency emphasis, and the cascade-form LP. These extensions can be classified as algorithms that either attempt to improve the LP spectral envelope fitting performance or embed perceptual models in the LP. The first half of the book reviews some of the recent developments in predictive modeling of speech with the help of MATLAB simulation examples. The advantages of integrating perceptual models in low bit rate speech coding depend on how accurately these models mimic human performance and, more importantly, on the achievable "coding gains" and "computational overhead" associated with these physiological models.
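As a rough illustration of the conventional LP analysis that the excerpt refers to, the sketch below estimates all-pole coefficients with the autocorrelation method and the Levinson-Durbin recursion. It is written in Python rather than the book's MATLAB, it is not taken from the book, and the function names and the second-order test filter are assumptions chosen only for illustration.

```python
def autocorrelation(x, max_lag):
    """Biased-sum autocorrelation r[0..max_lag] of a finite frame x."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LP normal equations.

    Returns (a, err) where a = [1, a1, ..., ap] are the coefficients of
    A(z) = 1 + a1 z^-1 + ... + ap z^-p and err is the prediction error energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err                      # reflection coefficient
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a, err


def all_pole_impulse_response(a, n_samples):
    """Impulse response of 1/A(z); used here only to build a test signal."""
    h = []
    for n in range(n_samples):
        acc = 1.0 if n == 0 else 0.0
        acc -= sum(a[k] * h[n - k] for k in range(1, len(a)) if n - k >= 0)
        h.append(acc)
    return h


if __name__ == "__main__":
    # Hypothetical stable all-pole "vocal tract" (illustrative values only).
    true_a = [1.0, -1.5, 0.7]
    frame = all_pole_impulse_response(true_a, 400)
    a_est, err = levinson_durbin(autocorrelation(frame, 2), 2)
    print(a_est, err)  # a_est should be close to true_a
```

Real speech frames would normally be windowed and analyzed at a higher order (around 10 for narrowband speech is common), which this toy example leaves out.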
Methods that exploit the masking properties of the human ear in speech coding standards, even today, are largely based on concepts introduced by Schroeder and Atal in 1979. For example, a simple approach employed in speech coding standards is to use a perceptual weighting filter to shape the quantization noise according to the masking properties of the human ear. The second half of the book reviews some of the recent developments in perceptual modeling of speech (e.g., masking threshold, psychoacoustic models, auditory excitation pattern, and loudness) with the help of MATLAB simulations. Supplementary material, including the MATLAB programs and simulation examples presented in this book, is also available. Table of Contents: Introduction / Predictive Modeling of Speech / Perceptual Modeling of Speech
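The perceptual weighting filter mentioned in the excerpt is commonly written in LP-based coders as W(z) = A(z/γ1) / A(z/γ2), where A(z) is the LP inverse filter and the bandwidth-expansion factors satisfy 0 < γ2 < γ1 ≤ 1. The sketch below is a minimal Python illustration of that form and is not taken from the book; the function names and the values γ1 = 0.9 and γ2 = 0.6 are assumptions, and actual standards choose their own factors.

```python
def bandwidth_expand(a, gamma):
    """Coefficients of A(z/gamma): the k-th coefficient is scaled by gamma**k."""
    return [c * gamma ** k for k, c in enumerate(a)]


def pole_zero_filter(b, a, x):
    """Direct-form filter y[n] = sum_k b[k] x[n-k] - sum_{k>=1} a[k] y[n-k], with a[0] == 1."""
    y = []
    for n in range(len(x)):
        acc = sum(b[k] * x[n - k] for k in range(len(b)) if n - k >= 0)
        acc -= sum(a[k] * y[n - k] for k in range(1, len(a)) if n - k >= 0)
        y.append(acc)
    return y


def perceptual_weighting(a, x, gamma1=0.9, gamma2=0.6):
    """Apply W(z) = A(z/gamma1) / A(z/gamma2) to x (gamma values are illustrative)."""
    num = bandwidth_expand(a, gamma1)
    den = bandwidth_expand(a, gamma2)
    return pole_zero_filter(num, den, x)


if __name__ == "__main__":
    lp_coeffs = [1.0, -1.5, 0.7]          # hypothetical A(z), for illustration only
    impulse = [1.0] + [0.0] * 7
    print(perceptual_weighting(lp_coeffs, impulse))
```

Because W(z) de-emphasizes the error near the formant peaks of 1/A(z), minimizing the weighted error lets more quantization noise sit under the formants, where the speech itself is more likely to mask it.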


Algorithms and Software for Predictive and Perceptual Modeling of Speech Related Books

Algorithms and Software for Predictive and Perceptual Modeling of Speech
Language: en
Pages: 124
Authors: Venkatraman Atti
Categories: Technology & Engineering
Type: BOOK - Published: 2010-05-05 - Publisher: Morgan & Claypool Publishers

From the early pulse code modulation-based coders to some of the recent multi-rate wideband speech coding standards, the area of speech coding made several sign
Algorithms and Software for Predictive and Perceptual Modeling of Speech
Language: en
Pages: 113
Authors: Venkatraman Atti
Categories: Technology & Engineering
Type: BOOK - Published: 2022-05-31 - Publisher: Springer Nature

From the early pulse code modulation-based coders to some of the recent multi-rate wideband speech coding standards, the area of speech coding made several sign
Bandwidth Extension of Speech Using Perceptual Criteria
Language: en
Pages: 71
Authors: Visar Berisha
Categories: Technology & Engineering
Type: BOOK - Published: 2022-06-01 - Publisher: Springer Nature

Bandwidth extension of speech is used in the International Telecommunication Union G.729.1 standard in which the narrowband bitstream is combined with quantized
Engineer Your Software!
Language: en
Pages: 121
Authors: Scott A. Whitmire
Categories: Technology & Engineering
Type: BOOK - Published: 2022-06-01 - Publisher: Springer Nature

Software development is hard, but creating good software is even harder, especially if your main job is something other than developing software. Engineer Your
Despeckle Filtering for Ultrasound Imaging and Video, Volume I
Language: en
Pages: 154
Authors: Christos P. Loizou
Categories: Technology & Engineering
Type: BOOK - Published: 2022-05-31 - Publisher: Springer Nature

It is well known that speckle is a multiplicative noise that degrades image and video quality and the visual expert's evaluation in ultrasound imaging and video