IJERT-Residual Excited Linear Predictive Coding

IJERT Journal

2015, International Journal of Engineering Research and Technology (IJERT)

Abstract

https://www.ijert.org/residual-excited-linear-predictive-coding https://www.ijert.org/research/residual-excited-linear-predictive-coding-IJERTV4IS050982.pdf

In this paper we present a low-bit-rate voice coding technique called residual-excited linear prediction (RELP) coding. It uses a 10th-order Levinson-Durbin recursive algorithm, which provides accurate estimates of the speech parameters and is relatively efficient to compute. In the RELP system, the vocal tract is modeled by the LPC technique, and the LPC residual signal is used as the excitation signal. The transmission rate is reduced to the 9.6 kbit/s range, where the synthetic speech is quite good. As the transmission rate is lowered, the synthetic speech quality degrades very gradually. Since no pitch extraction is required, the coder is robust in any operating environment. Speech signals from male and female speakers were coded, and the results showed that the coding technique gives good speech quality with low complexity.
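The analysis and synthesis steps described in the abstract can be sketched in a few lines. The following is a minimal illustration, not the authors' implementation: it assumes the autocorrelation method for LPC analysis, the function names (`autocorr`, `levinson_durbin`, `relp_analyze`, `relp_synthesize`) are hypothetical, and the residual is passed to the decoder unquantized. A real RELP coder would also quantize the coefficients and reduce the residual bandwidth to reach rates such as 9.6 kbit/s, which is omitted here.

```python
import numpy as np

def autocorr(x, max_lag):
    """Autocorrelation r[0..max_lag] of one frame (biased, unnormalized)."""
    x = np.asarray(x, dtype=float)
    return np.array([np.dot(x[:len(x) - k], x[k:]) for k in range(max_lag + 1)])

def levinson_durbin(r, order):
    """Levinson-Durbin recursion.

    Returns (a, err) where a[0] == 1 and A(z) = 1 + sum_k a[k] z^-k is the
    prediction-error (inverse) filter, and err is the final error energy.
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Correlation of the current predictor with lag i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err                      # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= (1.0 - k * k)
    return a, err

def relp_analyze(frame, order=10):
    """Encoder side: LPC coefficients plus the residual (excitation) signal."""
    r = autocorr(frame, order)
    a, _ = levinson_durbin(r, order)
    # Inverse filtering: e[n] = sum_k a[k] * x[n-k], with zero initial state.
    residual = np.convolve(frame, a)[:len(frame)]
    return a, residual

def relp_synthesize(a, residual):
    """Decoder side: drive the all-pole filter 1/A(z) with the residual."""
    order = len(a) - 1
    out = np.zeros(len(residual))
    for n in range(len(residual)):
        past = sum(a[k] * out[n - k] for k in range(1, min(order, n) + 1))
        out[n] = residual[n] - past
    return out
```

Because the residual is left unquantized in this sketch, `relp_synthesize(a, residual)` reproduces the input frame up to floating-point error. Note also that the excitation comes directly from the residual, so no pitch estimate is needed at any point, which matches the abstract's remark on pitch extraction.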

Ibrahim Mansour

The International Conference on Electrical Engineering

Speech coding is a very important area with civilian and military applications, and it can be considered one of the important stages in speech processing. It is used to compress speech, because the speech signal is highly redundant. Speech coding has many applications: it is used in digital telephony, in multimedia, and in the security of digital communications. In this paper, we focus on developing algorithms and methods for a waveform speech coder operating at a low bit rate with a good-quality reconstructed speech signal. Moreover, a new model for linear predictive coding of speech that can be used to produce high-quality speech at a low data rate is introduced. In this model, we divide the residual (excitation signal) into subframes and apply energy and voiced/unvoiced classifications to choose the pulses in the residual that give a low bit rate and good quality for the reconstructed speech. Hence, this vocoder forms an excitation sequence that consists of groups of uniformly spaced pulses. During analysis, the amplitudes of the pulses and the LP coefficients are determined. In addition, a new technique for quantizing the amplitude of each pulse as well as the linear prediction parameters is proposed.

Voice Excited Lpc for Speech Compression by V/Uv Classification

IOSR Journals

Speech coding is an important application of speech processing. Linear predictive coding (LPC) is a powerful speech coding technique used for encoding speech signals at a low bit rate. This method provides accurate estimation of parameters with low complexity. In this paper we discuss the implementation of a plain linear predictive coding (LPC) voice coder and a voice-excited linear predictive (VELP) voice coder. Both of these voice coders are based on the principle of linear prediction, where the current sample is predicted by a linear function of past values. VELP is an improved version of the plain LPC voice coder. It is implemented by using the DCT for coefficients to improve the quality of speech. Simulation results of plain LPC and VELP are compared, and we find that VELP produces a better quality signal than LPC.
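The linear prediction principle named in this abstract can be written out explicitly. The symbols below are chosen here for illustration and do not come from the abstract: $s[n]$ is the speech sample, $p$ the predictor order, $\alpha_k$ the predictor coefficients, and $e[n]$ the prediction residual.

```latex
\hat{s}[n] = \sum_{k=1}^{p} \alpha_k \, s[n-k],
\qquad
e[n] = s[n] - \hat{s}[n]
```

LPC analysis chooses the coefficients $\alpha_k$ that minimize the residual energy $\sum_n e^2[n]$ over a frame.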

Low‐Bit‐Rate Speech Coding

Miguel Arjona Ramírez

2003

This article is focused on speech coding methods for achieving communication quality speech at bit rates of 4 kbit/s and lower. The speech coding techniques are based on an all-pole model of the vocal tract which may be implemented in the time domain with appropriately selected excitation functions or else may be fit to a spectral analysis of the speech signal. Three main types of coders are described below. Code-excited linear prediction (CELP) coders select their excitation from waveform codebooks using analysis-by-synthesis closed-loop techniques, which need to be supplemented by speech classification and open-loop parametric techniques for keeping up with quality at lower rates. The prototypical sinusoidal coder (SC) has a bank of oscillators for signal synthesis, driven by a model of the magnitude spectrum. However, phase regeneration is important in enhancing speech reconstruction at low rates. Waveform interpolation (WI) coders afford a wider time-frequency footprint for the representation of the excitation, showing a good potential for achieving toll quality at bit rates below 4 kbit/s.

Predictive Coding of Speech at Low Bit Rates

Bishnu Atal

IEEE Transactions on Communications, 1982

Abstract: Predictive coding is a promising approach for speech coding. In this paper, we review the recent work on adaptive predictive coding of speech signals, with particular emphasis on achieving high speech quality at low bit rates (less than 10 kbits/s). Efficient prediction of the redundant structure in speech signals is obviously important for proper functioning of a predictive coder. It is equally important to ensure that the distortion in the coded speech signal be perceptually small. The subjective loudness of quantization noise depends both on the short-time spectrum of the noise and its relation to the short-time spectrum of the speech signal. The noise in the formant regions is partially masked by the speech signal itself. This masking of quantization noise by speech signal allows one to use low bit rates while maintaining high speech quality. This paper will present generalizations of predictive coding for minimizing subjective distortion in the reconstructed speech signal at the receiver. The quantizer in predictive coders quantizes its input on a sample-by-sample basis. Such sample-by-sample (instantaneous) quantization creates difficulty in realizing an arbitrary noise spectrum, particularly at low bit rates. We will describe a new class of speech coders in this paper which could be considered to be a generalization of the predictive coder. These new coders not only allow one to realize the precise optimum noise spectrum which is crucial to achieving very low bit rates, but also represent the important first step in bridging the gap between waveform coders and vocoders without suffering from their limitations.

Design of MELPe-Based Variable-Bit-Rate Speech Coding with Mel Scale Approach Using Low-Order Linear Prediction Filter and Representing Excitation Signal Using Glottal Closure Instants

P. Sathidevi

Arabian Journal for Science and Engineering, 2019

In this paper, we propose a variable-bit-rate speech codec based on mixed excitation linear prediction enhanced (MELPe) with an average bit rate of 2 kbps and with a better representation of the excitation signal. The order of the prediction filter in the MELPe coding architecture is reduced from 10 to 7 without affecting the perceptual quality of the decoded speech by using the psychoacoustic Mel scale. An efficient two-split vector quantization is developed with a weighted Euclidean distance measure for Mel scale-based linear predictive coding (Mel-LPC), and it requires only 18 bits/frame. The instantaneous pitch or epoch that is vital for many speech processing applications is preserved in this codec by including it in the excitation signal used for reconstructing the voiced speech. The quantization scheme developed for glottal closure instants (GCIs) causes an increase in the bit requirement for voiced frames by 4-25 bits depending on the position of GCIs. To compensate for that, the Mel-LPC order for both silence and unvoiced frames has been brought down to 4 without compromising the perceptual quality of reconstructed speech. The lowered bit budget for an unvoiced frame is 41 bits/frame, and for silence, it is 31 bits/frame. A further reduction of 10 bits for the silence frame is obtained by reducing the number of transmitted parameters and by tuning the quantization bit requirement for each. For categorizing the speech frames at the entry of the encoder, a neural network-based voiced/unvoiced/silence classification algorithm using a five-dimensional feature set is created. The experimental results show that the proposed coding scheme operates at an average bit rate of 2 kbps, which is less than the bit rate of MELPe (2.4 kbps), but with a better perceptual score. In addition, the incorporation of Mel-LPC gives a better performance in the estimation of formants and GCIs.

On improving voice periodicity prediction in codebook-excited LPC coders

Daniel Lin

The Journal of the Acoustical Society of America, 1988

The Influence of Speech Enhancement Algorithm in Speech Compression with Voice Excited Linear Predictive Coding

deepa dhanaskodi

Problem statement: Speech enhancement plays an important role in speech processing systems such as speech recognition, speech coding, mobile communication, and hearing aids. Approach: In this work, the performance of the speech coding method is enhanced by using speech enhancement as the preprocessing technique. The purpose of the proposed method is to reduce the bit rate of the speech signal to be transmitted, so that the bandwidth can be utilized efficiently. In a noisy environment, speech coding is done both for the desired speech and the unwanted noise signal. If the noise is reduced before coding the speech signal, the bit rate required will also be reduced. In this work, a simple adaptive speech enhancement technique, using an adaptive sigmoid-type function to determine the weighting factor of the TSDD algorithm, is employed based on a subband approach for speech enhancement, and the voice excited linear predictive coding (VELP) method is used for coding the speech signal. Results:...

A mixed sinusoidally excited linear prediction coder at 4 kb/s and below

Vishu Viswanathan

There is currently a great deal of interest in the development of speech coding algorithms capable of delivering toll quality at 4 kb/s and below. For synthesizing high quality speech, accurate representation of the voiced portions of speech is essential. For bit rates of 4 kb/s and below, conventional code excited linear prediction (CELP) may likely not provide the appropriate degree of periodicity. It has been shown that good quality low bit rate speech coding can be obtained by frequency domain techniques such as sinusoidal transform coding (STC), multi-band excitation (MBE), mixed excitation linear prediction (MELP), and multi-band LPC (MB-LPC) vocoders. In this paper, a speech coding algorithm based on an improved version of MB-LPC is presented. Main features of this algorithm include a multi-stage time/frequency pitch estimation and an improved mixed voicing representation. An efficient quantization scheme for the spectral amplitudes of the excitation, called formant weighted ...

A unified framework for LPC excitation representation in residual speech coders

David Malah

International Conference on Acoustics, Speech, and Signal Processing, 1989

In this paper the efficient representation of the excitation signal to an LPC synthesis filter by means of a vector expansion of the residual signal is examined. According to this approach the excitation signal is represented as a linear combination of a small number of vectors taken from a given vector set, known at both ends of the transmission channel. It is demonstrated that this approach provides a unified framework for describing and analyzing a wide range of residual speech coders, from Multipulse LPC and CELP to Residual Transform Coders and leads to generalization of some of these schemes. Optimality conditions based on the singular value decomposition (SVD) of the impulse response matrix of the perceptually weighted LPC synthesis filter are given. A resulting simplified Predictive Transform Coder is proposed and examined by computer simulations.

Principles of Speech Coding

SIm NARASIMHA

Principles of Speech Coding, 2010

2.1 Introduction to LTI Systems: 2.1.1 Linearity; 2.1.2 Time Invariance; 2.1.3 Representation Using Impulse Response; 2.1.4 Representation of Any Continuous-Time (CT) Signal; 2.1.5 Convolution; 2.1.6 Differential Equation Models
2.2 Review of Digital Signal Processing: 2.2.1 Sampling; 2.2.2 Shifted Unit Pulse: δ(n-k); 2.2.3 Representation of Any DT Signal; 2.2.4 Introduction to Z Transforms; 2.2.5 Fourier Transform, Discrete Fourier Transform; 2.2.6 Digital Filter Structures
2.3 Review of Stochastic Signal Processing: 2.3.1 Power Spectral Density
2.4 Response of a Linear System to a Stochastic Process Input
2.5 Windowing
2.6 AR Models for Speech Signals, Yule-Walker Equations
2.7 Short-Term Frequency (or Fourier) Transform and Cepstrum: 2.7.1 Short-Term Frequency Transform (STFT); 2.7.2 The Cepstrum
2.8 Periodograms
2.9 Spectral Envelope Determination for Speech Signals
2.10 Voiced/Unvoiced Classification of Speech Signals: 2.10.1 Time-Domain Methods (2.10.1.1 Periodic Similarity; 2.10.1.2 Frame Energy; 2.10.1.3 Pre-Emphasized Energy Ratio; 2.10.1.4 Low- to Full-Band Energy Ratio; 2.10.1.5 Zero Crossing; 2.10.1.6 Prediction Gain; 2.10.1.7 Peakiness of Speech; 2.10.1.8 Spectrum Tilt); 2.10.2 Frequency-Domain Methods; 2.10.3 Voiced/Unvoiced Decision Making
2.11 Pitch Period Estimation Methods
2.12 Summary; Exercise Problems; References; Bibliography
3. Sampling Theory: 3.1 ...
4.10 ITU G.711 μ-Law and A-Law PCM Standards: 4.10.1 Conversion between Linear and Companded Codes (4.10.1.1 Linear to μ-Law Conversion; 4.10.1.2 μ-Law to Linear Code Conversion; 4.10.1.3 Linear to A-Law Conversion; 4.10.1.4 A-Law to Linear Conversion)
4.11 Optimum Quantization: 4.11.1 Closed Form Solution for the Optimum Companding Characteristics; 4.11.2 Lloyd-Max Quantizer
4.12 Adaptive Quantization

References (4)

Zarkadis, D.J.; Evans, B.G., "Performance considerations of a 9.6kb/s RELP coder," IEEE Trans., pp. 172-177, August 2002.
Katterfeldt, H., "A DFT-based residual-excited linear predictive coder," IEEE INFOCOM 2003, pp. 824-827, January 2003.
Katterfeldt, H.; Behl, E., "Implementation of a robust RELP speech coder," IEEE, 1983, pp. 1316-1319.
Chong Un; Magill, D., "The Residual-Excited Linear Prediction Vocoder," IEEE Comm., pp. 1466-1474, Jan. 2003.

International Journal of Engineering Research & Technology (IJERT), ISSN: 2278-0181, www.ijert.org, IJERTV4IS050982, Vol. 4 Issue 05, May-2015. (This work is licensed under a Creative Commons Attribution 4.0 International License.)

muhammad sajid

The telecommunication industry is growing, and competitors are rapidly introducing different services to attract users. Speech communication, with preservation of its quality, is the most prevalent and common service provided by almost all companies. The objective of this project is the development of an LPC (Linear Predictive Coding) based voice coder. Speech attributes such as pitch, the voiced/unvoiced decision, and silence were extracted, and speech was modeled using LDR (Levinson Durbin Recursion) and SDA (Steepest Descent Algorithm). The LPC filter is analyzed and its model is implemented. The complexity, delay, and bit rate of LPC are discussed and the tradeoffs are highlighted. The results were analyzed, and the quality of speech was determined using a spectrograph and by listening to the synthesized speech. Finally, the quality of the original and synthesized speech is discussed and shown graphically, and a brief comparison between the two techniques mentioned above is also added.

A Wide Band Speech Coding Technique using Low Delay Code Excited Linear Predictive Algorithm (LD-CELP)

Dr. HEMANT PUROHIT

Proceedings of the Second International Conference on Research in Intelligent and Computing in Engineering, 2017

A fair level of speech quality is desired in speech transmission for mobile voice services. Effective utilization of bandwidth and a higher bit rate are both required for a high-quality speech coder, but the two requirements cannot be fully met at the same time. Research on speech coder design is ongoing. In general, CELP is an algorithm for designing a good-quality speech coder, and the technique has continued to advance from the 1980s to the present. In this paper, a wideband speech coding technique is proposed using the LD-CELP algorithm. The overall performance of LD-CELP (16 kbps) is summarized and computed in MATLAB version R2016a using the MSE and SNR metrics. In conclusion, we observe that the SNR for LD-CELP is not very good and that improvement is necessary.

Code-excited linear prediction (CELP): High-quality speech at very low bit rates

Arun Raj

1985

We describe in this paper a code-excited linear predictive coder in which the optimum innovation sequence is selected from a code book of stored sequences to optimize a given fidelity criterion. Each sample of the innovation sequence is filtered sequentially through two time-varying linear recursive filters, one with a long-delay (related to pitch period) predictor in the feedback loop and the other with a short-delay predictor (related to spectral envelope) in the feedback loop. We code speech, sampled at 8 kHz, in blocks of 5-msec duration. Each block consisting of 40 samples is produced from one of 1024 possible innovation sequences. The bit rate for the innovation sequence is thus 1/4 bit per sample. We compare in this paper several different random and deterministic code books for their effectiveness in providing the optimum innovation sequence in each block. Our results indicate that a random code book has a slight speech quality advantage at low bit rates. Examples of speech produced by the above method will be played at the conference.
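The bit-rate figure quoted in this abstract follows from simple arithmetic, which the short sketch below reproduces. The function name `codebook_bit_budget` and its parameters are hypothetical, introduced only for illustration; the calculation covers the innovation-sequence index alone and ignores the side information (predictor parameters, gains) that a complete coder would also transmit.

```python
import math

def codebook_bit_budget(sample_rate_hz, block_ms, codebook_size):
    """Return (samples per block, bits per sample, bits per second)
    spent on the codebook index alone."""
    samples = round(sample_rate_hz * block_ms / 1000)
    bits = math.log2(codebook_size)   # bits needed to index one codebook entry
    return samples, bits / samples, bits * 1000 / block_ms
```

With the values from the abstract (8 kHz sampling, 5 ms blocks, 1024 sequences), `codebook_bit_budget(8000, 5, 1024)` gives 40 samples per block, 10 index bits per block, and therefore 1/4 bit per sample, or 2000 bit/s for the innovation sequence.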

RELP Coding for Voice Communication

norman lopez

ece.uprm.edu

In mobile communication systems, bandwidth is a precious commodity, and service providers are continuously faced with the challenge of accommodating more users within a limited allocated bandwidth. Linear Predictive Coding (LPC) offers low bit-rate speech coding that can be used to meet this challenge. The lower the bit rate at which the coder can deliver toll quality speech, the more speech channels can be compressed within a given bandwidth. Work done in a class of LPC, Residual Excited LPC (RELP), is presented in this paper. The RELP coding of speech and its transmission were implemented using the MATLAB numerical computation software package.

Implementation of attractive Speech Quality for Mixed Excited Linear Prediction

Aalay Mehta

Nowadays the number of mobile subscribers is increasing all over the world, so communication systems have to be improved. The Mixed Excited Linear Prediction (MELP) algorithm was developed to reduce the bandwidth of the signal and to allow more data to be transmitted on a single channel. This increases the channel capacity and the number of users per channel. MELP is basically a speech coding method that relies on a speech encoder and a speech decoder. The MELP speech coder reduces the redundancy of the signal and compresses it into the MELP code. The speech decoder includes a Linear Predictive Coding (LPC) filter that provides synthesized speech at its output in response to voiced and unvoiced input. MELP also reduces voice jitter. The bit rate of MELP is reducing the reserves of the code book and calculation complexity. This paper describes how "the bit rates of MELP coder can be reduced to as low as 2.4kbps without apparent damage to the speech quality."

Predictive coding of speech signals and subjective error criteria

Bishnu Atal

ICASSP '78. IEEE International Conference on Acoustics, Speech, and Signal Processing, 1978

A Comparative Study of Speech Coding Techniques for Electro Larynx Speech Production

Abdulkareem Kadhim

Iraqi journal of information and communication technology, 2022

Speech coding is a method of obtaining a compact representation of speech signals for efficient storage and efficient transmission over band-limited wired or wireless channels. This is usually achieved with an acceptable representation and the smallest number of bits, without degrading the perceptual quality. A number of speech coding methods have already been developed, and various speech coding algorithms for speech analysis and synthesis are used. This paper deals with the comparison of selected coding methods for speech signals produced by the Electro Larynx (EL) device. The latter is a device used by cancer patients whose vocal cords have been removed. The methods used are Residual-Excited Linear Prediction (RELP), Code Excited Linear Prediction (CELP), Algebraic Code Excited Linear Predictive (ACELP), Phase Vocoders based on Wavelet Transform (PVWT), Channel Vocoders based on Wavelet Transform (CVWT), and Phase vocoder based on Dual-Tree Rational-Dilation Complex Wavelet Transform (PVDT-RADWT). The aim here is to select the best coding approach based on the quality of the reproduced speech. The signal used in the test is a speech signal either recorded directly from normal speakers or produced by the EL device. The performance of each method is evaluated using both objective and subjective listening tests. The results indicate that PVWT and ACELP coders perform better than other methods having about 40 dB SNR and 3 PESQ score for EL speech and 75 dB with 3.5 PESQ score for normal speech, respectively.

Pitch synchronous innovation code excited linear prediction (PSI-CELP)

Takehiro Moriya

Electronics and Communications in Japan (Part III: Fundamental Electronic Science), 1994

This paper proposes a new speech coding method, the pitch synchronous innovation code excited linear predictor (PSI-CELP). This method is based on CELP but adds pitch synchronous innovation, so that even random codevectors are adaptively converted to have pitch periodicity for voiced frames. This scheme can improve the synthesized speech quality of voiced frames in low bit-rate CELP without increasing either computational complexity or bit rate.

High Quality Low-Delay Speech Coding at 12 kb/s

Majid Foodeei

Speech coding at 4 kb/s and lower using single-pulse and stochastic models of LPC excitation

Bishnu Atal

1991

Accurate representation of periodic speech segments is essential for synthesizing high-quality digital speech. For bit rates at and below 4 kb/s, conventional code-excited linear predictive coding (CELP) does not provide the appropriate degree of periodicity. Small codebook size and coarse quantization of gain factors result in large spectral fluctuations between pitch periods. At low bit rates, smoothness of spectral changes can be achieved by an excitation function which contains one excitation pulse of fixed or slowly time-varying shape for each pitch period. Earlier work has shown that single-pulse excitation can achieve reasonably good speech quality. However, this method was based on an optimization procedure that caused a very large coding delay. In this paper, we present a linear predictive coder that classifies speech into periodic and nonperiodic intervals. Nonperiodic speech is synthesized as in CELP. Periodic speech is synthesized using single-pulse excitation. The coder is based on a new algorithm for determining pitch markers within short blocks of periodic speech. This algorithm requires a small coding delay and is implemented efficiently by dynamic programming.
