Acoustic features representing the audio information can be extracted from the speech signal at the segmental level. Segmental features are extracted from short (10 to 30 ms) segments of the speech signal and represent its short-time spectrum. The short-time spectral envelope of the speech signal is attributed primarily to the shape of the vocal tract. Mel-frequency cepstral coefficients (MFCC) are commonly used in speech processing. Fig. 2 illustrates the computation of MFCC features for a segment of the audio signal, which is described as follows:

Figure 2. Computation of MFCC features for a segment of the audio signal.
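For orientation, a conventional MFCC pipeline can be sketched in a few lines of NumPy. This is a minimal illustration of the general technique, not the authors' implementation: the function names (`frame_signal`, `mel_filterbank`, `dct_ii`, `mfcc`) are hypothetical, and the parameter values (16 kHz sampling, 25 ms frames, 10 ms hop, 512-point FFT, 26 mel filters, 13 coefficients) are common illustrative choices rather than values taken from the paper.

```python
import numpy as np

def hz_to_mel(f):
    # Common mel-scale formula (one of several variants in use).
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def frame_signal(signal, frame_len, hop_len):
    # Split the signal into overlapping short-time frames (one per row).
    n_frames = 1 + (len(signal) - frame_len) // hop_len
    idx = np.arange(frame_len)[None, :] + hop_len * np.arange(n_frames)[:, None]
    return signal[idx]

def mel_filterbank(n_filters, n_fft, sample_rate):
    # Triangular filters with centers equally spaced on the mel scale.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    fbank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):
            fbank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):
            fbank[i - 1, k] = (right - k) / (right - center)
    return fbank

def dct_ii(x, n_out):
    # Unnormalized DCT-II along the last axis, keeping the first n_out terms.
    n = x.shape[-1]
    k = np.arange(n_out)[:, None]
    m = np.arange(n)[None, :]
    basis = np.cos(np.pi * k * (2 * m + 1) / (2 * n))
    return x @ basis.T

def mfcc(signal, sample_rate=16000, frame_ms=25.0, hop_ms=10.0,
         n_fft=512, n_filters=26, n_ceps=13):
    # All default values are illustrative assumptions, not from the paper.
    frame_len = int(round(sample_rate * frame_ms / 1000.0))
    hop_len = int(round(sample_rate * hop_ms / 1000.0))
    # 1. Frame the signal and apply a Hamming window to each frame.
    frames = frame_signal(signal, frame_len, hop_len) * np.hamming(frame_len)
    # 2. Short-time power spectrum of each frame.
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # 3. Mel filterbank energies, floored to avoid log(0).
    energies = power @ mel_filterbank(n_filters, n_fft, sample_rate).T
    log_energies = np.log(np.maximum(energies, 1e-10))
    # 4. DCT of the log energies gives the cepstral coefficients.
    return dct_ii(log_energies, n_ceps)

# Example: one second of a 440 Hz tone sampled at 16 kHz.
t = np.arange(16000) / 16000.0
features = mfcc(np.sin(2.0 * np.pi * 440.0 * t))
```

Each row of `features` holds the coefficients for one short-time frame, so the matrix has one row per frame and `n_ceps` columns. Practical systems often add pre-emphasis, liftering, and delta features, which this sketch omits.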