Design and implementation of a DSP based MPEG-1 audio encoder
1999, IEEE Transactions on Consumer Electronics
https://doi.org/10.1109/30.754414…
6 pages
1 file
Sign up for access to the world's latest research
Abstract
AI
AI
The paper discusses the design and implementation of a Digital Signal Processor (DSP) based MPEG-1 audio encoder, highlighting its advantages over ASIC implementations. The encoder converts audio signals into MPEG bitstreams at varying bit rates, facilitating immediate transmission to a PC for storage. It details the MPEG encoding process, including psychoacoustic modeling and subband filtering, and provides insights into potential issues with fixed-point calculations versus floating-point calculations in audio encoding.
Key takeaways
AI
AI
- The DSP-based MPEG audio encoder dynamically allocates bits among 32 subbands based on signal masking.
- The encoder operates at bit rates ranging from 32 kbps to 320 kbps, constrained by PC transfer limitations.
- Psychoacoustic calculations for Signal to Mask Ratio (SMR) are approximated due to FFT removal.
- The DSP56302EVM offers a low-cost platform with essential features for real-time audio processing.
- The implementation minimizes hardware requirements by using a standard bidirectional parallel port for PC interfacing.
Related papers
1993
Compared to most digital data types, with the exception of digital video, the data rates associ-ated with uncompressed digital audio are substan-tial. Digital audio compression enables more effi-cient storage and transmission of audio data. The many forms of audio compression techniques offer a range of encoder and decoder complexity, compressed audio quality, and differing amounts of data com-pression. The -law transformation and ADPCM coder are simple approaches with low-complexity, low-compression, and medium audio quality algo-rithms. The MPEG/audio standard is a high-complexity, high-compression, and high audio qual-ity algorithm. These techniques apply to general au-dio signals and are not specifically tuned for speech signals.
IEEE Signal Processing Magazine, 2000
JOURNAL-AUDIO …, 2007
An overview of the recently finalized ISO/MPEG standard for multichannel audio com-pression MPEG Surround is provided. This audio compression scheme enables backward-compatible multichannel audio coding and transmission at unsurpassed coding efficiency. This is ...
The Journal of the Acoustical Society of America, 2007
List of Contents 1. Introduction 1.1. Historical perspective 1.2. A General perceptual audio coding architecture 1.3. Audio Coder Attributes 1.3.1. Audio quality 1.3.2. Bitrates 1.3.3. Complexity 1.3.4. Codec delay 1.3.5. Error robustness 1.4. Types of audio coders-An overview 1.5. Organization of the book 1.6. Notational conventions 2. Signal Processing Essentials 2.1. Introduction 2.2. Spectra of analog signals 2.3. Review of convolution and filtering 2.4. Uniform sampling 2.5. Discrete-time signal processing 2.5.1. Transforms for discrete-time signals 2.5.2. The discrete and the fast Fourier transform 2.5.3. The discrete cosine transform 2.5.4. The short-time Fourier transform 2.6. Difference equations and digital filters 2.7. The transfer and the frequency response functions 2.7.1. Poles, zeros, and frequency response 2.7.2. Examples of digital filters for audio applications 2.8. Review of multirate signal processing 2.8.1. Down-sampling by an integer 2.8.2. Up-sampling by an integer 2.8.3. Sampling rate changes by non-integer factors 2.8.4. Quadrature mirror filter banks 2.9. Discrete-time random signals 2.9.1. Random signals processed by LTI digital filters 2.9.2. Autocorrelation estimation from finite-length data 2.10. Summary 3. Quantization and Entropy Coding 3.1.
KnE Engineering, 2018
An eight channels subband audio codec is implemented for signals with 44.1 KHz and 16 bits per sample using Matlab. To achieve perfect reconstruction, a two channels QMF filter bank with cutoff frequency ω=π/2 is designed, based on an equiriple filter of 99 order. Seven stages of this bank are used to split the input into eight signals with sample rates from 2.76 to 11 KHz, which are coded from 1 to 16 bits depending on the band energy. To evaluate performance for three tracks in terms of similarity of input and output signals, a Mean Opinion Score (MOS) experiment with fifteen subjects was performed. The Euclidean Distance between spectrums was also measured. Results showed a fair similitude for two tracks and excellent for one. Compression factors above 96% were achieved.Keywords: subband coding, mean opinion score MOS, QMF filter banks, audio compression
Presented at the …, 2006
In 2004 the ISO/MPEG Audio standardization group started a new work item on efficient and backward-compatible coding of high-quality multichannel sound using parametric coding techniques. Finalized in the fall of 2006, the resulting MPEG Surround specification allows the transmission of surround sound at bit rates that have been commonly used for coding of mono or stereo sound. The results of the standardization process are summarized by describing the underlying ideas and providing an overview of the MPEG Surround technology. The performance of the scheme is characterized by the results of recent verification tests. These tests include several operation modes as they would be used in typical application scenarios to introduce multichannel audio into existing audio services.
2004
Lossless Wideband Audio Compression: Prediction and Transform This thesis studies lossless audio compression. In the domain of lossless compression, research takes place on two broad development sections, signal modeling and coding algorithm. The former is concerned with the understanding of the source signal, while coding is the more tightly specified task of efficiently representing a single symbol as a code. The focus of this thesis is the evaluation and the development of signal modeling techniques for lossless compression. Related with the modeling method used to decorrelate a signal, the data compression schemes are generally divided in two categories, predictive modeling and transform-based modeling. In the thesis, all two categories are investigated in depth and handled from the lossless viewpoint. The first contribution of the thesis is an exploration of the general audio compression systems including the lossy compression system. In predictive modeling, the structures of various linear prediction filters are introduced by presenting the fundamental autoregressive modeling. The prediction filters including the approaches to the nonstationary signal modeling and to the adaptive linear prediction filters are explored and evaluated by testing within a prototypical lossless audio compression system. For transform modeling, two well-known subband transform coding methods, Laplacian pyramid and subband coding scheme, are first described, and then the design methods of perfect reconstruction multirate filter banks are studied. Concerning with the modulated lapped orthogonal transform, the efficiency of linear prediction from subband and from fullband is formally compared and empirically examined. Wavelet transform is in depth studied from the various viewpoints in order to find the theoretical relationship between the wavelet and the multirate filter banks. Theoretical and practical aspects of reversible transforms are discussed by introducing the S-transform, S+P transform, and RTS transform. The lifting method is examined as a means to realize the biorthogonal wavelets. Integer lifting scheme with rounding-off method is investigated to construct reversible version of wavelet transforms and its performance is validated by applying to lossless audio compression. Finally, some of the more important results presented in this thesis are summarized with the suggesting directions for future research. ZUSAMMENFASSUNG Verlustfreie Kompression für Breitband-Audiosignale: Prädiktion und Transformation Das Ziel dieser Dissertation ist die Untersuchung von verlustfreier Audiokompression. Im Bereich der verlustfreien Kompression teilt sich die Forschung in zwei große Entwicklungsabschnitte, nämlich Signalmodellierung und Codierungsverfahren. Die Signalmodellierung befasst sich mit dem Verstehen des Quellsignals, während sich die Codierung mit dem speziellen Problem der effizienten Repräsentation von Einzelsymbolen befaßt. Der Fokus dieser Dissertation liegt auf der Evaluation und Entwicklung von Signalmodellierungstechniken für die verlustfreie Kompression. Mit Bezug auf die Modellierungsmethode zur Dekorrelation des Signals fallen die Datenkompressionssysteme in zwei Kategorien, nämlich die prädiktive Modellierung und die auf Transformation basierende Modellierung. In der Dissertation wurden beide Kategorien aus der Sicht verlustfreier Kompression ausführlich untersucht. Der erste Beitrag der Dissertation ist der Erforschung von Audiokompressionssystemen einschließlich der verlustbehafteten Kompressionssysteme gewidmet. Bei der prädiktiven Modellierung werden die verschiedenen Strukturen der Linear-Prädiktionsfilter durch Vorlage der fundamentalen autoregressiven Modellierung vorgestellt. Die Prädiktionsfilter einschließlich der Ansätze zur Modellierung des nicht-stationären Signals und adaptiven Linear-Prädiktionsfiltern werden vorgestellt, und deren Effizienz im Experiment mit einem prototypischen verlustfreien Audiokompressionssystem evaluiert. Bezüglich der Transformationsbasierten Modellierung werden zwei prominente Codierungsmethoden beschrieben, die auf Subbandtransformation basieren, nämlich Laplacian Pyramide und Subbandcodierung, beschrieben, und die Entwurfsverfahren für perfekt rekonstruierbare Multirate-Filterbänke werden dargestellt. Die Effizienz linearer Prädiktion von Vollbandsignalen und Subbandsignalen wird theoretisch untersucht und empirisch validiert. Die Wavelet-Transformation wird nach verschiedenen Aspekten gründlich untersucht, um die theoretische Verknüpfung zwischen der Wavelet-Transformation und der Multirate-Filterbank zu diskutieren. Theoretische und praktische Aspekte von reversiblen Transformationen werden in Zusammenhang mit S-Transformation, S+P Transformation, und RTS Transformation untersucht. Die Liftingmethode, die die biorthogonalen Wavelets realisiert, wird untersucht. Um eine reversible Wavelet-Transformation zu konstruieren, wird das mit Ganzzahlen operierende Liftingsystem durch Abrundungsverfahren entwickelt. Dessen Funktion wird durch Anwendung auf die verlustfreie Audiokompression validiert. Zum Schluss sind einige wichtige Untersuchungsergebnisse der Dissertation mit Vorschlägen für zukünftige Forschungsarbeiten zusammengefasst. Postgraduate-Scholarship NaFöG. This thesis is dedicated to the memory of my supervisor Prof. Dr.-Ing. Manfred Krause. As a teacher and a partner, he introduced me to this intriguing field of research and continuously taught me precise scientific thinking and writing. I feel deeply grateful for his academic support and moral endorsement through the years. The scientific community of audio researchers suffered from a great loss of a person who contributed outstanding ardour and affection to the area, when he passed away a few months ago. I would like to thank Prof. Dr. Klaus Hobohm and Prof. Dr. de la Motte-Haber for reviewing this thesis and numerous valuable comments and suggestions. I also thank Dr.-Ing. Axel Röbel who works currently at IRCAM, France, for fruitful discussions in this work. Doris Grasse deserves special thanks for all arrangements of my postgraduate life in the Institute. My current colleagues at Multimedia Lab of University Augsburg have also been very supportive for successful conclusion of this work. I would like to thank Prof. Dr. Elisabeth André who gave me the opportunity to continue and finish my work I started in Berlin. Thanks to Dr. Martin Müller for being my best friend. Finally, my deepest thanks for beaming love from my parents and brothers. Thanks to Jongkwan for bringing me to the sunny places. Thanks, Jongihn, gathering our family together always in a good atmosphere. Thanks, Jongho, for reminding how science and art go together, and being my best friend. Thanks to my mother who always understood the value of continuing my long schooling. Thanks to my father for encouraging me to follow my own path to happiness, letting me know you are always there for me, and I deeply respect your honest life of 40 years as a schoolteacher. Eternal gratitude goes to my wife Heejin whose unfathomable love and understanding during this work made it possible for me to keep going. Thanks to our son Nova who does not pose any questions related to the contents of this thesis yet.
2008
Publication in the conference proceedings of EUSIPCO, Lausanne, Switzerland, 2008
Journal of the Audio …, 2007
Recently the MPEG Audio standardization group has successfully concluded the standardization process on technology for lossless coding of audio signals. A summary of the scalable lossless coding (SLS) technology as one of the results of this standardization work is given. MPEG-4 scalable lossless coding provides a fine-grain scalable lossless extension of the well-known MPEG-4 AAC perceptual audio coder up to fully lossless reconstruction at word lengths and sampling rates typically used for high-resolution audio. The underlying innovative technology is described in detail and its performance is characterized for lossless and near lossless representation, both in conjunction with an AAC coder and as a stand-alone compression engine. A number of application scenarios for the new technology are discussed.
2004 23rd IEEE Convention of Electrical and Electronics Engineers in Israel, 2004

Loading Preview
Sorry, preview is currently unavailable. You can download the paper by clicking the button above.
References (4)
- C. D. Murphy and K. Anandakumar, "Real-Time MPEG-1 Audio Coding and Decoding on a DSP Chip", IEEE transactions on Consumer Electronics, Vol. 43, NO. 1, February 1997, pp. 40-47.
- Davis Pan, "A Tutorial on MPEWAudio Compression", IEEE Multimedia, Vol. 2, NO. 2, Summer 1995, pp. 60-74.
- ISO/IEC I n k r M t i O ! l d Standard 11172-3 "Information technology -Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbits/s -part 3:Audio", Switzerland Aug 1993.
- Davis Pan, Working Draft of Technical Report, ISO/IEC JTCl/SC29/WG 11, Nov. 1992. MPEWAudio Eric Hwkstra is a MSBS student in Computer Engineering at Rochester Lnstitute of Technology, Rochester, New York. His research interest include data compression and digital signal processing. Muhammad Shaaban is currently an assistant professor in the Department of Computer Engineerhg at Rochester Institute of Technology. He received his BS and MS in electrical engineering from University of Petroleum and Minerals, Saudi Arabia and his Ph.D. in computer engineering from University of Southern California.