Digital Audio

description904 papers

group1,377 followers

lightbulbAbout this topic

Digital audio refers to the representation of sound in a digital format, where audio signals are converted into binary data for storage, processing, and transmission. This field encompasses various techniques for audio encoding, compression, and playback, enabling high-fidelity sound reproduction and manipulation in various applications, including music production, broadcasting, and telecommunications.

lightbulbAbout this topic

Key research themes

1. How are digital audio effects designed and modeled to manipulate and enhance musical sound?

This research area investigates the design, modeling, and implementation of digital audio effects (DAFx) that manipulate musical sound for creative and production purposes. It focuses on algorithmic development, real-time processing, and the application of machine learning for black-box and parametric modeling of audio effects such as equalization, amplification, and non-linear distortions. These advances are significant for music production, mixing, live performance, and the emulation of analog hardware in digital environments.

Special Issue on Digital Audio Effects

by Federico Fontana

2021, Applied Sciences

Key finding: This comprehensive collection reveals that the field of digital audio effects is undergoing a shift towards integrating machine learning methods, as shown by contributions applying deep neural networks for modeling analog... Read more

articleView Paper downloadDownload

Modulation And Delay Line Based Digital Audio Effects

by Sascha Disch

2016

Key finding: This work categorizes audio effects based on amplitude and phase modulation techniques using delay lines, explaining the signal processing foundations underlying effects such as vibrato, flanging, chorus, and rotary speaker... Read more

articleView Paper downloadDownload

All About Audio Equalization: Solutions and Frontiers

by Vesa Välimäki

2025, Applied Sciences

Key finding: This seminal review consolidates diverse digital audio equalization techniques from classic to cutting-edge approaches within a unified mathematical framework, emphasizing parametric and shelving filters, graphic equalizers,... Read more

articleView Paper downloadDownload

New Enhancements to Immersive Sound Field Rendition (ISR) System

by Deepen Sinha

2024, Audio Engineering Society …

Key finding: This paper presents innovations to parametric multichannel audio coding through the Immersive Sound-field Rendition (ISR) system, focusing on improvements like phase compensated down-mixing and blind up-mixing schemes for low... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What advances enable automatic detection and removal of non-speech vocal sounds (e.g., breath sounds) in digital audio recordings?

This important subfield addresses the challenge of identifying and eliminating unwanted non-speech vocal sounds such as breaths that may detract from the clarity of vocal recordings in music, broadcasting, and speech processing. The focus encompasses algorithmic approaches employing signal processing and deep learning methods to automate this laborious process. Effective solutions improve both production efficiency and audio quality, with notable progress shown in attention U-Net architectures that combine accuracy with reduced computational complexity.

Attention-Based Efficient Breath Sound Removal in Studio Audio Recordings

by Nidula Elgiriyewithana

2024

Key finding: This study introduces a parameter-efficient deep learning model leveraging an attention U-Net architecture for automatic detection and eradication of breath sounds in vocal recordings. Trained on a unique DAPS-derived dataset... Read more

articleView Paper downloadDownload

Modulation And Delay Line Based Digital Audio Effects

by Sascha Disch

2016

Key finding: The presented modulation-based effects models inherently manage dynamic changes in audio signal properties suited to vocal manipulation. The insights into phase and amplitude modulation mechanisms inform the design of... Read more

articleView Paper downloadDownload

3. How are digital sound synthesis and digital sound reconstruction advancing in generating realistic audio?

This theme covers algorithmic innovations in synthesizing sounds digitally, particularly through physical modeling and digital reconstruction techniques. The goal is to produce realistic and high-fidelity synthetic musical or acoustic sounds. Physical modeling simulates vibrating physical structures, enabling expressive synthesis linked to instrument mechanics. Digital sound reconstruction (DSR) techniques aim to overcome limitations of traditional loudspeaker designs, especially in low-frequency domains, using array configurations and shutter gate mechanisms to enhance sound pressure output. These approaches enable real-time synthesis and new sound generation paradigms critical for musical instrument digital interfaces and audio reproduction devices.

Digital sound synthesis by physical modelling

by Rudolf Rabenstein

2024, ISPA 2001. Proceedings of the 2nd International Symposium on Image and Signal Processing and Analysis. In conjunction with 23rd International Conference on Information Technology Interfaces (IEEE Cat. No.01EX480)

Key finding: This article systematically presents physical modeling methods for digital sound synthesis, transforming partial differential equation-based models of vibrating structures (such as strings and drums) into discrete-time... Read more

articleView Paper downloadDownload

A New Method for Sound Generation Based on Digital Sound Reconstruction

by Manfred Kaltenbacher

2022, Journal of Theoretical and Computational Acoustics

Key finding: The paper introduces Advanced Digital Sound Reconstruction (ADSR), which improves upon classical DSR by incorporating shutter gates and redirection mechanisms to boost sound pressure level particularly in mid-to-low frequency... Read more

articleView Paper downloadDownload

All papers in Digital Audio

5 Prácticas Básicas de Adobe Audition

by José Carlos Barceló

Instrucciones para realizar cinco practicas de nivel básico en Adobe Audition

descriptionView Paper arrow_downwardDownload

The SuperCollider Italian Manual, at CIRMA

by Andrea Valle

2008

[ITA] A handbook that introduces some basics aspects of SuperCollider music and audio environment

descriptionView Paper arrow_downwardDownload

Del medio que luego invade: Experiencias creativas a partir del paisaje sonoro y el glitch, posturas estéticas mediadas por la tecnología (2013-15)

by fabián avila elizalde

La presente investigación exploró el papel de las mediaciones tecnológicas, psicoacústicas y socioculturales, involucradas en siete experiencias de creación sonora, tanto individuales como colaborativas, bajo las posturas estéticas del... more

descriptionView Paper arrow_downwardDownload

Homo Audiens III Conocer la radio: Textos teóricos para aprehenderla

by Virginia Medina Avila and

2018

La presencia de la radio en México se acerca a su centenario y a pesar de que la bibliografía publicada en torno a ella es abundante queda mucho por investigar sobre su pasado y su presente para intentar una prospectiva de su porvenir.... more

descriptionView Paper arrow_downwardDownload

Zen a Musing: A Suite of Recombinant Digital Music

by Colin P McGuire

This paper is about a suite of electroacoustic music inspired by multiculturalism and DNA. Samples, live playing, and synthetic sounds were combined using digital technology into a dance-informed, world-flavored, concert-oriented,... more

descriptionView Paper arrow_downwardDownload

An Improved and Generalized Diode Clipper Model for Wave Digital Filters

by Kurt J A M E S Werner and

Proceedings of the Audio Engineering Society

We derive a novel explicit wave-domain model for “diode clipper” circuits with an arbitrary number of diodes in each orientation, applicable, e.g., to wave digital filter emulation of guitar distortion pedals. Improving upon and... more

descriptionView Paper arrow_downwardDownload

Caleb Stuart, "The Object of Performance: Aural Performativity in Contemporary Laptop Music," Contemporary Music Review , 2003, V OL . 22, No. 4, 59–65

by Caleb Kelly

The use of the laptop in performance causes various negative responses from the audience, who feel a loss of spectacle and performativity in the action. This occurs as a result of the lack of gesture and visual cues from the performer and... more

descriptionView Paper arrow_downwardDownload

More Cowbell: a Physically-Informed, Circuit-Bendable, Digital Model of the TR-808 Cowbell

by Kurt J A M E S Werner and

2014, Proceedings of the 137th Audio Engineering Society Convention

We present an analysis of the cowbell voice circuit from the Roland TR-808 Rhythm Composer. A digital model based on this analysis accurately emulates the original. Through the use of physical models of each sub-circuit, this model... more

descriptionView Paper arrow_downwardDownload

Hybrid Reverberator Using Multiple Impulse Responses for Audio Rendering Improvement

by Stefania Cecchi

2013, 2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

In the recent years, hybrid reverberation algorithms have been widely explored aiming to reproduce the acoustic behavior of real environment at low computational load. On this basis, exploiting the advantages introduced from hybrid... more

descriptionView Paper arrow_downwardDownload

Codificación Digital y Criptografía Aplicada en la Transmisión de Datos

by Roberto Carlos Ramirez Caicedo

2018, Codificación Digital y Criptografía Aplicada en la Transmisión de Datos

Una introducción con aplicaciones a las comunicaciones móviles y a la
modulación en audio digital

descriptionView Paper arrow_downwardDownload

Designing Quasi-Linear Phase IIR Filters for Audio Crossover Systems by using Swarm Intelligence

by stefano squartini and

In sound reproduction systems the audio crossover plays a fundamental role. Nowadays, digital crossover based on IIR filters are commonly employed, of which non-linear phase is a relevant topic. For this reason, solutions aiming to IIR... more

descriptionView Paper arrow_downwardDownload

Deep Learning for Timbre Modification and Transfer: an Evaluation Study

by stefano squartini and

In the past years, several hybridization techniques have been proposed to synthesize novel audio content owing its properties from two audio sources. These algorithms, however, usually provide no feature learning, leaving the user, often... more

Fig. 1: Overview of the proposed architecture. Gabrielli, Cella, Vesperini, Droghini, Principi, Squartini Deep Learning for Timbre Modification

Gabrielli, Cella, Vesperini, Droghini, Principi, Squartini

descriptionView Paper arrow_downwardDownload

La era dorada del audio digital

by Luis Miguel Pedrero Esteban

Innovación Audiovisual

Aunque nació y fue etiquetado hace más de una década, el fenómeno del podcasting resurge hoy apoyado en la normalización del smartphone como dispositivo de consumo, en la versatilidad del audio para contar o reforzar historias y en la... more

descriptionView Paper arrow_downwardDownload

Polyphonic Sound Event Detection by using Capsule Neural Networks

by stefano squartini and

2019, IEEE Journal of Selected Topics in Signal Processing

Artificial sound event detection (SED) has the aim to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, deep learning offers valuable techniques for this goal such as convolutional neural... more

descriptionView Paper arrow_downwardDownload

Bridging the gap between performers and the audience using networked smartphones: the a.bel system

by Alexandre Clément

The a.bel project aims to provide artists with a way to easily interact with their audience, making use of their participation to effectively craft unique performances. This paper gives an overview of the a.bel system and details the... more

descriptionView Paper arrow_downwardDownload

La notificación push como estrategia informativa de la radio en el entorno digital

by Luis Miguel Pedrero Esteban

2017, El Profesional de la Información

Ante la normalización del smartphone como dispositivo dominante de acceso a la información en el entorno digital, la radio española ha asumido la necesidad de hacerse presente en estas pantallas con el fin de atraer y facilitar su... more

descriptionView Paper arrow_downwardDownload

Bit Bending: an Introduction

by Mayank Sanganeria and

2013, Proceedings of the 16th International Conference on Digital Audio Effects (DAFx-13)

We introduce the technique of "Bit Bending," a particularly fertile technique for circuit bending which involves chort circuits and manipulations upon digital serial information. We present a justification for computer modeling of... more

Figure 1: Circuit-bent Speak & Spell with MIDI control, Kurt James Werner, 2007

Computer models of circuit-bent instruments are desirable for several reasons. Circuit bending is often performed on devices that are antiquated and fragile. A computer model is used for ar- chival purposes, to preserve historical sounds and practices. Cre- ating a circuit-bent instrument requires specialized knowledge and a significant time investment. A software model of a circuit- bent instrument (in an audio plugin, for instance) fits convenient- ly into various electronic music workflows, is not susceptible to physical damage or decay, and scales easily. A model’s insuscep- tibility to electrical damage enables fearless experimentation.

Figure 3: Circuit-bent Roland TR-606, Kurt James Wer- ner, 2009/2011 The technique of Bit Bending arose out of circuit-bent instru- ments created by one of the authors (Kurt James Werner) be- tween 2008 and 2011. Werner worked on four circuit-bent in- struments in particular (Casio SK-1 and SK-5, and Roland TR- 505 and TR-626, as shown in Figure 3) which all featured banana jack patchbays connected to digital circuitry inside (ROM chips), allowing access to any number of reconfigurable patchbay con- nections. Further work on the Yamaha PSS-170 and PSS-270 involved putting switches and a patchbay in series with critical circuit traces (leading to the main FM synthesis chip), allowing them to be cut and rerouted at will, creating new and varied mu- sical effects. At this time, the author had also been using home- made clock circuits to enable controllable, floating clock rates for all of these devices (enabling arbitrary transposition, in effect). Part of these clock circuits was a binary counter chip that allowed quick clock pitching by exact octave increments.

The exciting results from these initial experiments led to the es- tablishment of a new technique for circuit bending: “Bit Bend- ing.” Bit Bending encompasses the insertion of one or more digi- tal logic chips in series with an existing or added circuit trace, in the context of creating a circuit-bent instrument, as shown in Figure 5. So far, Bit Bending has typically been accomplished by housing digital logic chips (counters, NAND gates, etc.) in small ABS project enclosures, and using banana cables to communicate signals and draw power from the main board.

Figure 4: Spectrogram of circuit-bent instrument with “Counter bend” tapped at successively more significant bits over time Experiments with shorting jacks on the patchbay through binary counters (connecting one jack to the input pin of the counter, and connecting another jack to various output pins of the counter) yielded even more interesting results. Tapping different output pins of the counter often cause octave-like effects. For instance, moving toward a more significant bit off the counter often causes tonal components of the instrument’s sound to drop by an octave but also causes secondary timbral and temporal effects. It is in- structive to note that if an arbitrary binary signal is input to a counter, each successively more significant bit off of the counter will result in a roughly one-octave drop in the spectral energy, but also result in a signal that gets closer and closer to a square wave (as shown in Figure 4). These techniques were also extend- ed to the use of non-counter digital logic chips.

Figure 7: Class hierarchy for ChipInput and ChipOutput

Figure 6: CircuitBoard object schema vances time on all of its associated chips by one sample, a con- cept from unit generator (UGen) based synthesis frameworks (such as the Synthesis Toolkit (STK) [18] and ChucK [19]). A schematic diagram is shown in Figure 6.

Figure 8: Class hierarchy for Chip objects

Our case study extends the basic NCO framework by cascading two NCOs together as shown in Figure 10, and by adding Mul- tiplier and ADSR chips (corresponding to the voltage-controlled oscillator (VCA) and envelope generator (EG) in the typical ex- pression of FM synthesis) on the output of the SPtAC chips. The Modulator ADSR chip corresponds to typical FM enveloped con- trol of the Modulation Index (8), and the Carrier ADSR chip cor- responds to typical enveloped control of the overall amplitude of the signal. Figure 9: Numerically controller oscillator block dia- gram

We have produced a computer model of FM synthesis imple- mented with two cascaded NCOs using our software library for RLT. This working model is now ready to be Bit-Bent. Even a simple circuit can be bent or reconfigured in any number of ways. Here we present only a few of the possible Bit Bending modifications to this model. Figure 10: Implementation of 2-operator FM synthesis with numerically controlled oscillators block diagram

Less intuitive Bit Bending extensions to the model involve inser- tion of additional circuitry along the main signal path. As shown in Figure 11, we inserted a Counter chip at the interface between the carrier SPtAC output and the ADSR multiplier. Taking dif- ferent pins off of this counter allows for complex, non-linear oc- tave-like effects that are familiar from Bit Bending in hardware. Figure 11: “Counter bend” block diagram

Delay bends are highly sensitive to the delay amount, and intro- duce signal-dependent digital noise, which can have a variety of effects depending on the bend’s location in the signal path. Figure 12: “Delay bend” block diagram

descriptionView Paper arrow_downwardDownload

Audio Quality X Accessibility: How Digital Technology Changed the Way We Listen and Consume Popular Music

by Ricardo M Gomes

Based on the studies of Milner (2009), Katz (2004) and Wikstrom (2009), the article explores in details how the experience of listening and consuming music has been changing through the years due to the appearance and evolution of digital... more

descriptionView Paper arrow_downwardDownload

Spectral Morphing by Phase-Vocoder Analysis

by Mattia Paterna

Il phase vocoder (PV) è uno degli strumenti classici utilizzati nell’ambito digitale per l’analisi e la re-sintesi (synthesis by analysis) di uno spettro sonoro. L’obiettivo principale dell’analisi attraverso PV è quella di separare in... more

descriptionView Paper arrow_downwardDownload

Aux frontières de l’homme - interfacé

by Hervé Zénouda

Nous discuterons dans cet article de la définition de l'humain dans ses rapports actuels entretenus avec les environnements numériques, la nature et le traitement des données. Nos propos seront illustrés par les oeuvres des artistes... more

descriptionView Paper arrow_downwardDownload

Muestra del libro: Introducción al Audio Digital

by Lino García Morales

2020, BoD

Este libro es un intento de explicar qué es el audio digital desde varios puntos de vista: físico, lógico y musical y, además, con el doble propósito de hacerlo de manera divulgativa (que se entienda) y de manera práctica (que sirva para... more

descriptionView Paper arrow_downwardDownload

A Combined One-Class SVM and Template-Matching Approach for User-Aided Human Fall Detection by Means of Floor Acoustic Features

by stefano squartini and

The primary cause of injury-related death for the elders is represented by falls. The scientific community devoted them particular attention, since injuries can be limited by an early detection of the event. The solution proposed in this... more

descriptionView Paper arrow_downwardDownload

Guest Editorial Special Issue on Computational Intelligence for End-to-End Audio Processing

by stefano squartini

2018, IEEE Transactions on Emerging Topics in Computational Intelligence

descriptionView Paper arrow_downwardDownload

The use of watermarks in the protection of digital multimedia products

by G. Voyatzis

1999, Proceedings of the IEEE

The watermarking of digital images, audio, video and multimedia products in general has been proposed for resolving copyright ownership and verifying originality of content. This paper studies the contribution of watermarking for developing protection schemes. A general watermarking framework (GWF) is studied and the fundamental demands are listed. The watermarking algorithms, namely watermark generation, embedding and detection, are analyzed and necessary conditions for a reliable and e cient protection are stated. Although the GWF satis es the majority of requirements for copyright protection and content veri cation, there are unsolved problems inside a pure watermarking framework. Particular solutions, based on product registration and related network services, are suggested to overcome such problems. The digital form of photographs, paintings, speech, music, video etc. became very popular in the last decade. Digital facilities for creating, processing and storing multimedia products have been found very convenient by creators, providers, editors and customers. At the same time, digital network communications have grown rapidly. In such an environment, digital products can be easily copied, processed for various purposes, broadcasted and/or publicly exposed. However, these revolutionary capabilities are also available to pirates who use them illegally for their personal interest by violating the legal rights of the providers and customers. Subsequently, security issues should be accounted for in the digital networked distribution systems for multimedia products. Digital piracy, dealing with multimedia products, generally, includes the following cases : Illegal access. A pirate tries to receive a digital product from a network site without permission. Intentional tampering. A pirate modi es a digital product in order to extract/insert features for malicious reasons and then proceeds to its retransmission. The authenticity of the original product is lost. Copyright violation. A pirate receives a product and resells it without getting the permission to do so from the copyright owner. Techniques based on cryptography, digital signatures and digital watermarks can be used for countering digital piracy 1]. Private or public key cryptography 2] can be used for data access control. Encrypted products are accessible, and decryption is possible only by someone who possesses a proper key. Well established algorithms (e.g. RSA 3] and DES 4]) can be used for this purpose. The encryption/decryption techniques should manipulate large amounts of digital data and should achieve real-time encryption/decryption e.g. for video and digital TV applications 1]. The

descriptionView Paper arrow_downwardDownload

On the limits of steganography

by Ross Anderson

1998, IEEE Journal on Selected Areas in Communications

In this paper, we clarify what steganography is and what it can do. We contrast it with the related disciplines of cryptography and traffic security, present a unified terminology agreed at the first international workshop on the subject,... more

descriptionView Paper arrow_downwardDownload

The Detection of Signal on Digital Audio Synthesizer Based-on Propeller

by Ferry Wahyu Wibowo

It has been programmed a propeller starter kit using SimpleIDE application to be a digital audio synthesizer. Generating signal of instruments is done by the keyboard as input and speaker as output to hear the sound. The method selection... more

Fig. 1 Generating of Piano Tone Signal the release state. The release is a speed of sound fades to the initial volume of the signal when the key is released, the piano is one of the examples which has a longer release signal. The ADSR envelopes determine the tone of the pianc is shown in Fig. 1. The use of these values of ADSR envelopes is to mal the sound of different instruments, for example by changing the attack to the maximum volume quickly will make sound like a guitar string” while saxophone has a grea Ke a er value. The waveforms for each instrument have different waveforms which are generated into square waveform and saw waveform, and they also have different of pulse wid modulation (PWM) as shown in Table 1. th

memory to be collected and processed by other Cogs. Although 2KB of data and programs seem small, but there are per Cog total to 16KB on the chip. However, no single 32Kb block RAM can be accessed by all the Cog. Access to this memory is set by the Hub, synchronization mechanisms ensure that only one Cog can read or write at a time. During all the Cog running parallel, over 20 million instructions per second for maximum performance 160 MIPS that access the shared memory and must wait until hub allocates processor time slots. Hub of spin round is at half of the clock speed of the system so as Cog can achieve access shared memory once every 16 clock cycles®. In addition to the 32KB block of shared RAM, propeller chip is also equipped with a 32KB block read-only memory (ROM). ROM has a number of useful data tables including a series of characters for generating video and mathematical tables to enable quick log, antilog and trigonometric functions, useful while generating a data graphic viewer®. The architecture of the Parallax Propeller is shown in Fig. 2. Language interpreters use 8Kb of ROM, which is referred to as spin. Programs can be written in assembler, but with highly interpreted language is more efficient and usually required to perform low-level jobs. Propeller starter kit already has an audio output connected to the stereo speaker/headphone, LED indicators, while graphical output can be connected to a television and / or monitor via VGA. The Propeller starter kit board has been put a microphone, keyboard and mouse PS/2 connectors as shown in Fig. 3.

Fig. 4. Design of Digital Audio Synthesizer The design of the digital audio synthesizer based on propeller consists of keyboard, propeller starter kit, and signal out as shown in Fig. 4.

Fig. 5. Flowchart of Digital Audio Synthesizer Propeller starter kit uses 80 MHz oscillator. For initialization programming should define the constants to determine the variables of input and output. The keyboard is assigned as input and defined on port 26, meanwhile the used audio is defined into two port which are right audio and left audio. The right audio is assigned on port 10 and left audio is assigned on port 11. Flowchart of the mechanism of digital audio synthesizer is shown in Fig. 5.

It shows that the sample rate of this signal is 44.1 kHz and the number of bits per sample is 16. The process of generating sound after key has been pressed, the signal will route to the ADSR envelope setting. When musical instrument is played on a real musical note, it has an envelope to the signal of sound that is different in beginning, middle, and ending of waveforms for each musical instruments.

Table 1 ADSR Envelopes of Some Instruments we ee Propeller starter kit is used to implement the principles and programming model systems. The propeller starter kit is programmed using assembly anguage of spin and functioned for data processing of input and output (I/O). In otherwise the propeller starter kit could be programmed using C and C++. Propeller chip made late in 1990 by engineers at Parallax Inc. and was aunched in 2006 on the existing microcontroller products. This chip is a single core processor that is small, but hrough an iterative process of design, test, based on programming software and the accuracy of the microcontroller of eight cores was able to be made. Eight core chip does not mean that this chip has eight processors inside a single package, but there are eight 32- bit processing units with its own independent program and the data area splitting with the access of other peripherals. Each processor of Parallax Propeller circuit has 2KB RAM serving program and data storage. Each processor also has two enumerators, video generation hardware and peripherals I/O. Each processor can control all 32-pins I/O and called as Cog. Cog can control each pin or even split a single

descriptionView Paper arrow_downwardDownload

La renovación digital del relato informativo sonoro: el auge del podcast narrativo

by Luis Miguel Pedrero Esteban

2022, El Periscopio. Blog del Master en Innovación Periodística. UMH

Los podcasts narrativos de no ficción se han consolidado en 2021 como un género imprescindible en la oferta de productoras y plataformas de audio digital no sólo por el atractivo de sus argumentos e historias, sino porque renuevan las... more

descriptionView Paper arrow_downwardDownload

Information hiding-a survey

by Nalamasa Madhu

1999, Proceedings of the …

Information hiding techniques have recently become important in a number of application areas. Digital audio, video, and pictures are increasingly furnished with distinguishing but imperceptible marks, which may contain a hidden copyright... more

Fig. 1. A classification of information hiding techniques based on [10]. Many of the ancient systems presented in Sections III-A and III-B are a form of ‘technical steganography’ (in the sense that messages are hidden physically) and most of the recent ex- amples given in this paper address ‘linguistic steganography’ and ‘copyright marking’.

Fig. 3. Generic digital watermark recovery scheme.

Fig. 4. Hiding information into music scores: Gaspar Schott simply maps the letters of the alphabet to the notes. Clearly, one should not try to play the music [29, p. 322].

Fig. 5. A typical use of masking and transform space for digital wa- termarking and fingerprinting. The signal can be an image or an audio signal. The perceptual analysis is based on the properties of the human visual or auditory systems respectively. © corre- sponds to the embedding algorithm and © to the weighting of the mark by the information provided by the perceptual model.

Fig. 6. Monograms figuring TGE RG (Thomas Goodrich Eliensis — Bishop of Ely, England — and Remy/Remigius Guedon, the paper-maker). One of the oldest watermarks found in the Cam- bridge area (c.1550). At that time, watermarks were mainly used to identify the mill producing the paper; a means of guaranteeing quality. Courtesy of Dr E. Leedham-Green, Cambridge Univer- sity Archives. Reproduction technique: beta radiography.

Fig. 7. When applied to images, the distortions introduced by Stir- Mark are almost unnoticeable: ‘Lena’ before (a) and after (b) StirMark with default parameters. For comparison, the same distortions have been applied to a grid (c & d).

descriptionView Paper arrow_downwardDownload

Design and Analysis of an Oversampling D/A Converter in DMT-ADSL Systems

by J Jacob Wikner

2002

Oversampling sigma-delta digital-to-analog converters are crucial building blocks for telecommunication applications. To reduce power consumption, lower oversampling ratios are preferred thus high-order digital sigma-delta modulators are... more

Fig. 2 Feedback structure of the 5-th order modulator. Fig. 1 Block diagram of the oversampling D/A converter.

Fig. 4. Root locus for the 5th order modulator.

Fig. 5. Root locus for a unstable modulator.

Fig. 7 Output spectrum of a 5-th order modulator with a single tone 431.25 kHz fullscale input signal. Two zeros _CfATMT _... _...... J Le. Ate. t......... 2... ff .:....71 Le...) Fig. 6 Output spectrum of a 5-th order modulator with a single tone 431.25 kHz fullscale input signal. All zeros of NTF reside at de. single tone 431.25 kHz fullscale input signal. Two zeros

Fig. 8 Output spectrum of the whole D/A converter with an input of multiple tones (from 138 kHz to 1.104 MHz)

descriptionView Paper arrow_downwardDownload

On the limits of steganography

by nishant mehta

1998, Selected Areas in …

descriptionView Paper arrow_downwardDownload

GLOSSARIO IN BREVE

by Alex Picciafuochi and

2000, Glossario Audio MIDI

A/D : Analog to Digital (da analogico a digitale) indica l'ingresso analogico di un convertitore audio digitale, per fare un esempio gli ingressi della vostra scheda audio o di un processore di effetti o di un mixer digitale, dopo lo stadio di preamplificazione hanno dei convertitori A/D. ADSR : acronimo di Attack, Decay Sustain e Release, è l'Envelope Generator (E.G.) il Generatore di Inviluppo presente anche in versioni ridotte su sintetizzatori, campionatori, batterie elettroniche ma anche su compressori, riverberi ecc. con le sue fasi di Attacco, Decadimento, Sostegno e Rilascio determina l'andamento nel tempo del parametro che sta controllando, decidendone quindi il tempo di risposta (Attack o Attack Time), la durata del picco dinamico (Decay), la durata del segnale (Sustain ma a volte Hold che sta per Tenuto) e la velocità di chiusura (Release). AIFF : acronimo di Audio Interchange File Format, uno dei formati dei file audio in uso soprattutto sui computer Apple. Aliasing : in un sistema di registrazione digitale, quando la frequenza da campionare supera quella consentita dalla frequenza di campionamento, contravvenendo al Teorema di Nyquist (che indica come la frequenza minima necessaria alla registrazione di un segnale sonoro sia pari al doppio della frequenza massima in esso contenuta) in cui i punti di quantizzazione non sono sufficienti a convertirla creando effetti indesiderati in zona udibile, vere e proprie distorsioni non presenti nel segnale analogico di origine, spesso se ne parla quando viene citato il filtro Anti-Aliasing , che serve appunto ad evitare questo grave inconveniente tramite l'applicazione al segnale di ingresso di una Filtro Passa Basso LPF (Low Pass Filter) onde evitare che giungano al convertitore A/D frequenze superiori alla metà della frequenza di campionamento. Bandwith : larghezza di banda; indica l'ampiezza di banda, in pratica le frequenze che possono passare attraverso un apparecchio o che vengono coinvolte da un filtro ad esempio di un Equalizzatore. Bass Reflex: è il sistema di caricamento del Woofer (l'altoparlante che si occupa specificamente delle basse frequenze) in una cassa acustica/monitor da studio. Consta in una o più aperture "accordate" della cassa, l'emissione è controllata nel tempo tramite l'impiego di materiale fonoassorbente e da una lunghezza ottimale del condotto d'aria, che rallentandone la propagazione riportano in fase le vibrazioni prodotte dalla superficie posteriore del Woofer,

descriptionView Paper arrow_downwardDownload

Robust data-hiding in audio

by Hafiz Malik

2004

A novel high capacity data hiding technique for digital audio is proposed. Imperceptibility of the embedded data is ensured based on the masking property of the human auditory system (HAS). Audio signal is decomposed into subband signals,... more

descriptionView Paper arrow_downwardDownload

Robust Data Hiding in Audio Using Allpass Filters

by Hafiz Malik

2007, IEEE Transactions on Audio, Speech & Language Processing

A novel technique is proposed for data hiding in digital audio that exploits the low sensitivity of the human auditory system to phase distortion. Inaudible but controlled phase changes are introduced in the host audio using a set of... more

Fig. 1. Pole-zero layouts of H% p,(z):1 =0, 1 (1 =0, 1, 2, 3) for binary (4-ary) encoding/decoding.

Fig. 2. Second-order APF, H3, APo (<), phase response (left), and group-delay response (right) for binary encoding/decoding.

Fig. 3. Phase response (left) and group-delay response (right) of second-order APF Hf, Po (<) used for 4-ary encoding/decoding scheme.

Fig. 4. Magnitude response of the L-length truncated second-order AP func- tion for different values of truncation length (L). finite-length segment is used, which in effect takes into account only a truncated APF impulse response. The finite-length trun- cation of an APF impulse response introduces distortion in the magnitude response around the pole-zero location frequency 0. The level of this magnitude distortion directly depends on the length L of the truncated impulse response of an APF. Our anal- ysis shows that the level of this magnitude distortion around the pole-zero location frequency due to FLT of the APF impulse re- sponse decreases as truncation length increases and vice versa. This phenomenon is illustrated here for a first-order APF. In order to determine the effect of a length-Z truncation on the magnitude response of an APF, consider the transfer function of a stable and causal first-order APF

Let us denote the first term on the right-hand side in (12) by H} yp 4p(2). Here, H}7p 4p(z) is the z-transform of the length-(Z + 1) of truncated impulse response H},p»(z), which can be expressed as

Fig. 5. Power spectral analysis of unprocessed (first row) and processed audio using fourth-order (-- -) and 16th-order (—) APF with a = 0.95¢39-5"

Fig. 7. Decoding bit error performance of the proposed watermarking scheme against lossy compression (MP3), resampling (Res), random sample drop (RSD), and lowpass filtering (LPF), highpass filtering (HPF), and bandpass filtering (BPF) attacks. Fig. 6. Detection performance P- of the proposed audio watermarking scheme against AWGN attack.

APF PARAMETERS USED FOR 4-ARY ENCODING delay T)(w) and the group delay 7,(w) are generally used to characterize the phase response of a given system where

APF PARAMETERS UsED FOR BINARY ENCODING TABLE I

PERFORMANCE OF THE PROPOSED SCHEME AGAINST DESYNCHRONIZATION ATTACK

descriptionView Paper arrow_downwardDownload

Data-hiding in audio using frequency-selective phase alteration

by Rashid Ansari

2004 IEEE International Conference on Acoustics, Speech, and Signal Processing

A novel perception-based data hiding technique for digital audio is proposed. It exploits lower sensitivity of human auditory system (HAS) to phase distortion in audio compared with magnitude distortion. Audio is decomposed into subband... more

descriptionView Paper arrow_downwardDownload

Digital Audio

Key research themes

1. How are digital audio effects designed and modeled to manipulate and enhance musical sound?

2. What advances enable automatic detection and removal of non-speech vocal sounds (e.g., breath sounds) in digital audio recordings?

3. How are digital sound synthesis and digital sound reconstruction advancing in generating realistic audio?

Related Topics

All papers in Digital Audio