Academia.eduAcademia.edu

Digital Audio

description904 papers
group1,377 followers
lightbulbAbout this topic
Digital audio refers to the representation of sound in a digital format, where audio signals are converted into binary data for storage, processing, and transmission. This field encompasses various techniques for audio encoding, compression, and playback, enabling high-fidelity sound reproduction and manipulation in various applications, including music production, broadcasting, and telecommunications.
lightbulbAbout this topic
Digital audio refers to the representation of sound in a digital format, where audio signals are converted into binary data for storage, processing, and transmission. This field encompasses various techniques for audio encoding, compression, and playback, enabling high-fidelity sound reproduction and manipulation in various applications, including music production, broadcasting, and telecommunications.

Key research themes

1. How are digital audio effects designed and modeled to manipulate and enhance musical sound?

This research area investigates the design, modeling, and implementation of digital audio effects (DAFx) that manipulate musical sound for creative and production purposes. It focuses on algorithmic development, real-time processing, and the application of machine learning for black-box and parametric modeling of audio effects such as equalization, amplification, and non-linear distortions. These advances are significant for music production, mixing, live performance, and the emulation of analog hardware in digital environments.

Key finding: This comprehensive collection reveals that the field of digital audio effects is undergoing a shift towards integrating machine learning methods, as shown by contributions applying deep neural networks for modeling analog... Read more
Key finding: This work categorizes audio effects based on amplitude and phase modulation techniques using delay lines, explaining the signal processing foundations underlying effects such as vibrato, flanging, chorus, and rotary speaker... Read more
Key finding: This seminal review consolidates diverse digital audio equalization techniques from classic to cutting-edge approaches within a unified mathematical framework, emphasizing parametric and shelving filters, graphic equalizers,... Read more
Key finding: This paper presents innovations to parametric multichannel audio coding through the Immersive Sound-field Rendition (ISR) system, focusing on improvements like phase compensated down-mixing and blind up-mixing schemes for low... Read more

2. What advances enable automatic detection and removal of non-speech vocal sounds (e.g., breath sounds) in digital audio recordings?

This important subfield addresses the challenge of identifying and eliminating unwanted non-speech vocal sounds such as breaths that may detract from the clarity of vocal recordings in music, broadcasting, and speech processing. The focus encompasses algorithmic approaches employing signal processing and deep learning methods to automate this laborious process. Effective solutions improve both production efficiency and audio quality, with notable progress shown in attention U-Net architectures that combine accuracy with reduced computational complexity.

Key finding: This study introduces a parameter-efficient deep learning model leveraging an attention U-Net architecture for automatic detection and eradication of breath sounds in vocal recordings. Trained on a unique DAPS-derived dataset... Read more
Key finding: The presented modulation-based effects models inherently manage dynamic changes in audio signal properties suited to vocal manipulation. The insights into phase and amplitude modulation mechanisms inform the design of... Read more

3. How are digital sound synthesis and digital sound reconstruction advancing in generating realistic audio?

This theme covers algorithmic innovations in synthesizing sounds digitally, particularly through physical modeling and digital reconstruction techniques. The goal is to produce realistic and high-fidelity synthetic musical or acoustic sounds. Physical modeling simulates vibrating physical structures, enabling expressive synthesis linked to instrument mechanics. Digital sound reconstruction (DSR) techniques aim to overcome limitations of traditional loudspeaker designs, especially in low-frequency domains, using array configurations and shutter gate mechanisms to enhance sound pressure output. These approaches enable real-time synthesis and new sound generation paradigms critical for musical instrument digital interfaces and audio reproduction devices.

Key finding: This article systematically presents physical modeling methods for digital sound synthesis, transforming partial differential equation-based models of vibrating structures (such as strings and drums) into discrete-time... Read more
Key finding: The paper introduces Advanced Digital Sound Reconstruction (ADSR), which improves upon classical DSR by incorporating shutter gates and redirection mechanisms to boost sound pressure level particularly in mid-to-low frequency... Read more

All papers in Digital Audio

Instrucciones para realizar cinco practicas de  nivel básico en  Adobe Audition
[ITA] A handbook that introduces some basics aspects of SuperCollider music and audio environment
La presente investigación exploró el papel de las mediaciones tecnológicas, psicoacústicas y socioculturales, involucradas en siete experiencias de creación sonora, tanto individuales como colaborativas, bajo las posturas estéticas del... more
La presencia de la radio en México se acerca a su centenario y a pesar de que la bibliografía publicada en torno a ella es abundante queda mucho por investigar sobre su pasado y su presente para intentar una prospectiva de su porvenir.... more
This paper is about a suite of electroacoustic music inspired by multiculturalism and DNA. Samples, live playing, and synthetic sounds were combined using digital technology into a dance-informed, world-flavored, concert-oriented,... more
We derive a novel explicit wave-domain model for “diode clipper” circuits with an arbitrary number of diodes in each orientation, applicable, e.g., to wave digital filter emulation of guitar distortion pedals. Improving upon and... more
The use of the laptop in performance causes various negative responses from the audience, who feel a loss of spectacle and performativity in the action. This occurs as a result of the lack of gesture and visual cues from the performer and... more
We present an analysis of the cowbell voice circuit from the Roland TR-808 Rhythm Composer. A digital model based on this analysis accurately emulates the original. Through the use of physical models of each sub-circuit, this model... more
In the recent years, hybrid reverberation algorithms have been widely explored aiming to reproduce the acoustic behavior of real environment at low computational load. On this basis, exploiting the advantages introduced from hybrid... more
Una introducción con aplicaciones a las comunicaciones móviles y a la
modulación en audio digital
In sound reproduction systems the audio crossover plays a fundamental role. Nowadays, digital crossover based on IIR filters are commonly employed, of which non-linear phase is a relevant topic. For this reason, solutions aiming to IIR... more
In the past years, several hybridization techniques have been proposed to synthesize novel audio content owing its properties from two audio sources. These algorithms, however, usually provide no feature learning, leaving the user, often... more
Aunque nació y fue etiquetado hace más de una década, el fenómeno del podcasting resurge hoy apoyado en la normalización del smartphone como dispositivo de consumo, en la versatilidad del audio para contar o reforzar historias y en la... more
Artificial sound event detection (SED) has the aim to mimic the human ability to perceive and understand what is happening in the surroundings. Nowadays, deep learning offers valuable techniques for this goal such as convolutional neural... more
The a.bel project aims to provide artists with a way to easily interact with their audience, making use of their participation to effectively craft unique performances. This paper gives an overview of the a.bel system and details the... more
Ante la normalización del smartphone como dispositivo dominante de acceso a la información en el entorno digital, la radio española ha asumido la necesidad de hacerse presente en estas pantallas con el fin de atraer y facilitar su... more
We introduce the technique of "Bit Bending," a particularly fertile technique for circuit bending which involves chort circuits and manipulations upon digital serial information. We present a justification for computer modeling of... more
Based on the studies of Milner (2009), Katz (2004) and Wikstrom (2009), the article explores in details how the experience of listening and consuming music has been changing through the years due to the appearance and evolution of digital... more
Il phase vocoder (PV) è uno degli strumenti classici utilizzati nell’ambito digitale per l’analisi e la re-sintesi (synthesis by analysis) di uno spettro sonoro. L’obiettivo principale dell’analisi attraverso PV è quella di separare in... more
Nous discuterons dans cet article de la définition de l'humain dans ses rapports actuels entretenus avec les environnements numériques, la nature et le traitement des données. Nos propos seront illustrés par les oeuvres des artistes... more
Este libro es un intento de explicar qué es el audio digital desde varios puntos de vista: físico, lógico y musical y, además, con el doble propósito de hacerlo de manera divulgativa (que se entienda) y de manera práctica (que sirva para... more
The primary cause of injury-related death for the elders is represented by falls. The scientific community devoted them particular attention, since injuries can be limited by an early detection of the event. The solution proposed in this... more
The watermarking of digital images, audio, video and multimedia products in general has been proposed for resolving copyright ownership and verifying originality of content. This paper studies the contribution of watermarking for... more
In this paper, we clarify what steganography is and what it can do. We contrast it with the related disciplines of cryptography and traffic security, present a unified terminology agreed at the first international workshop on the subject,... more
It has been programmed a propeller starter kit using SimpleIDE application to be a digital audio synthesizer. Generating signal of instruments is done by the keyboard as input and speaker as output to hear the sound. The method selection... more
Los podcasts narrativos de no ficción se han consolidado en 2021 como un género imprescindible en la oferta de productoras y plataformas de audio digital no sólo por el atractivo de sus argumentos e historias, sino porque renuevan las... more
Information hiding techniques have recently become important in a number of application areas. Digital audio, video, and pictures are increasingly furnished with distinguishing but imperceptible marks, which may contain a hidden copyright... more
Oversampling sigma-delta digital-to-analog converters are crucial building blocks for telecommunication applications. To reduce power consumption, lower oversampling ratios are preferred thus high-order digital sigma-delta modulators are... more
A/D : Analog to Digital (da analogico a digitale) indica l'ingresso analogico di un convertitore audio digitale, per fare un esempio gli ingressi della vostra scheda audio o di un processore di effetti o di un mixer digitale, dopo lo... more
A novel high capacity data hiding technique for digital audio is proposed. Imperceptibility of the embedded data is ensured based on the masking property of the human auditory system (HAS). Audio signal is decomposed into subband signals,... more
A novel technique is proposed for data hiding in digital audio that exploits the low sensitivity of the human auditory system to phase distortion. Inaudible but controlled phase changes are introduced in the host audio using a set of... more
A novel perception-based data hiding technique for digital audio is proposed. It exploits lower sensitivity of human auditory system (HAS) to phase distortion in audio compared with magnitude distortion. Audio is decomposed into subband... more
Download research papers for free!