Academia.eduAcademia.edu

Digital Audio

description905 papers
group1,377 followers
lightbulbAbout this topic
Digital audio refers to the representation of sound in a digital format, where audio signals are converted into binary data for storage, processing, and transmission. This field encompasses various techniques for audio encoding, compression, and playback, enabling high-fidelity sound reproduction and manipulation in various applications, including music production, broadcasting, and telecommunications.
lightbulbAbout this topic
Digital audio refers to the representation of sound in a digital format, where audio signals are converted into binary data for storage, processing, and transmission. This field encompasses various techniques for audio encoding, compression, and playback, enabling high-fidelity sound reproduction and manipulation in various applications, including music production, broadcasting, and telecommunications.

Key research themes

1. How are digital audio effects designed and modeled to manipulate and enhance musical sound?

This research area investigates the design, modeling, and implementation of digital audio effects (DAFx) that manipulate musical sound for creative and production purposes. It focuses on algorithmic development, real-time processing, and the application of machine learning for black-box and parametric modeling of audio effects such as equalization, amplification, and non-linear distortions. These advances are significant for music production, mixing, live performance, and the emulation of analog hardware in digital environments.

Key finding: This comprehensive collection reveals that the field of digital audio effects is undergoing a shift towards integrating machine learning methods, as shown by contributions applying deep neural networks for modeling analog... Read more
Key finding: This work categorizes audio effects based on amplitude and phase modulation techniques using delay lines, explaining the signal processing foundations underlying effects such as vibrato, flanging, chorus, and rotary speaker... Read more
Key finding: This seminal review consolidates diverse digital audio equalization techniques from classic to cutting-edge approaches within a unified mathematical framework, emphasizing parametric and shelving filters, graphic equalizers,... Read more
Key finding: This paper presents innovations to parametric multichannel audio coding through the Immersive Sound-field Rendition (ISR) system, focusing on improvements like phase compensated down-mixing and blind up-mixing schemes for low... Read more

2. What advances enable automatic detection and removal of non-speech vocal sounds (e.g., breath sounds) in digital audio recordings?

This important subfield addresses the challenge of identifying and eliminating unwanted non-speech vocal sounds such as breaths that may detract from the clarity of vocal recordings in music, broadcasting, and speech processing. The focus encompasses algorithmic approaches employing signal processing and deep learning methods to automate this laborious process. Effective solutions improve both production efficiency and audio quality, with notable progress shown in attention U-Net architectures that combine accuracy with reduced computational complexity.

Key finding: This study introduces a parameter-efficient deep learning model leveraging an attention U-Net architecture for automatic detection and eradication of breath sounds in vocal recordings. Trained on a unique DAPS-derived dataset... Read more
Key finding: The presented modulation-based effects models inherently manage dynamic changes in audio signal properties suited to vocal manipulation. The insights into phase and amplitude modulation mechanisms inform the design of... Read more

3. How are digital sound synthesis and digital sound reconstruction advancing in generating realistic audio?

This theme covers algorithmic innovations in synthesizing sounds digitally, particularly through physical modeling and digital reconstruction techniques. The goal is to produce realistic and high-fidelity synthetic musical or acoustic sounds. Physical modeling simulates vibrating physical structures, enabling expressive synthesis linked to instrument mechanics. Digital sound reconstruction (DSR) techniques aim to overcome limitations of traditional loudspeaker designs, especially in low-frequency domains, using array configurations and shutter gate mechanisms to enhance sound pressure output. These approaches enable real-time synthesis and new sound generation paradigms critical for musical instrument digital interfaces and audio reproduction devices.

Key finding: This article systematically presents physical modeling methods for digital sound synthesis, transforming partial differential equation-based models of vibrating structures (such as strings and drums) into discrete-time... Read more
Key finding: The paper introduces Advanced Digital Sound Reconstruction (ADSR), which improves upon classical DSR by incorporating shutter gates and redirection mechanisms to boost sound pressure level particularly in mid-to-low frequency... Read more

All papers in Digital Audio

In this paper, an audio effect (AE) algorithm is proposed which can be applied to portable digital imaging devices to enjoy video contents effectively. The proposed AE algorithm enhances speech signals corrupted by background noise in... more
Traditional convolutional neural networks (CNNs) face significant limitations in medical imaging when detecting small, spatially variable objects such as kidney stones, primarily due to their inability to preserve pose information and... more
In this paper a new method for analysis and modeling of nonlinear audio systems is presented. The method is based on swept-sine excitation signal and nonlinear convolution firstly presented in (1, 2). It can be used in nonlinear... more
Blockchain technology has become a major focus in data security and reliability. A foundation for innovations such as non-fungible token (NFT), which opens up new opportunities in managing ownership of digital assets. We investigate NFTs... more
Abstract Science and technology are no longer isolated domains of innovation; they are deeply entangled with social, political, and epistemological structures. This essay explores three transformative frameworks—Actor-Network Theory... more
The field of speech compression has advanced rapidly due to cost-effective digital technology and diverse commercial applications. In voice communication a real-time system should be considered. It is not still possible to compress... more
El capítulo aborda la creación del Archivo Sonoro de San Juan del Río, Querétaro, como iniciativa para preservar el patrimonio cultural inmaterial a través de la memoria acústica de la ciudad. Partiendo del concepto de paisaje sonoro... more
Ponencia realizada en el Congreso de Historia Pública celebrado en la Universidad Autónoma de Madrid en abril de 2023, organizado por la Asociación de Historia Pública.
In this paper technique of embedding image into Audio file is proposed and successfully implemented to improve quality of watermark audio signal. Robustness of this technique is checked for various attacks including MP3 compression, Echo... more
Publicado en el suplemento Babelia de El País el 9 de noviembre de 2021. Obituario conjunto de  Robert Murray Schafer, padre adoptivo de los sound studies, e Ian Rawes, uno de sus cultivadores más compulsivos y sugerentes.
Aggregated trading volume in February 2023 across the leading six NFT marketplaces totalled USD 1.89 billion. This reflects a continuing positive trajectory, marked by a 91.9% month-on-month (MoM) growth from January 2023, where NFT... more
Three-dimensional spider webs feature highly intricate fiber architectures, which can be represented via 3-D scanning and modeling. To allow novel interpretations of the key features of a 3-D Cyrtophora citricola spider web, we translate... more
he name MPEG-4 High-Efficiency AAC (HE-AAC) refers to a family of recent audio coders that was developed by the International Organization for Standardization/ International Electrotechnical Commission (ISO/IEC) Moving Picture Experts... more
Flexibility of Internet technology gives rise to concern about the protection of digital data. Digital audio watermarking is the art of hiding important data in Digital Audio. This research paper deals with a new methodology which helps... more
Flexibility of Internet technology gives rise to concern about the protection of digital data. Digital audio watermarking is the art of hiding important data in Digital Audio. This research paper deals with a new methodology which helps... more
Audio watermarking has been proved as a powerful tool against illegal manipulation of audio products. It is generally used as a multimedia copyright protection tool. In this paper, we propose an audio watermarking algorithm based on two... more
In this paper, we propose a new Sound Event Classification (SEC) method which is inspired in recent works for out-ofdistribution detection. In our method, we analyse all the activations of a generic CNN in order to produce feature... more
In this paper, we propose a new Sound Event Classification (SEC) method which is inspired in recent works for out-ofdistribution detection. In our method, we analyse all the activations of a generic CNN in order to produce feature... more
Storytelling is a practice which is critical for the communication of lived experience, the development of empathy, and for the creation of a rich sense of collective being. While essential, it is also deeply complex and fragile—wrought... more
A model of music listening has been automated. A program takes digital audio as input, for example from a compact disc, and outputs an explanation of the music in terms of repeated sections and the implied structure. For example, when the... more
Digital video offers an interesting source of control information for musical applications. A novel synthesis technique is introduced where digital video controls sound spectra in real time. Light intensity modulates the amplitudes of 32... more
En este artículo se abordan las narrativas de la ciudad en el radioteatro El camino en la sombra, producido por la Radiodifusora Nacional de Colombia a partir de la obra homónima de José Antonio Osorio Lizarazo. El abordaje se nutre de... more
In this work, we develop a theoretical framework for reliable digital recording system identification from digital audio files alone, for forensic purposes. A digital recording system consists of a microphone and a digital sound... more
Araz, and Dr. Nazlı Candan. I would also like to express my thanks to Mehmet Uğur Doğan, Özgür Devrim Orman and Tuba İslam from the Speech Lab for their critical discussions and providing me the data repertoires. Finally, but especially,... more
--- This research explores the intersection of artificial intelligence (AI) and podcasting through the lens of the soundscape. The study analyzes how AI can transform the creation, manipulation, and perception of the sonic elements that... more
The relationship between the sound industry and its audience is influenced by the widespread use of smartphones as the primary means of accessing the internet. This has led to a transformation in media logics, particularly among young... more
Teknologi aplikasi musik saat ini makin berkembang pesat dan hampir mampu mengakomodasi berbagai karakter instrumen yang terdapat dalam musik gamelan. Penyesuaian kebiasan dalam memproduksi musik gamelan secara tradisional ke dalam bentuk... more
The dynamic progression of technology has induced a profound metamorphosis within the realm of commerce, ushering in novel prospects and trials for enterprises spanning diverse sectors. In contemporary times, the rise in non-fungible... more
Technology has been widely used for educational processes; in the arts several researchers have investigated the employment of computational systems and gadgets to approach younger audiences. This paper describes the design and... more
RHM Assisant Editor Podcast Interview with Dr. David Gruber and Dr. Jason Kalin on their article,"Gut Rhetorics: Towards Experiments in Living with Microbiota."
People identify powerfully with music: someone might say ÒthatÕs my song!Ó but they are unlikely to say ÒthatÕs my book!Ó or ÒthatÕs my picture!Ó A digital library of popular music therefore has the potential to be a compelling... more
Most mobile and wearable devices present digital audio signal processing capabilities. Since the nature of audio signals is analog, there is a need to use analog-to-digital converters (ADCs) with high-resolution for a high signal-to-noise... more
Propuesta educativa sobre la obra / instalación sonora «Las cuatro estaciones. Haikus sonoros para Rosario», dirigida a docentes de educación primaria.

Florencia Ruiz Ferretti. Año 2024.
The intuitive starting point for this project was the idea that one could use the voice interaction capacities of smart speakers to interface with oral history and audio documentary collections. In this paper we present an empirical... more
Puji syukur dipanjatkan kehadirat Allah SWT yang telah melimpahkan rahmat dan taufik-Nya sehingga buku pedoman ini dapat diselesaikan. Buku Pedoman ini berjudul "Pedoman Peningkatan Keterampilan Berbicara Bahasa Inggris melalui Mobile... more
La radio encuentra en el automóvil una audiencia cautiva. Los automovilistas escuchan la radio por la música y noticias en general, pero también prestan atención a los reportes viales en tiempo real, a pesar del reciente aumento de las... more
Veinte años después de su aparición como innovación tecnológica para incluir audio en los blogs, el podcast comienza a cristalizar como una industria creativa y cultural de gran alcance y eficiencia comercial. Su naturaleza dúctil, que... more
In this study, we propose a dynamic template adaptation approach for noise-robust sound classification and distance estimation in single-channel audio environments. Traditional cross-correlation methods rely on fixed sound templates that... more
This research is entitled "Analysis of Music Recording Process with Digital Methods in Sanggar Buana Banda Aceh during the COVID-19 Pandemic". This study aims to determine the process of recording music using the Digital Method... more
Veinte años después de su aparición como innovación tecnológica para incluir audio en los blogs, el podcast comienza a cristalizar como una industria creativa y cultural de gran alcance y eficiencia comercial. Su naturaleza dúctil, que... more
Audio enthusiasts nowadays listen to or stream their music through digital devices which makes their listening more towards digital consumption. The sound appreciation towards digital consumption has simplified the listening experience in... more
An inherent property of many DSP algorithms is that they tend to exhibit uniform frequency resolution from zero to Nyquist frequency. This is a direct consequence of using unit delays as building blocks; a frequency independent delay... more
Download research papers for free!