Academia.eduAcademia.edu

Voice Production

description61 papers
group15 followers
lightbulbAbout this topic
Voice production is the physiological and acoustic process by which humans generate sound through the vocal folds in the larynx, modulating airflow and pressure to create speech and other vocalizations. It encompasses the study of the mechanics of phonation, resonance, and articulation in the context of communication and performance.
lightbulbAbout this topic
Voice production is the physiological and acoustic process by which humans generate sound through the vocal folds in the larynx, modulating airflow and pressure to create speech and other vocalizations. It encompasses the study of the mechanics of phonation, resonance, and articulation in the context of communication and performance.

Key research themes

1. How can speech synthesis systems be adapted to support both speech and singing voice production from neutral speech corpora?

This research area focuses on developing text-to-speech (TTS) frameworks that extend beyond conventional speech synthesis to incorporate singing voice production without requiring dedicated singing databases. The motivation lies in the cost, feasibility, and flexibility challenges of recording supplementary singing corpora, especially when the original speaker is unavailable or unable to sing well. The key insight is integrating speech-to-singing (STS) conversion within unit selection or corpus-based TTS systems using neutral speech databases, enabling synthesis of expressive vocal outputs for applications like storytelling, assistive devices, and immersive experiences.

Key finding: Introduced a unit selection-based TTS and singing (US-TTS&S) framework that integrates speech-to-singing conversion to generate both speech and singing from a single neutral speech corpus. The system was validated objectively... Read more
Key finding: Developed an expressive speech synthesizer tailored for military training applications using corpus-based concatenative synthesis with samples classified by speaking style. The system exhibited versatile, high-quality... Read more
Key finding: Provided an overview and design of TTS synthesizers using concatenative and formant synthesis approaches, highlighting unit selection and diphone synthesis. Emphasized the trade-offs between database size, naturalness, and... Read more
Key finding: Integrated GlórCáil voice analysis-synthesis system into a DNN-based TTS framework to manipulate glottal source and vocal tract parameters globally, enabling control over speaker identity (gender, age) and affective coloring.... Read more

2. What computational and vocal models facilitate the control and learning of expressive vocal intonation and prosody, including for language learning and voice training?

This theme explores computational synthesis techniques and interactive training methods designed to improve vocal expressiveness, particularly intonation patterns and prosodic features critical for natural speech and singing. The focus includes how speech synthesis models are manipulated for expressive control and how novel interfaces support second language (L2) speakers in mastering challenging intonation, as well as models for professional voice training to optimize vocal and prosodic quality. These approaches provide actionable methods for enhancing voice performance through controlled vocal synthesis and targeted training.

Key finding: Demonstrated that real-time hand-gesture controlled vocal synthesis (Performative Vocal Synthesis, PVS) enables L2 learners (French speakers learning English intonation) to produce more comprehensible categorical intonation... Read more
Key finding: Designed and experimentally validated a vocal training program to improve vocal and prosodic elements (breathing, articulation, loudness, pitch, jitter, speech rate, pauses, stress) in journalism students. Post-training... Read more
Key finding: Provided a computational framework for manipulating glottal and vocal tract parameters to generate variations in affective expression and speaker identity within synthetic speech, revealing that global parameter shifts can... Read more

3. How can physical and computational models of vocal fold physiology and acoustics improve understanding and simulation of voice production?

This theme surveys synthetic vocal fold models and numerical approaches that accurately represent the biomechanics and aerodynamics of phonation to better simulate human voice production. It includes the design of self-oscillating vocal fold models, quantification of vocal fold geometry, and stabilized finite element methods for wave equations in moving vocal tracts. These advances help elucidate the complex coupling of tissue vibration, airflow, and acoustics, yielding insights for synthesis, voice therapy, and model-based voice production research.

Key finding: Provided a comprehensive review of two principal classes of synthetic self-oscillating vocal fold models—membranous (e.g., water-filled latex tubes) and elastic solid (e.g., multi-layered ultrasoft silicone)—detailing their... Read more
Key finding: Quantified 3D medial surface geometry of porcine vocal folds using microCT before and after freezing, finding ~5% non-uniform expansion due to freezing. Demonstrated qualitative similarity of porcine vocal fold geometry to... Read more
Key finding: Proposed a subgrid scale stabilized finite element method (FEM) to solve the mixed form wave equation within an arbitrary Lagrangian-Eulerian (ALE) framework, addressing inf-sup compatibility and high-frequency oscillations... Read more
Key finding: Analyzed how inclusion of a finite relaxation length for the flow to transition to one-dimensionality downstream of the glottis affects low-order vocal fold voice production models. Demonstrated that shorter relaxation... Read more

All papers in Voice Production

This article explores the intersection of emotional intelligence (EI) and vocal development, emphasizing the psychological factors that shape vocal training. Drawing on educational psychology, vocal pedagogy, and real-world teaching... more
ABSTRAKT Byly sledovány akusticko-mechanické vlastnosti reproduktorových soustav v závislosti na tvaru a materiálu ozvučnice. Byly navrženy a zkonstruovány ozvučnice na bázi partikulárních kompozitních materiálů. U těchto materiálů byly... more
Working with the wave equation in mixed rather than irreducible form allows one to directly account for both, the acoustic pressure field and the acoustic particle velocity field. Indeed, this becomes the natural option in many problems,... more
The negative peak amplitude of the differentiated glottal flow (dpeak) is known to correlate strongly with the sound pressure level (SPL) of speech. Therefore, the function between d peak and SPL is usually modeled as a single line. In... more
Sound for the human voice is produced by vocal fold flow-induced vibration and involves a complex coupling between flow dynamics, tissue motion, and acoustics. Over the past three decades, synthetic, self-oscillating vocal fold models... more
Echolokační schopnosti netopýrů a specifita jejich hlasů umožňuje sledování struktury a proměny jejich společenstev na dálku bez nutnosti kontaktní manipulace s jedinci. Prostřednictvím celonočních akustických záznamů lze získat... more
Les articles auxquels j'ai participé sont numérotés (ex : [1], [2], ...) et la liste est consultable au chapitre 6 (page 98). Les autres articles sont référencés en utilisant un style alphanumérique (ex : [DGO03]) et constituent la... more
Existuje řada důkazů, které potvrzují vliv delta-9-tetrahydrokanabinolu (THC) na indukci pozitivních psychotických symptomů u lidí, na zhoršení symptomů probíhající psychózy, na rozvoji psychózy u chronických adolescentních uživatelů... more
Computational speech reconstruction algorithms have the ultimate aim of returning natural sounding speech to aphonic and dysphonic individuals. These algorithms can also be used by unimpaired speakers for communicating sensitive or... more
Three laryngeal models were used to investigate the aerodynamic and elastic properties of vocal fold vibration: cadaveric human, excised canine, and synthetic silicone vocal folds. The aim was to compare the characteristics of these... more
Emergency call system, which in the case of traffic accidents ensure rapid assistance to motorists, will be mandatory for all new passenger cars approved since 2017 - 2018. The speech intelligibility as one of the parameters of eCall... more
Une étude numérique du transport et du dépôt d'aérosols dans un modèle idéalisé des voies aériennes supérieures humaines est présentée. La géométrie de la région laryngée pendant la respiration est obtenue à partir d'une étude clinique... more
Structures-Risques (3S-R), Domaine universitaire-BP 53, 38041 Grenoble Cedex 9 {lucie.bailly, nathalie.henrich}@gipsa-lab.grenoble-inp.fr Les bandes ventriculaires sont des structures laryngées situées au-dessus et à proximité des cordes... more
remor is a rhythmic oscillating movement of body parts caused by alternating contractions of muscle agonists and antagonists. It is an involuntary movement occurring in healthy individuals as well as in individuals with neurological... more
National audienceLes travaux présentés s’inscrivent dans le cadre d’un projet labellisé par le pôle de compétitivité ‘Véhicule du Futur’. L’acronyme du projet est SIMBA pour SIMulation de la Boucle d’Air automobile. L’objectif global est... more
Communications orales du dimanche 14 octobre A63 des récidives de chondrosarcome sous-glottique doit encore faire l'objet d'études prospectives complémentaires.
16 ème Congrès Français d'Acoustique 11-15 Avril 2022, Marseille L'influence du nombre de Mach et du nombre de Reynolds sur le bruit produit par une couche de mélange bidimensionnelle est étudiée à l'aide de simulations numériques... more
Des simulations numériques de quatre jets rond supersoniques sous-détendus ont été réalisées. Les quatre jets impactent une paroi avec un angle normal, située à une distance comprise entre L = 4.16r 0 et L = 9.32r 0 des lèvres de la buse,... more
The process of voiced sounds production can be described as follows: air coming from the lungs is forced through the narrow space between the two vocal folds, which are set in motion in a frequency governed by the tension of their... more
Le phénomène connu sous le nom de "tuyau chantant" concerne l'émission acoustique de tuyaux qui peuvent se mettre à siffler lorsqu'ils sont soumis à un écoulement interne de gaz. Ce sifflement trouve son origine dans la géométrie... more
Register shift between the chest and falsetto register is generally studied in the higher-than-speaking pitch range. However, a similar difference can also be produced at speaking pitch level. The shift from breathy "falsetto" phonation... more
Nous caractérisons les instabilités qui apparaissent sur un anneau de vorticité à l'aide d'une méthode particulaire de type "vortex blob" dans laquelle un remaillage axi-symétrique évite l'apparition de modes parasites. On initialise... more
Lors de la phase d'atterrissage d'un avion de transport civil, une partie de l'écoulement entrant dans les réacteurs est réorientée grâce aux inverseurs de poussée. Ces derniers créent une contre poussée qui participe au freinage de... more
The 60-minutes sessions were characterised by short bouts of MVPA interspersed by short bouts of LPA. Participants spent 58.3% of the duration of the session in MVPA, 30% in LPA and only 11.8% in sedentary behaviour. For all the sessions,... more
Voice education is a crucial aspect for professionals (journalists, teachers, politicians, actors, etc.) who use their voices as a working tool. The main concerns about such education are that, first, there is little awareness of the... more
The main part of the thesis is experimental determination of transmission in side-branch resonator. In first part are described basics of digital signal processing and sound damping in piping systems. A special attention is payed to... more
Notre etude se concentre sur un phenomene souvent rencontre dans les tuyaux qui transportent des ecoulements de gaz. Les instabilites de l'ecoulement provoquees par les singularites geometriques internes au tuyau peuvent exciter les... more
Dýchání a jeho poruchy jsou oblastí zájmu mnoha zdravotnických lékařských i nelékařských oborů. U pacientů s neurologickým onemocněním dochází k paréze, tedy poklesu svalové síly, kromě jiného i dechových svalů, což má za následek vznik... more
Dýchání a jeho poruchy jsou oblastí zájmu mnoha zdravotnických lékařských i nelékařských oborů. U pacientů s neurologickým onemocněním dochází k paréze, tedy poklesu svalové síly, kromě jiného i dechových svalů, což má za následek vznik... more
National audienceLes travaux présentés s’inscrivent dans le cadre d’un projet labellisé par le pôle de compétitivité ‘Véhicule du Futur’. L’acronyme du projet est SIMBA pour SIMulation de la Boucle d’Air automobile. L’objectif global est... more
Rád bych poděkoval svému vedoucímu Ing. Janu Skapovi, Ph.D, za jeho rady a vstřícný přístup při vedení mé bakalářské práce. Zároveň bych chtěl také poděkovat pedagogům Hudební fakulty Akademie múzických umění za poskytnutí materiálů z... more
Research investigating the correlation of acoustic measures of noise and the perception of pathological voice quality has consistently demonstrated a moderate association. However, this correlational approach cannot address basic... more
At present, two important questions about voice remain unanswered: When voice quality changes, what physiological alteration caused this change, and if a change to the voice production system occurs, what change in perceived quality can... more
This thesis investigates the use of gesture and body-movement as teaching and learning tools in Western classical singing. The introduction draws together a number of theoretical threads to argue why this study has been undertaken and... more
During respiration, the glottis opens a fraction of second before air is drawn in by descent of the diaphragm, Green and Neil 1955. 2 This opening is brought about by contraction of the posterior cricoarytenoid muscles (Figure 1).... more
In this era of minimally invasive surgical intervention s, the knowledge of the physiology and pathophysiology of the larynx is vital to the laryngologist.  The conventional procedure of laryngeal surgery has been superseded by functional... more
L'etude concerne la prise en glace d'une paroi thermiquement controlee placee au sein d'une conduite dans laquelle se developpe un ecoulement turbulent d'air humide. L'ensemble des essais experimentaux a ete realise a... more
In this paper, electric discharges were studied in atmospheric air in order to modify subsonic airflows. The flows induced by a DC surface corona discharge and an AC Dielectric Barrier Discharge were measured with the PIV system. They... more
The aim of this paper is to use Bayesian statistics to update a probability density function (p.d.f.) related to the tension parameter of the vocal folds, which is one of the main parameters responsible for the changing of the fundamental... more
Computational speech reconstruction algorithms have the ultimate aim of returning natural sounding speech to aphonic and dysphonic individuals. These algorithms can also be used by unimpaired speakers for communicating sensitive or... more
Voice education is a crucial aspect for professionals (journalists, teachers, politicians, actors, etc.) who use their voices as a working tool. The main concerns about such education are that, first, there is little awareness of the... more
Tato závěrečná diplomová práce se zabývá problematikou měření vibrací a hluku převodových ústrojí. Rešeršní část popisuje základní problematiku vzniku hluku a vibrací, jejich měření, dále také zdroje vibrací v převodových ústrojích.... more
Résumé : Des mesures PIV associées à des mesures de vitesse acoustique par anémométrie à fil chaud permettent de caractériser l’écoulement et l’acoustique dans un tube corrugué. Un sifflement intense est provoqué par la cohérence entre... more
Working with the wave equation in mixed rather than irreducible form allows one to directly account for both, the acoustic pressure field and the acoustic particle velocity field. Indeed, this becomes the natural option in many problems,... more
Chtěla bych poděkovat Ing. Tereze Kráčmerové za odborné vedení, poskytování cenných rad a vstřícný přístup po celou dobu vedení mé diplomové práce. Dále bych chtěla poděkovat kolegům ze Samostatného oddělení lékařské fyziky Fakultní... more
La stabilité linéaire de l'écoulement de Taylor-Couette entre deux cylindres aux parois compliantes (déformables) est considérée. Les parois sont modélisées comme des coques minces élastiques supportées par un ensemble de ressorts... more
Download research papers for free!