Voice Production

description61 papers

group15 followers

lightbulbAbout this topic

Voice production is the physiological and acoustic process by which humans generate sound through the vocal folds in the larynx, modulating airflow and pressure to create speech and other vocalizations. It encompasses the study of the mechanics of phonation, resonance, and articulation in the context of communication and performance.

lightbulbAbout this topic

Key research themes

1. How can speech synthesis systems be adapted to support both speech and singing voice production from neutral speech corpora?

This research area focuses on developing text-to-speech (TTS) frameworks that extend beyond conventional speech synthesis to incorporate singing voice production without requiring dedicated singing databases. The motivation lies in the cost, feasibility, and flexibility challenges of recording supplementary singing corpora, especially when the original speaker is unavailable or unable to sing well. The key insight is integrating speech-to-singing (STS) conversion within unit selection or corpus-based TTS systems using neutral speech databases, enabling synthesis of expressive vocal outputs for applications like storytelling, assistive devices, and immersive experiences.

A unit selection text-to-speech-and-singing synthesis framework from neutral speech: proof of concept

by Joan Claudi Socoró Carrié

2023, EURASIP Journal on Audio, Speech, and Music Processing

Key finding: Introduced a unit selection-based TTS and singing (US-TTS&S) framework that integrates speech-to-singing conversion to generate both speech and singing from a single neutral speech corpus. The system was validated objectively... Read more

articleView Paper downloadDownload

Limited domain synthesis of expressive military speech for animated characters

by Lewis Johnson

2024, Proceedings of 2002 IEEE Workshop on Speech Synthesis, 2002.

Key finding: Developed an expressive speech synthesizer tailored for military training applications using corpus-based concatenative synthesis with samples classified by speaking style. The system exhibited versatile, high-quality... Read more

articleView Paper downloadDownload

Design and Development of a Text-To-Speech Synthesizer System

by VINEET CHAUHAN CHAUHAN

2023

Key finding: Provided an overview and design of TTS synthesizers using concatenative and formant synthesis approaches, highlighting unit selection and diphone synthesis. Emphasized the trade-offs between database size, naturalness, and... Read more

articleView Paper downloadDownload

Integrating a Voice Analysis-Synthesis System with a TTS Framework for Controlling Affect and Speaker Identity

by Ailbhe Chasaide

2025

Key finding: Integrated GlórCáil voice analysis-synthesis system into a DNN-based TTS framework to manipulate glottal source and vocal tract parameters globally, enabling control over speaker identity (gender, age) and affective coloring.... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

2. What computational and vocal models facilitate the control and learning of expressive vocal intonation and prosody, including for language learning and voice training?

This theme explores computational synthesis techniques and interactive training methods designed to improve vocal expressiveness, particularly intonation patterns and prosodic features critical for natural speech and singing. The focus includes how speech synthesis models are manipulated for expressive control and how novel interfaces support second language (L2) speakers in mastering challenging intonation, as well as models for professional voice training to optimize vocal and prosodic quality. These approaches provide actionable methods for enhancing voice performance through controlled vocal synthesis and targeted training.

Performative Vocal Synthesis for Foreign Language Intonation Practice

by Christophe d'Alessandro

2024, Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems

Key finding: Demonstrated that real-time hand-gesture controlled vocal synthesis (Performative Vocal Synthesis, PVS) enables L2 learners (French speakers learning English intonation) to produce more comprehensible categorical intonation... Read more

articleView Paper downloadDownload

A Training Model for Improving Journalists' Voice

by Emma Rodero

2024, Journal of Voice

Key finding: Designed and experimentally validated a vocal training program to improve vocal and prosodic elements (breathing, articulation, loudness, pitch, jitter, speech rate, pauses, stress) in journalism students. Post-training... Read more

articleView Paper downloadDownload

Integrating a Voice Analysis-Synthesis System with a TTS Framework for Controlling Affect and Speaker Identity

by Ailbhe Chasaide

2025

Key finding: Provided a computational framework for manipulating glottal and vocal tract parameters to generate variations in affective expression and speaker identity within synthetic speech, revealing that global parameter shifts can... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

3. How can physical and computational models of vocal fold physiology and acoustics improve understanding and simulation of voice production?

This theme surveys synthetic vocal fold models and numerical approaches that accurately represent the biomechanics and aerodynamics of phonation to better simulate human voice production. It includes the design of self-oscillating vocal fold models, quantification of vocal fold geometry, and stabilized finite element methods for wave equations in moving vocal tracts. These advances help elucidate the complex coupling of tissue vibration, airflow, and acoustics, yielding insights for synthesis, voice therapy, and model-based voice production research.

Synthetic, Self-Oscillating Vocal Fold Models for Voice Production Research

by Scott Thomson

2024, Journal of the Acoustical Society of America

Key finding: Provided a comprehensive review of two principal classes of synthetic self-oscillating vocal fold models—membranous (e.g., water-filled latex tubes) and elastic solid (e.g., multi-layered ultrasoft silicone)—detailing their... Read more

articleView Paper downloadDownload

Quantification of porcine vocal fold geometry

by Scott Thomson

2019, Journal of Voice

Key finding: Quantified 3D medial surface geometry of porcine vocal folds using microCT before and after freezing, finding ~5% non-uniform expansion due to freezing. Demonstrated qualitative similarity of porcine vocal fold geometry to... Read more

articleView Paper downloadDownload

A Stabilized Finite Element Method for the Mixed Wave Equation in an ALE Framework With Application to Diphthong Production

by hector espinoza

2024, Acta Acustica united with Acustica

Key finding: Proposed a subgrid scale stabilized finite element method (FEM) to solve the mixed form wave equation within an arbitrary Lagrangian-Eulerian (ALE) framework, addressing inf-sup compatibility and high-frequency oscillations... Read more

articleView Paper downloadDownload

Relaxation to one-dimensional postglottal flow in a vocal fold model

by Denisse Sciamarella

2022, Speech Communication

Key finding: Analyzed how inclusion of a finite relaxation length for the flow to transition to one-dimensionality downstream of the glottis affects low-order vocal fold voice production models. Demonstrated that shorter relaxation... Read more

articleView Paper downloadDownload

keyboard_arrow_downShow more

All papers in Voice Production

Emotional Intelligence and Vocal Development: A Psychological Perspective in Vocal Pedagogy

by Anzhela Akhtiamova

2025

This article explores the intersection of emotional intelligence (EI) and vocal development, emphasizing the psychological factors that shape vocal training. Drawing on educational psychology, vocal pedagogy, and real-world teaching... more

descriptionView Paper arrow_downwardDownload

Similitude en lit fluidisé gaz-solide: influence du distributeur et du plenum

by Lounes TADRIST

2025

descriptionView Paper arrow_downwardDownload

Analýza frekvenčních závislostí reproduktorových soustav v závislosti na tvaru a materiálu ozvučnice

by Jiří Jirásek

2025

ABSTRAKT Byly sledovány akusticko-mechanické vlastnosti reproduktorových soustav v závislosti na tvaru a materiálu ozvučnice. Byly navrženy a zkonstruovány ozvučnice na bázi partikulárních kompozitních materiálů. U těchto materiálů byly... more

descriptionView Paper arrow_downwardDownload

A Stabilized Finite Element Method for the Mixed Wave Equation in an ALE Framework With Application to Diphthong Production

by hector espinoza

2024, Acta Acustica united with Acustica

Working with the wave equation in mixed rather than irreducible form allows one to directly account for both, the acoustic pressure field and the acoustic particle velocity field. Indeed, this becomes the natural option in many problems,... more

descriptionView Paper arrow_downwardDownload

Linearity of the function between the sound pressure level of speech and the negative peak amplitude of the differentiated glottal flow for voices of different intensities

by Erkki Vilkman

2024

The negative peak amplitude of the differentiated glottal flow (dpeak) is known to correlate strongly with the sound pressure level (SPL) of speech. Therefore, the function between d peak and SPL is usually modeled as a single line. In... more

descriptionView Paper arrow_downwardDownload

Synthetic, Self-Oscillating Vocal Fold Models for Voice Production Research

by Scott Thomson

2024, Journal of the Acoustical Society of America

Sound for the human voice is produced by vocal fold flow-induced vibration and involves a complex coupling between flow dynamics, tissue motion, and acoustics. Over the past three decades, synthetic, self-oscillating vocal fold models... more

descriptionView Paper arrow_downwardDownload

Bioacoustic pattern of a bat community: seasonal dynamics of bat communities in the Kruger NP, SAR

by Markéta Staňková

2024

Echolokační schopnosti netopýrů a specifita jejich hlasů umožňuje sledování struktury a proměny jejich společenstev na dálku bez nutnosti kontaktní manipulace s jedinci. Prostřednictvím celonočních akustických záznamů lze získat... more

descriptionView Paper arrow_downwardDownload

Analysis of the functionning of wind musical instruments

by Christophe Vergez

2024

Les articles auxquels j'ai participé sont numérotés (ex : [1], [2], ...) et la liste est consultable au chapitre 6 (page 98). Les autres articles sont référencés en utilisant un style alphanumérique (ex : [DGO03]) et constituent la... more

descriptionView Paper arrow_downwardDownload

[Is Acute Administration of Delta-9-tetrahydrocannabinol (THC) to Rats a Model of Psychosis? Comparison of Behavioral and EEG Findings]

by Pavlína Nováková

2024

Existuje řada důkazů, které potvrzují vliv delta-9-tetrahydrokanabinolu (THC) na indukci pozitivních psychotických symptomů u lidí, na zhoršení symptomů probíhající psychózy, na rozvoji psychózy u chronických adolescentních uživatelů... more

descriptionView Paper arrow_downwardDownload

Phonated speech reconstruction using twin mapping models

by Iman Ardekani

2024

Computational speech reconstruction algorithms have the ultimate aim of returning natural sounding speech to aphonic and dysphonic individuals. These algorithms can also be used by unimpaired speakers for communicating sensitive or... more

descriptionView Paper arrow_downwardDownload

Comparison of Aerodynamic and Elastic Properties in Tissue and Synthetic Models of Vocal Fold Vibrations

by Jacob Michaud-Dorko

2024

Three laryngeal models were used to investigate the aerodynamic and elastic properties of vocal fold vibration: cadaveric human, excised canine, and synthetic silicone vocal folds. The aim was to compare the characteristics of these... more

Table 1. Summary of mean measured parameters and standard deviation for each vocal fold model. 4. Discussion

Figure 1. Schematic of the experimental setup used to capture the intraglottal flow field and glottal wall geometry using particle image velocimetry (PIV). (a) Cadaveric human larynx, (b) excised canine larynx, and (c) synthetic larynx, all mounted on an aerodynamic nozzle. Figure 1. Schematic of the experimental setup used to capture the intraglottal flow field and glottal section, connected to the trachea, was 25 mm long, with an inlet diameter of 12.7 mm and an exit diameter o guidelines provid represen transduc ting the subglottal press er, with the reported va Kjaer, Neerum, Denmar he glottis where it did not inter f 17.0 mm. The settling chamber and nozzle were designed following the ed by Morel [33] and Mehta [34]. The static pressure inside the nozzle, ure (Psg), was measured using a Honeywell FPG pressure ues time-averaged. The airflow was humidified (Hudson RCL, ConchaTherm III) and regulated using a flow controller (Parker, MPC series), a flow meter (MicroMotion Inc, CMF02 regulator (ControlAir Inc, Type hane tubing (3/4 in. OD, 1/8 in. wall) connected the various instruments. Acoustic ments were captured using a 1/4-inch omnidirectional microphone (Type 4958, 5 Coriolis Flow Meter, Boulder, CO, USA), and a pressure 00 Precision Air Pressure Regulator, Amherst, NH, USA) k) placed approximately 30 cm laterally and superiorly to fere with the airflow.

Figure 2. Force-displacement measurements. (a) Cadaveric human vocal fold. (b) Excised canine vocal fold. The pink color is from the dye applied to minimize the reflections from tissue during PIV measurements (c) Synthetic vocal fold before (top) and after (bottom) applying paint. (d) Schematic of the experimental setup used to collect normal force—displacement measurements. Thee astic properties of the vocal folds for t o the procedure outlined by Dion et al. [36]. Fo he tissue models were bisected in the sagittal plane to create a hemilarynx configuration. A pecific ho der was molded from plaster material he three models were collected according lowing the aerodynamic measurements, for each larynx (Figure 2). The hemilarynx vas affixed to the model mold using cyanoacrylate adhesive. The tissues were kept moist efore and aster ho during the testing using a phosphate der was cast for the synthetic fold, bu he testing. buffered saline (PBS) solution. A similar the saline solution was not used during

Plotting the width of the glottal opening in the mid-coronal shows a comparison between the motion of the folds during vibration in each larynx model. This width was extracted at the superior aspect from the PIV images, and its value is assumed to be proportional to the glottal opening area. All models were characterized by opening, closing, and closed phases during their vibration cycle (Figure 3a). The resultant glottal width waveform was used to calculate the open quotient (OQ) and area skewness index (Slwigth) values (Figure 3b,c). Waveform examples for HL1, CL1, and SL1 are shown in Figure 3a.

Figure 3. Mid-coronal width analysis of human (left column), canine (middle column), and synthetic (right column) vocal fold models during phonation. (a) Glottal width waveform. (b) Open quotient. (c) Superior width skewness index [38-41].

Figure 4. Mid-coronal width trends of human, canine, and synthetic vocal fold models during phonation. (a) Open quotient. (b) Superior width skewness index. (c) Glottal divergence angle at the MFDR. In all cases, the OQ, SIwidth, and the divergence angle at the instant of the MFDR phase increased with Psg (Figure 4). The Slwiqe, values increased the most in the human (Figure 4b), while the canine and synthetic trends remained constant for both the OQ and Slwidth parameters. The glottal divergence angle, defined as the total angle between the folds at the MFDR phase, was extracted from the PIV measurements (shown in Section 3.2. Glottal Flow Characteristics). The magnitude of the divergence angle at the MFDR phase appeared to be similar in the tissue models and higher than in the synthetic models (Figure 4c).

Figure 5. Mid-coronal flow analysis of human (left column), canine (middle column), and synthetic (right column) vocal fold models during phonation. (a) Two-dimensional glottal flow waveform. (b) Flow skewness index. Plotting the two-dimensional flow rate (Q*) in the mid-coronal location compares the waveforms from each larynx model. The flow rate was calculated by integrating the axial velocity along the glottal exit (superior edge) at each phase, and examples of the resulting waveforms are shown for HL1, CL1, and SL1 in Figure 5a. Similar to the fold’s displacement, the peak Q* increased with the increasing Psg in all models, but the shift in Q* skewing was observed only in the tissue models. Furthermore, the value of the flow skewness index (Slow) in some of the synthetic models was less than one (SIfow < 1), signifying a prolonged closing phase (compared with opening), which was the opposite of the tissue models (Figure 5b).

Figure 6. Mid-coronal velocity flow fields of human (left column), canine (middle column), and synthetic (right column) vocal fold models at the MFDR. (a) Low Psg. (b) High Psg. (c) Axial velocity profiles extracted along the superior edge (white horizontal line). the folds. Although the magnitude of this angle increased with the increasing Psg in all models, its values were much greater in the tissue models.

Figure 7. Mid-coronal vorticity flow fields of human (left column), canine (middle column), and synthetic (right column) vocal fold models at the MFDR. (a) Low Psg. (b) High Psg. (c) Total circulation (within the outlined region).

Figure 8. Glottal flow and acoustic characteristics of human, canine, and synthetic vocal fold models. (9) Flow ckewnece index (h) |MBEBDRI! (c) Total circulation (d) VE. where R is the distance from the microphone to the sound source, SPL is the sound pressure level, Q is the mean glottal flow rate recorded from the upstream flow meter, and Psg is the mean subglottal pressure [43]. These parameters in the canine models are higher than in the human and synthetic models.

Figure 9. Elasticity measurements along the vertical height of each model in the mid—coronal plane (HL1 [left column], CL1 [middle column], and SL1 [right column]). (a) Stress—strain curve. Vertical lines mark the location where the maximum displacement of the folds was observed at high and low Psg values. (b) Corresponding Young’s modulus values calculated at the maximum fold displacement [18—20,26].

Figure 10. VSG as a function of Psg (i.e., strain) for the human, canine, and synthetic vocal fold models The VSG as a function of Psg highlights the notable differences between the tissue and synthetic models (Figure 10). In tissue models, nonlinearity is evident as the VSG increases with strain, due to an increase in Psg. Conversely, in synthetic models, the VSG remains constant with increasing strain, also due to a rising Psg. This behavior occurs because the silicone synthetic model is primarily elastic and exhibits little to no hysteresis.

descriptionView Paper arrow_downwardDownload

Měření parametrů přenosového kanálu systému eCall v osobních automobilech

by Oldřich Tureček

2024

Emergency call system, which in the case of traffic accidents ensure rapid assistance to motorists, will be mandatory for all new passenger cars approved since 2017 - 2018. The speech intelligibility as one of the parameters of eCall... more

descriptionView Paper arrow_downwardDownload

Transport et dépôt d'aérosols liquides dans les voies aériennes supérieures

by Lucie Bailly

2024, HAL (Le Centre pour la Communication Scientifique Directe)

Une étude numérique du transport et du dépôt d'aérosols dans un modèle idéalisé des voies aériennes supérieures humaines est présentée. La géométrie de la région laryngée pendant la respiration est obtenue à partir d'une étude clinique... more

descriptionView Paper arrow_downwardDownload

Contribution des bandes ventriculaires lors dun effort vocal. Impact sur la vibration glottique

by Lucie Bailly

2024, HAL (Le Centre pour la Communication Scientifique Directe)

Structures-Risques (3S-R), Domaine universitaire-BP 53, 38041 Grenoble Cedex 9 {lucie.bailly, nathalie.henrich}@gipsa-lab.grenoble-inp.fr Les bandes ventriculaires sont des structures laryngées situées au-dessus et à proximité des cordes vocales. Bien que leurs propriétés biomécaniques diffèrent de celles des cordes vocales, elles sont capables de se rapprocher, de rentrer en contact, voire même de vibrer lors de gestes phonatoires parlés ou chantés. Dans cette étude, nous nous intéressons à leur comportement lors d'un effort vocal (crescendo-decrescendo, cri, grognement). Pour ce faire, une base de données a été constituée par l'enregistrement par cinématographie ultrarapide de 5 locuteurs et 3 chanteurs lors de ces divers gestes phonatoires. Les signaux audio et électroglottographique de chaque production ont été enregistrés simultanément, et synchronisés aux images laryngées. L'observation du comportement des bandes ventriculaires montre un rapprochement de ces structures lors d'un effort vocal, comparativement au geste de voisement usuel. Leur rapprochement peut s'accompagner d'une augmentation conjointe de l'énergie acoustique dans la bande de fréquence 2-4 kHz, sans influence directe sur l'intensité vocale globale. Le geste phonatoire peut également s'accompagner d'un accolement des bandes ventriculaires, observé sur la partie médiane, antéro-médiane ou sur l'intégralité de leur longueur. Dans la continuité de leur mouvement de compression, les bandes ventriculaires peuvent entrer en vibration, périodiquement ou non, en phase ou non avec l'oscillation des cordes vocales selon le contexte phonatoire. Une modélisation théorique aérodynamique a permis de mettre en évidence l'influence d'une constriction supralaryngée sur le mouvement vibratoire glottique. Cette modélisation est appliquée ici à l'étude physique de l'impact des constrictions observées par cinématographie ultra-rapide sur la vibration glottique. L'aire ventriculaire estimée à partir des images laryngées est introduite comme paramètre d'entrée du modèle. Le comportement vibratoire glottique résultant est simulé par application d'un modèle à deux masses inspiré de Ruty et al. (2007), et comparé à la vibration glottique mesurée par électroglottographie.

descriptionView Paper arrow_downwardDownload

Caractérisation expérimentale multi-échelle et interdisciplinaire des plis vocaux

by Lucie Bailly

2024, HAL (Le Centre pour la Communication Scientifique Directe)

descriptionView Paper arrow_downwardDownload

Tremor diagnostics of tennis players using accelerometer

by Lenka Škochová

2024

remor is a rhythmic oscillating movement of body parts caused by alternating contractions of muscle agonists and antagonists. It is an involuntary movement occurring in healthy individuals as well as in individuals with neurological... more

descriptionView Paper arrow_downwardDownload

Mesures de pression et mesures de champs de vitesse synchronisés en phase au niveau du répartiteur d'admission automobile

by David RAMEL

2024

National audienceLes travaux présentés s’inscrivent dans le cadre d’un projet labellisé par le pôle de compétitivité ‘Véhicule du Futur’. L’acronyme du projet est SIMBA pour SIMulation de la Boucle d’Air automobile. L’objectif global est... more

descriptionView Paper arrow_downwardDownload

La glottoplastie selon Wendler pour la féminisation de la voix en cas de transsexualisme homme–femme

by Sebastien Van der Vorst

2024, Annales françaises d'Oto-rhino-laryngologie et de Pathologie Cervico-faciale

Communications orales du dimanche 14 octobre A63 des récidives de chondrosarcome sous-glottique doit encore faire l'objet d'études prospectives complémentaires.

descriptionView Paper arrow_downwardDownload

Analyse de sensibilité du rayonnement acoustique d’une couche de mélange bidimensionnelle par différentiation complexe

by Christophe Bogey

2024, HAL (Le Centre pour la Communication Scientifique Directe)

16 ème Congrès Français d'Acoustique 11-15 Avril 2022, Marseille L'influence du nombre de Mach et du nombre de Reynolds sur le bruit produit par une couche de mélange bidimensionnelle est étudiée à l'aide de simulations numériques... more

descriptionView Paper arrow_downwardDownload

Simulation numérique du rayonnement acoustique de jets ronds supersoniques impactant une paroi

by Christophe Bogey

2024, HAL (Le Centre pour la Communication Scientifique Directe)

Des simulations numériques de quatre jets rond supersoniques sous-détendus ont été réalisées. Les quatre jets impactent une paroi avec un angle normal, située à une distance comprise entre L = 4.16r 0 et L = 9.32r 0 des lèvres de la buse,... more

descriptionView Paper arrow_downwardDownload

Comparison of some mechanical models of larynx in the synthesis of voiced sounds

by jorge Lucero

2024, Journal of the Brazilian Society of Mechanical Sciences and Engineering

The process of voiced sounds production can be described as follows: air coming from the lungs is forced through the narrow space between the two vocal folds, which are set in motion in a frequency governed by the tension of their... more

descriptionView Paper arrow_downwardDownload

Identification des sources acoustiques dans un tuyau corrugué sous écoulement

by Pierre-Olivier Mattei

2024, HAL (Le Centre pour la Communication Scientifique Directe)

Le phénomène connu sous le nom de "tuyau chantant" concerne l'émission acoustique de tuyaux qui peuvent se mettre à siffler lorsqu'ils sont soumis à un écoulement interne de gaz. Ce sifflement trouve son origine dans la géométrie particulière de ces tuyaux qu'on qualifie de "corrugués" parce que leur paroi interne est constituée d'une suite de cavités régulièrement espacées. Afin de prédire ce sifflement, il est important de comprendre les mécanismes qui sont à l'origine de l'émission sonore. L'étude s'appuie sur une approche numérique en DNS par la méthode Lattice-Boltzman (LBM) confrontée à des résultats expérimentaux. Une expérience de tuyau corrugué chantant a été montée à IRPHE autour d'un tuyau rectangulaire transparent de longueur L = 2m. Les corrugations sont de dimension caractéristique de 10mm. L'écoulement a été caractérisé par des mesures de vitesse et de pression acoustique jusqu'à des vitesses de 25m/s. Une analyse de la dynamique des structures de la turbulence qui se développent dans les couches cisaillées de l'écoulement a été réalisée par la méthode d'estimation stochastique linéaire (LSE). Nous avons observé que ce sifflement trouve son origine dans le couplage entre les mouvements de la couche de cisaillement qui se développe sur les parois du tuyau et les modes acoustiques du tuyau. Le rôle des premières corrugations ayant été exploré expérimentalement, le modèle numérique permet d'étendre cette observation à tout le tuyau corrugué, et d'explorer les contributions des différentes régions de l'écoulement à l'émission sonore. Le modèle utilise le code libre Palabos, basé sur la LBM. Cette méthode numérique donne accès à un grand nombre de paramètres en tous points de la géométrie et permet d'observer les fluctuations de pression et de les corréler avec l'écoulement. La complémentarité des approches expérimentale et numérique apporte des éléments de compréhension des mécanismes de génération du sifflement dans les tuyaux corrugués.

descriptionView Paper arrow_downwardDownload

Vocal-fold collision mass as a differentiator between registers in the low-pitch range

by anne-maria laukkanen

2024, Journal of Voice

Register shift between the chest and falsetto register is generally studied in the higher-than-speaking pitch range. However, a similar difference can also be produced at speaking pitch level. The shift from breathy "falsetto" phonation... more

descriptionView Paper arrow_downwardDownload

Instabilité secondaire sur un anneau de vorticité

by Elie Rivoalen

2024

Nous caractérisons les instabilités qui apparaissent sur un anneau de vorticité à l'aide d'une méthode particulaire de type "vortex blob" dans laquelle un remaillage axi-symétrique évite l'apparition de modes parasites. On initialise... more

descriptionView Paper arrow_downwardDownload

Modélisation numérique des écoulements externes d'inverseur de poussée. Une aide pour la conception des nacelles

by Elie Rivoalen

2024

Lors de la phase d'atterrissage d'un avion de transport civil, une partie de l'écoulement entrant dans les réacteurs est réorientée grâce aux inverseurs de poussée. Ces derniers créent une contre poussée qui participe au freinage de... more

descriptionView Paper arrow_downwardDownload

Valoració objectiva de l’activitat física en sessions d’exercici físic d’un programa multidisciplinari per al tractament de l’obesitat infantil

by Noemí Serra-Paya

2024, Apunts Educació Física i Esports

The 60-minutes sessions were characterised by short bouts of MVPA interspersed by short bouts of LPA. Participants spent 58.3% of the duration of the session in MVPA, 30% in LPA and only 11.8% in sedentary behaviour. For all the sessions,... more

descriptionView Paper arrow_downwardDownload

A Training Model for Improving Journalists' Voice

by Emma Rodero

2024, Journal of Voice

Voice education is a crucial aspect for professionals (journalists, teachers, politicians, actors, etc.) who use their voices as a working tool. The main concerns about such education are that, first, there is little awareness of the... more

descriptionView Paper arrow_downwardDownload

Dynamic properties of tuned branch

by Ondřej Čepl

2024

The main part of the thesis is experimental determination of transmission in side-branch resonator. In first part are described basics of digital signal processing and sound damping in piping systems. A special attention is payed to... more

descriptionView Paper arrow_downwardDownload

Contrôle du sifflement d’un tuyau corrugué sous écoulement : analyse des données expérimentales

by Ulf Kristiansen

2023

Notre etude se concentre sur un phenomene souvent rencontre dans les tuyaux qui transportent des ecoulements de gaz. Les instabilites de l'ecoulement provoquees par les singularites geometriques internes au tuyau peuvent exciter les... more

descriptionView Paper arrow_downwardDownload

Respiratory physiotherapy techniques used in patients with neurological disease

by Daniela Botikova

2023, Listy klinické logopedie

Dýchání a jeho poruchy jsou oblastí zájmu mnoha zdravotnických lékařských i nelékařských oborů. U pacientů s neurologickým onemocněním dochází k paréze, tedy poklesu svalové síly, kromě jiného i dechových svalů, což má za následek vznik... more

descriptionView Paper arrow_downwardDownload

Respiratory physiotherapy techniques used in patients with neurological disease

by Daniela Botikova

2023, Listy klinické logopedie

descriptionView Paper arrow_downwardDownload

Mesures de pression et mesures de champs de vitesse synchronisés en phase au niveau du répartiteur d'admission automobile

by Yannick Bailly

2023

descriptionView Paper arrow_downwardDownload

Popis vlastností zvukového signálu a metod jeho zpracování

by Tomas Oramus

2023

Rád bych poděkoval svému vedoucímu Ing. Janu Skapovi, Ph.D, za jeho rady a vstřícný přístup při vedení mé bakalářské práce. Zároveň bych chtěl také poděkovat pedagogům Hudební fakulty Akademie múzických umění za poskytnutí materiálů z... more

descriptionView Paper arrow_downwardDownload

Perceptually motivated modeling of noise in pathological voices

by Jody Kreiman

2023, The Journal of the Acoustical Society of America

Research investigating the correlation of acoustic measures of noise and the perception of pathological voice quality has consistently demonstrated a moderate association. However, this correlational approach cannot address basic... more

descriptionView Paper arrow_downwardDownload

Toward a unified theory of voice production and perception

by Jody Kreiman

2023, Loquens

At present, two important questions about voice remain unanswered: When voice quality changes, what physiological alteration caused this change, and if a change to the voice production system occurs, what change in perceived quality can... more

descriptionView Paper arrow_downwardDownload

Gesture and body-movement as teaching and learning tools in Western classical singing

by Julia Nafisi

2023

This thesis investigates the use of gesture and body-movement as teaching and learning tools in Western classical singing. The introduction draws together a number of theoretical threads to argue why this study has been undertaken and what its objectives are. These threads are elaborated on in the literature review which covers the fields of Vocal Pedagogy, Learning, Gesture Studies, Choral Rehearsal, Music Education and Acting. The study uses two methodologies: survey and experiment. Using terminology devised by the author as Nafisi-system of singing movements, a survey amongst singing teachers in Australia and Germany establishes the prevalence and thus relevance of gestures as tools to enhance and/or illustrate explanation and/or demonstration in the communication of singing related concepts; similarly the survey confirms that voice teachers encourage singing students to use gesture and/or body-movement as tools to facilitate understanding and learning of physiological functions, thought concepts or musical ideas. The survey further yields a wealth of hitherto unknown information about many facets of voice teachers' use of gesture and movement in their teaching, testifying both to the potential power and controversy inherent in this teaching tool. While the survey had collected teachers' subjective assessments, the experiment sought to actually prove the effectiveness of gesture and body-movement. Following the argument that the quality of the vocal tone constituted the single most important factor in Western classical singing technique, it was propounded that a teaching intervention could only rightfully claim validity if its efficacy was evident in an improved quality of vocal tone and an experiment was designed to show just that. Within the limits of the experimental design, the results were unambiguous: Compared with a teaching intervention that emulated 'traditional' voice teaching without movement, the teaching interventions that incorporated gestures and/or body-movements were clearly superior in their efficacy in two out of four tested vocal tasks and equally as effective in the other two tested vocal tasks. 8 Greek mythology: Orpheus, who has been given his lyre by the god Apollo, sings so beautifully that he mesmerized gods, men, beasts, and even plants. 9 Second king of Israel (1040-970 BC); apart from being a famed warrior, he played the harp and sang for King Saul. 10 Greek philosopher (384-322 BC) the first to create a comprehensive system of Western philosophy. 11 Roman (of Greek ethnicity) physician (129-200 AD), medical researcher and philosopher.

descriptionView Paper arrow_downwardDownload

Voices and Listeners: Toward a Model of Voice Perception

by Diana Sidtis

2023

descriptionView Paper arrow_downwardDownload

The human larynx physiological and pathophysiological aspects

by Kanwarpreet Sadhu

2023, International Journal of Otorhinolaryngology and Head and Neck Surgery

During respiration, the glottis opens a fraction of second before air is drawn in by descent of the diaphragm, Green and Neil 1955. 2 This opening is brought about by contraction of the posterior cricoarytenoid muscles (Figure 1).... more

descriptionView Paper arrow_downwardDownload

The human larynx physiological and pathophysiological aspects

by Kanwarpreet Sadhu

2023, International Journal of Otorhinolaryngology and Head and Neck Surgery

In this era of minimally invasive surgical intervention s, the knowledge of the physiology and pathophysiology of the larynx is vital to the laryngologist. The conventional procedure of laryngeal surgery has been superseded by functional... more

descriptionView Paper arrow_downwardDownload

Mécanismes de transfert au sein d'une couche de glace en développement en présence d'un écoulement turbulent : approches expérimentale et numérique

by hau huynh

2023

L'etude concerne la prise en glace d'une paroi thermiquement controlee placee au sein d'une conduite dans laquelle se developpe un ecoulement turbulent d'air humide. L'ensemble des essais experimentaux a ete realise a... more

descriptionView Paper arrow_downwardDownload

PTV-4D de l’écoulement de microparticules plastique modèles dans une bifurcation

by Valérie Massardier-nageotte

2023, HAL (Le Centre pour la Communication Scientifique Directe)

descriptionView Paper arrow_downwardDownload

Contrôle d'un écoulement subsonique par utilisation de décharges surfaciques

by Dunpin Hong

2023

In this paper, electric discharges were studied in atmospheric air in order to modify subsonic airflows. The flows induced by a DC surface corona discharge and an AC Dielectric Barrier Discharge were measured with the PIV system. They... more

descriptionView Paper arrow_downwardDownload

A computational method for updating a probabilistic model of an uncertain parameter in a voice production model

by Rubens Sampaio

2023

The aim of this paper is to use Bayesian statistics to update a probability density function (p.d.f.) related to the tension parameter of the vocal folds, which is one of the main parameters responsible for the changing of the fundamental... more

Figure 1: Two-mass model of the vocal folds. The dynamics of the system is given by Eqs. (1) and (2) (Cataldo et. al., 2008, 2009): Some authors have modeled the vocal folds dynamics, mainly in a deterministic way (Koizumi et al., 1976; Lous et al., 1998; Zhang et al., 2005). One of these models is the well-known model proposed by Ishizaka and Flanagan (1972) and it will be used here because it has provided a simple and effective representation of the system for studying the underlying dynamics of voice production.

In order to validate the development presented here, voice signals produced by one person have been analyzed and their statistics have been compared with simulations. A voice signal corresponding to a sustained vowel /a/ has been recorded from one person and 1,800 frames were obtained from this signal, each one with 0.01s of length. For each frame, the corresponding fundamental frequency was calculated. So, a corresponding p.d-f., the so-called experimental, can be constructed. Figure 2 shows the p.d.f. of the fundamental frequency constructed from the experimental data. The problem to be solved to update the p.d.f. of Q, using Bayesian statistics, will be divided into two parts: at first. an inverse problem will be solved in order to obtain, from simulations, a p.d.f. of the fundamental frequency near to the experimental one. Then, at the second part, the p.d.f. obtained in the first part will be updated, using experimental date and the Bayesian method. Consequently, the updated p.d.f. of Q will be obtained. As explained above, the stochastic system is deduced from the deterministic one substituting ago, y, g by the random variables Ago, Y, Q. Consequently, the random variable associated to the fundamental frequency Fo is given by Fo = M (Ago, Y,Q). However, the nonlinear mapping -@ is not explicitly known and it is implicitly defined by Eqs. (1) and (2) substituting ago, y, g by random variables Ago, Y, Q. The fundamental frequency associated to each realization of the voice signal is calculated through the glottal signal, calculating the inverse of its period.

Figure 3: Probability density functions: experimental (continuous line) and simulated (dashed line). Figure 3 shows the probability density function constructed from experimental data and the probability density func- ion constructed from simulations. The function ksdensity from MATLAB was used. 4.2 Description of the second part

Figure 4: Probability density functions pp,jg (100 plots) and the probability density function of the fundamental frequency obtained from experimental data (thick line). pitas i a 2 01 8 To construct the function pyjg, 100 deterministic values of g (from 0.6153 up to 0.6442) were considered and, for each value of q, a p.d.f. of the fundamental frequency was obtained by simulation. The corresponding conditional p.d.f.’s are shown in fig. 4. So, corresponding values of Prial fo|q) can now be calculated, for giving values of fo and g. Here, values of fo wer considered from 109 Hz up to 125 Hz, with 0.1 Hz of spacing.

Figure 5: Updated p.d.f.’s of the fundamental frequency for different values of Vexp. Let Fy XP be the random variable associated to the fundamental frequencies obtained experimentally and pom the random variable associated to the simulated fundamental frequencies. The aim is to update the probability density function of Fsim, using experimental values of the fundamental frequency, applying the Eq. 15. Fig. 5 shows the updated p.d_f. (pee d, of the fundamental frequency considering Vexp = 1, 10, 100, 800and 1,800. Starting from Vexp = 1000, the p.d.f. does not change anymore. It means that the same p.d.f. is obtained considering Vv = 1,000 or more. It should be observed that the p.d.f. obtained for Vexp = 1,800 is almost the same of the p.d.f. constructed with experimental values.

Figure 6: Functions | pre — pam | (dashed line) and | Pre — pipe | (continuous line). Calculating the area under the plots of the Fig. 6, the values found were:

Figure 7: Prior p.d.f. of Q and the corresponding updated p.d.f’s. Now, it is possible to obtain the p.d.f. of Q, using Eq. (11). Figure 7 shows the plots of the prior probability density function of Q and the posterior probability density function of Q, obtained with Vexp = 1, 100, and1, 800. The p.d.f. of Q is near a delta function located in Q = 0.633. With this value of the parameter g, and considering the random variables Ago and Y, it is possible, using the model, to obtain the p.d.f. of the fundamental frequency constructed with experimental values.

descriptionView Paper arrow_downwardDownload

Phonated speech reconstruction using twin mapping models

by Iman Ardekani

2023, 2015 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

descriptionView Paper arrow_downwardDownload

A Training Model for Improving Journalists' Voice

by Olatz Larrea Estefanía

2023, Journal of Voice

descriptionView Paper arrow_downwardDownload

Test Rig Design Proposal for the Experimental Validation of Transmission Parameters in Open Loop Torque Condition

by Jiří Máca

2023

Tato závěrečná diplomová práce se zabývá problematikou měření vibrací a hluku převodových ústrojí. Rešeršní část popisuje základní problematiku vzniku hluku a vibrací, jejich měření, dále také zdroje vibrací v převodových ústrojích.... more

descriptionView Paper arrow_downwardDownload

Résonances acoustiques dans un tube corrugué sous écoulement

by Pierre-Olivier Mattei

2023

Résumé : Des mesures PIV associées à des mesures de vitesse acoustique par anémométrie à fil chaud permettent de caractériser l’écoulement et l’acoustique dans un tube corrugué. Un sifflement intense est provoqué par la cohérence entre... more

descriptionView Paper arrow_downwardDownload

A Stabilized Finite Element Method for the Mixed Wave Equation in an ALE Framework With Application to Diphthong Production

by Hector Huayta Espinoza

2023, Acta Acustica united with Acustica

descriptionView Paper arrow_downwardDownload

DP Simunkova Simona

by Simona Šimůnková

2023

Chtěla bych poděkovat Ing. Tereze Kráčmerové za odborné vedení, poskytování cenných rad a vstřícný přístup po celou dobu vedení mé diplomové práce. Dále bych chtěla poděkovat kolegům ze Samostatného oddělení lékařské fyziky Fakultní... more

descriptionView Paper arrow_downwardDownload

Stabilité linéaire de l'écoulement dans un canal courbe aux parois compliantes

by Christophe Airiau

2023

La stabilité linéaire de l'écoulement de Taylor-Couette entre deux cylindres aux parois compliantes (déformables) est considérée. Les parois sont modélisées comme des coques minces élastiques supportées par un ensemble de ressorts... more

descriptionView Paper arrow_downwardDownload

Voice Production

Key research themes

1. How can speech synthesis systems be adapted to support both speech and singing voice production from neutral speech corpora?

2. What computational and vocal models facilitate the control and learning of expressive vocal intonation and prosody, including for language learning and voice training?

3. How can physical and computational models of vocal fold physiology and acoustics improve understanding and simulation of voice production?

Related Topics

All papers in Voice Production