Academia.edu

Vocal Tract

2,920 papers
138 followers
About this topic
The vocal tract is the anatomical structure comprising the throat, mouth, and nasal passages that shapes and modifies sound produced by the larynx during speech and singing. It plays a crucial role in phonetics, influencing resonance, articulation, and the production of various speech sounds.

Key research themes

1. How do neural control mechanisms regulate complex coordination of vocal fold and vocal tract functions in voice and swallowing?

This research direction investigates the central and peripheral nervous system control over vocal fold motion and vocal tract muscles during voice production and swallowing. Understanding these neural control systems is crucial for elucidating normal voice and swallowing physiology and their pathologies, particularly in how reflexive and volitional aspects integrate through cortical and brainstem circuits.

Key finding: This review synthesizes evidence that voice and swallowing control is an integrative system involving cortical as well as brainstem circuits, contradicting earlier views of strict cortical versus limbic and brainstem...
Key finding: This review clarifies that despite historical uncertainty, intrinsic laryngeal muscles possess proprioceptive sensory receptors, and the larynx is richly endowed with sensory structures including muscle spindles, Golgi tendon...
Key finding: This article outlines neuroauriculotherapy as a neurophysiologically based clinical approach exploiting precise mappings between auricular points and central nervous system regions relevant to vocal tract and laryngeal...

2. What biomechanical and aerodynamic mechanisms underlie vocal fold hyperfunction and pathologies related to incomplete glottal closure, and how can computational modeling contribute?

This theme addresses the biomechanical basis of vocal hyperfunction disorders, especially phonotraumatic vocal fold lesions arising from compensatory behaviors triggered by incomplete glottal closure and maladaptive muscle activation patterns. Computational and numerical modeling approaches simulate vocal fold dynamics, aerodynamic forces, and feedback, providing measurable insights into mechanisms that cannot be directly observed in vivo.

Key finding: By implementing a lumped-element triangular glottis model incorporating prephonatory configurations and compensatory mechanisms (subglottal pressure, muscle activation, supraglottal constriction), this study quantified how...
Key finding: Empirical in vivo aerodynamic data showed that women with phonotraumatic vocal hyperfunction exhibit elevated subglottal pressure, peak glottal airflow, and maximum flow declination rate relative to controls and...
Key finding: Using computational fluid dynamics (CFD) simulations of normal and paradoxically adducted vocal cords, this study demonstrated that vocal cord dysfunction episodes produce elevated airflow resistance, chaotic flow, and...

3. How do vocal tract anatomical configurations, including dynamic shape changes in various singing techniques and exercises, influence phonation acoustics and vocal quality?

This theme examines the morphological variations of the vocal tract, including vocal fold dimensions and vocal tract cavity shapes, across different phonatory modes and singing styles, utilizing imaging approaches such as MRI and CT. It explores how articulatory adjustments modulate acoustic resonances and vocal tract filtering, consequently shaping voice quality, pitch, and loudness, which is essential knowledge for therapeutic interventions and vocal pedagogy.

Key finding: MRI measurements revealed distinctive vocal tract configurations across four Complete Vocal Technique modes (Neutral, Curbing, Overdrive, Edge), with Edge showing maximal laryngeal and pharyngeal narrowing and shortest vocal...
Key finding: Comparative MRI analyses of three singing styles within the same individuals revealed that Opera singing involves a lower larynx and larger pharyngeal space than Kulning, while Edge exhibits the highest larynx position and...
Key finding: CT scans during phonation into tubes showed significant increases in vocal tract vertical length, oropharyngeal and hypopharyngeal areas, accompanied by lowered laryngeal position and velopharyngeal closure in patients with...
Key finding: This CT-based study of two vocally healthy individuals phonating before, during, and after tube phonation found no consistent trends in changes to vocal fold dimensions (thickness, length, bulging, glottal width). The...

All papers in Vocal Tract

In this work, we present a silent speech system that is able to generate audible speech from captured movement of speech articulators. Our goal is to help laryngectomy patients, i.e. patients who have lost the ability to speak following...
High vertical laryngeal position (VLP), pharyngeal constriction, and laryngeal compression are common features associated with hyperfunctional voice disorders. The present study aimed to observe the effect on these variables of different...
The formant frequencies of vowels in a language are considered one of the important acoustic parameters of the speech signal. This parameter can be seen as the acoustic resonance of the human vocal tract. Although formant frequencies, which are changeable...
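To illustrate the idea of formants as vocal tract resonances, the following minimal sketch uses the textbook approximation of the vocal tract as a uniform tube closed at the glottis and open at the lips, whose resonances fall at odd multiples of the quarter-wavelength frequency. This is not the method of the paper above; the function name `tube_formants`, the 17.5 cm tract length, and the 350 m/s speed of sound are assumptions chosen for illustration.

```python
def tube_formants(length_m, n_formants=3, speed_of_sound=350.0):
    """Resonances (Hz) of a uniform tube closed at one end and open at the other.

    Assumption: the vocal tract is idealized as a lossless uniform tube,
    so the k-th resonance is F_k = (2k - 1) * c / (4 * L).
    """
    return [
        (2 * k - 1) * speed_of_sound / (4.0 * length_m)
        for k in range(1, n_formants + 1)
    ]


# Hypothetical adult vocal tract length of 17.5 cm.
formants = tube_formants(0.175)
print([round(f) for f in formants])  # approximately 500, 1500, 2500 Hz
```

Real vowels deviate from these values because the tract is not uniform; changing its shape shifts each resonance, which is what makes formants informative about articulation.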
Emphasis (contrastive pharyngealization of coronals) in Arabic spreads from an emphatic consonant to neighboring segments. Previous research suggests that in addition to changing spectral characteristics of adjacent segments, emphasis...
This paper presents a sensor-augmented saxophone mouthpiece which promotes data collection from musicians. The collected data aims to unveil the mechanics of saxophone tone formation from the embouchure and air flow control, for which only...
This paper uses magnetic resonance imaging (MRI) to examine the articulatory correlates of the Hero and Villain Voice Types, which were auditorily identified in a separate study on cartoon voices. In general, the MRI images...
Based on the assumption that the goals of phonemic speech movements are both auditory and somatosensory in nature, a biomechanical model of the vocal tract in conjunction with an adaptive controller inspired by the DIVA model of speech...
Alcohol is known to impair fine articulatory control and movements. In drunken speech, incomplete closure of the vocal tract can result in deaffrication of the English affricate sounds /tʃ/ and /ʤ/, spirantization (fricative-like...
Best Tree Encoding (BTE) is a promising feature extraction technique based on wavelet packet decomposition that is utilized in Automatic Speech Recognition (ASR). This research introduces an enhancement of Wavelet Packet Best Tree (WPBT)...
Conduction aphasia is a language disorder characterized by frequent speech errors, impaired verbatim repetition, a deficit in phonological short-term memory, and naming difficulties in the presence of otherwise fluent and grammatical...
The basic goal of a voice conversion system is to mimic the characteristics of the target speaker's voice while keeping the linguistic and paralinguistic information intact. The characteristics of a speaker are reflected in speech at different levels...
The complex cepstrum vocoder is used to modify the speaker-specific characteristics of the source speaker's speech to those of the target speaker's speech. Low-time and high-time liftering are used to split the calculated cepstrum into the...
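The liftering step mentioned above can be sketched as follows. Note the swap: this sketch uses the simpler real cepstrum, not the complex cepstrum of the paper, which additionally requires phase unwrapping. The low-quefrency part is commonly associated with the smooth spectral envelope (vocal tract) and the high-quefrency part with the excitation. The helper names, the synthetic frame, and the cutoff of 8 samples are all assumptions for illustration, and a naive DFT is used only to keep the example dependency-free.

```python
import cmath
import math


def dft(x):
    """Naive O(n^2) discrete Fourier transform (fine for short frames)."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(spectrum):
    """Naive inverse DFT."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def real_cepstrum(frame):
    """Real cepstrum: inverse DFT of the log magnitude spectrum."""
    log_mag = [math.log(abs(v) + 1e-12) for v in dft(frame)]
    return [c.real for c in idft(log_mag)]


def lifter(cepstrum, cutoff):
    """Split a cepstrum into low-time and high-time parts.

    Quefrency is treated symmetrically (index q and n - q), so the
    low-time part keeps both ends of the array.
    """
    n = len(cepstrum)
    low = [0.0] * n
    high = [0.0] * n
    for q, c in enumerate(cepstrum):
        if min(q, n - q) < cutoff:
            low[q] = c
        else:
            high[q] = c
    return low, high


# Synthetic frame (hypothetical): a damped sinusoid plus a pulse every 16 samples.
frame = [
    math.sin(2 * math.pi * 5 * t / 64) * math.exp(-t / 32)
    + (1.0 if t % 16 == 0 else 0.0)
    for t in range(64)
]
cep = real_cepstrum(frame)
low, high = lifter(cep, cutoff=8)
```

The two parts sum back to the original cepstrum, so each can be modified separately before resynthesis; the appropriate cutoff depends on the sampling rate and the speaker's pitch.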
In speaker transformation, the speaker-dependent spectral parameters are generally characterized by single-scale features. These features approximate the vocal tract, but produce artifacts during speech signal reconstruction. In this...
The objective of a voice conversion system is to formulate a mapping function that can transform the source speaker's characteristics into those of the target speaker. In this paper, we propose the General Regression Neural Network (GRNN)...
In this paper we present the results of three distinct studies addressing the height of European Portuguese nasal vowels. The studies contemplated: analysis of EMMA data for one male speaker, analysis of first formant values for nasal vowels after stops...
Patients with larynx cancer often lose their voice following total laryngectomy. Current methods for postlaryngectomy voice restoration are all unsatisfactory for different reasons: requiring frequent replacement due to biofilm growth...
They use optical sensors and artificial intelligence methods for process supervision and diagnostics. The research aims to develop a system allowing a parametric evaluation of the quality of pulverized coal burner operation. Due to the...
Extracting reliable 3D facial deformation parameters from static facial postures is a major component of our system for audiovisual synthesis. This paper describes several important improvements to that process, including reduction of...
When creating realistic talking head animations, accurate modeling of speech articulators is important for speech perceptibility. Previous lip modeling methods such as simple numerical lip modeling focus on creating a general lip model...
At first glance, the flute and the saxophone seem very different instruments: one is metallic with a reed, the other is an open tube blown into directly. But both work on the same physical principle: the vibration of a column of air inside...
Articulation and posture determine the configuration of the vocal tract, defining voice production. The aim was to characterize the vocal profile of teachers with and without vocal complaints; to describe and compare, by means of objective analysis and...
is considered the father of American classical saxophone. He was the first university professor of saxophone in the United States, teaching at the University of Michigan from 1953 to 1974. His work *The Art of Saxophone Playing*, published in 1963,...
In recent years, accessories have proliferated that promise to "improve" the sound of the saxophone by adding mass at specific points: the neck, fixing rings/bands, special screws, the bell, and even support bases. The claim...
Recent techniques for evaluating vocal tract acoustic transfer functions are based on external excitation of the tract at the level of the thyroid cartilage. This paper presents further developments of methods using Gaussian white noise or...
How are the glottal source and vocal tract involved in the sopranos' passaggio around E5? This pilot study investigates a legit soprano singing C5 to C6 with a lyrical technique and C5 to D7 in a "light" voice. On (de)crescendi...
Among musicians, awareness of the influence of the vocal tract on the clarinet sound is well established; indeed, some clarinet players, by means of small alterations of their own vocal tract, are able to obtain...
In this paper we examine accounts of "exotic" components of the sound systems of Iroquoian, Polynesian and Khoisan languages, and their implications for the history of phonetic studies and linguistics in general. On the basis of examples...
This work is part of a project aiming at defining possible speech prerequisites in the geometry, the musculature and the control of the vocal tract. In this paper we intend to reconstruct anatomical and geometrical landmarks of the vocal...
This work is part of a project in quest of the origin of speech. From classical bony landmarks of the head and jaw used in anthropology, and using a generic model of the vocal tract, we attempted to apply the prediction of geometric...
The objective of this work is twofold. First, a model of the vocal tract is positioned into the bony architecture of the male and female skulls from birth to adulthood. Second, vowel spaces are determined and vowel prototypes, for the...
range of human speech". There is, therefore, no reason to believe that the lowering of the larynx and a concomitant increase in pharynx size are necessary evolutionary preadaptations for speech.
There has been a lack of objective data on the singing voice registers, particularly on the so-called “whistle” register, occurring in the top part of the female pitch range, which is accessible only to some singers. This study offers...
We address the hypothesis that postures adopted during grammatical pauses in speech production are more "mechanically advantageous" than absolute rest positions for facilitating efficient postural motor control of vocal tract...
Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is, however,...
A real-time MRI examination of retroflex stops and rhotics in Tamil reveals that in some contexts these consonants may in fact be achieved with little or no retroflexion of the tongue tip. Rather, maneuvering and shaping of the tongue in...
The English past tense allomorph following a coronal stop (e.g., /bɑndəd/) includes a vocoid that has traditionally been transcribed as a schwa or as a barred i. Previous evidence has suggested that this entity does not involve a specific...
Speech production can be described in multiple coordinate frames: articulatory configurations, gestural tasks, and acoustic patterns. Examination of the achievement of retroflex stops and liquids in Tamil suggests that we must consider...
This paper presents an automatic procedure to analyze articulatory setting in speech production using real-time magnetic resonance imaging of the moving human vocal tract. The procedure extracts frames corresponding to inter-speech...
The original task-dynamic model of speech production incorporated the theoretical tenets of Articulatory Phonology and provided a dynamics of inter-articulator coordination for single and co-produced constriction gestures, given a...
Speech is produced when a time-varying vocal tract system is excited by a time-varying excitation source. Therefore, the information present in speech, such as message, emotion, language, and speaker, is due to the combined effect of both...
The paper describes advances in the development of an ultrasound silent speech interface for use in silent communications applications or as a speaking aid for persons who have undergone a laryngectomy. It reports some first steps towards...
Background: The astonishing variety of sounds that birds can produce has been the subject of many studies aiming to identify the underlying anatomical and physical mechanisms of sound production. An interesting feature of some bird...
Active control is widely used in industry. However, there have been relatively few applications to musical instruments, particularly wind instruments. The aim of this study is to attempt to control the sound quality and playability of...