Academia.eduAcademia.edu

Speaker Characterization

description8 papers
group11 followers
lightbulbAbout this topic
Speaker characterization is the analytical process of identifying and describing the distinctive traits, attributes, and vocal qualities of a speaker in spoken discourse. This field examines aspects such as tone, pitch, accent, and speech patterns to understand the speaker's identity, intentions, and emotional state within a communicative context.
lightbulbAbout this topic
Speaker characterization is the analytical process of identifying and describing the distinctive traits, attributes, and vocal qualities of a speaker in spoken discourse. This field examines aspects such as tone, pitch, accent, and speech patterns to understand the speaker's identity, intentions, and emotional state within a communicative context.

Key research themes

1. How do voice quality variations influence the perceived personality traits and charisma of a speaker?

This research area investigates how different laryngeal and supralaryngeal voice qualities produced by the same individual affect listeners’ perceptions of that speaker’s personality traits and charisma. It matters because voice quality conveys social and emotional cues crucial for interpersonal communication, speaker profiling, and forensic applications.

Key finding: This study found that voice quality variations, including modal, creaky, breathy (natural and artificial), nasalization, and smiling, produced by the same speakers significantly impacted listener ratings on personality traits... Read more
Key finding: Listeners showed generally low accuracy (~33%, chance level) in judging speakers’ personality traits from speech alone, although some traits like Aggression and Social Potency had slightly higher recognition rates.... Read more
Key finding: This paper identifies fundamental frequency (f0) measures that best correlate with perceived speaker charisma, finding that mean f0 is the most effective pitch-level metric while kurtosis and 80-percentile f0 range optimally... Read more

2. What acoustic and phonetic features capture speaker-specific variability in spontaneous and controlled speech for speaker characterization and recognition?

This research theme focuses on identifying phonetic, acoustic, and articulatory features that characterize speaker individuality across different speech styles (read and spontaneous) and linguistic contexts. This theme is vital for improving speaker recognition systems, forensic voice comparison, and understanding within-speaker variability versus between-speaker differences.

Key finding: The study extended prior findings from read speech to spontaneous speech for the same 99/100 talkers and showed that acoustic voice spaces remain highly similar across speaking styles, with fundamental frequency variability... Read more
Key finding: Using a large corpus of Japanese vowels produced in varied phonetic contexts, this study demonstrated that coarticulation affects lower-formant related sub-bands more strongly, whereas speaker effects dominate higher-formant... Read more
Key finding: This presentation summarized exploratory analysis on the complex interaction between speaker differences and phonetic context using cepstral measures, highlighting the importance of quantifying relative contributions of... Read more
Key finding: This paper investigated the phoneme distributions within Gaussian Mixture Model (GMM) clusters representing speakers, revealing that certain phonetic segments contribute disproportionately to speaker modeling efficacy. The... Read more

3. How can speaker demographic traits such as age, height, and physiognomic factors be automatically estimated from speech using i-vector frameworks and machine learning?

This theme investigates computational methods, especially i-vector representations combined with regression and classification models, to infer speaker profile traits like age and height from speech. These traits offer valuable auxiliary information in forensic cases, user profiling, and personalized human-computer interaction systems. Understanding effectiveness, limitations, and variability factors improves model design and forensic applicability.

Key finding: The thesis developed novel approaches for estimating speaker age, height, weight, and smoking habits from spontaneous telephone speech using i-vector and Non-negative Factor Analysis (NFA) frameworks combined with Artificial... Read more
Key finding: The study proposed an age estimation method leveraging i-vectors and Within-Class Covariance Normalization, followed by Least Squares Support Vector Regression, achieving lower mean absolute error and higher correlation with... Read more
Key finding: This paper presented an automatic speaker height estimation approach using i-vectors and regression models (ANN and LSSVR), yielding effective height predictions on the NIST 2008 and 2010 SRE corpora. This contributes to the... Read more

All papers in Speaker Characterization

In this paper, a new approach for age estimation from speech signals based on i-vectors is proposed. In this method, each utterance is modeled by its corresponding i-vector. Then, a Within-Class Covariance Normalization technique is... more
This paper proposes a novel approach for automatic speaker height estimation based on the i-vector framework. In this method, each utterance is modeled by its corresponding ivector. Then artificial neural networks (ANNs) and least-squares... more
The longstanding aim of achieving robust forensic voice identification is hampered by a number of complex and intertwined factors of variability in the speech signal such as: (1) speaker differences; (2) co-articulation effects; (3)... more
Download research papers for free!