Speaker Verification

2,074 papers

6,169 followers

About this topic

Speaker verification is a biometric authentication process that uses voice characteristics to confirm an individual's identity. It involves analyzing vocal attributes, such as pitch, tone, and speech patterns, to determine if the speaker matches a pre-registered voice model, ensuring secure access to systems or information.
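
The matching step described above can be sketched as a similarity test between a stored voice model and a new sample. This is only a minimal illustration: the fixed-length "voiceprint" vectors, the cosine similarity measure, and the 0.7 threshold are assumptions chosen for the example, not details taken from this page or from any particular system.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def verify(enrolled, test, threshold=0.7):
    """Accept the identity claim if the similarity reaches the threshold.

    The threshold value is hypothetical; real systems tune it on held-out data.
    """
    return cosine_similarity(enrolled, test) >= threshold

# Toy vectors standing in for a pre-registered voice model and two test samples.
enrolled_voiceprint = [0.9, 0.1, 0.4]
same_speaker = [0.8, 0.2, 0.5]
other_speaker = [-0.3, 0.9, 0.1]

claim_accepted = verify(enrolled_voiceprint, same_speaker)
claim_rejected = not verify(enrolled_voiceprint, other_speaker)
```

Raising the threshold rejects more impostors at the cost of rejecting more genuine speakers, which is the trade-off that error metrics such as the equal error rate summarize.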

Key research themes

1. How can speaker verification systems be robustly defended against diverse spoofing attacks including voice conversion, speech synthesis, and replay?

This research area focuses on understanding the vulnerabilities of automatic speaker verification (ASV) systems to a broad range of spoofing attacks, such as voice conversion, speech synthesis, and replay attacks, which pose severe security threats. It also investigates the design and evaluation of anti-spoofing countermeasures, including databases, protocols, and methodologies to detect and mitigate both known and unknown spoofing types, particularly in the context of text-independent ASV systems. The work is significant because spoofing can undermine the reliability of ASV systems deployed in real-world applications such as call centers, banking, and forensic investigations.

Anti-Spoofing for Text Independent Speaker Verification

by International Journal of Scientific Research in Science, Engineering and Technology IJSRSET

2017

Key finding: This study introduces the first comprehensive spoofing and anti-spoofing (SAS) database comprising nine diverse spoofing techniques (including multiple speech synthesis and voice conversion systems) for text-independent... Read more

ASVspoof: The Automatic Speaker Verification Spoofing and Countermeasures Challenge

by Tomi Kinnunen

2022, IEEE Journal of Selected Topics in Signal Processing

Key finding: Describes the community-driven ASVspoof initiative that addresses the lack of common datasets and standardized protocols by providing the ASVspoof 2015 dataset and organizing competitive evaluations, demonstrating the... Read more

Spoofing and countermeasures for automatic speaker verification

by Tomi Kinnunen

2021

Key finding: Provides a detailed survey of vulnerabilities unique to text-independent ASV systems, emphasizing how prior countermeasures often rely on known spoofing attacks and lack generalizability. It highlights the need for standard... Read more

Joint Speaker Verification and Antispoofing in the i-Vector Space

by Tomi Kinnunen

2016, IEEE Transactions on Information Forensics and Security

Key finding: Presents a novel joint modeling approach in the i-vector subspace that simultaneously addresses speaker verification and voice conversion spoofing attack detection without relying on tailored discriminative features. By... Read more

Voice Spoofing Countermeasures: Taxonomy, State-of-the-art, experimental analysis of generalizability, open challenges, and the way forward

by Awais A. Khan

2024, arXiv (Cornell University)

Key finding: Provides an extensive taxonomy and comprehensive experimental comparison of spoofing countermeasures across diverse feature extraction and classification paradigms, examining their generalizability on ASVspoof2019 and VSDC... Read more

2. What techniques improve speaker verification performance and robustness under practical conditions such as limited data, language mismatch, recording channel variability, and multi-speaker environments?

This research theme focuses on enhancing speaker verification accuracy and reliability in realistic and challenging conditions. It includes methods dealing with limited-duration speech segments, channel distortions (e.g., GSM transcoded speech), multilingual and cross-lingual mismatches, and speaker overlap situations. The research addresses acoustic feature design, fusion of complementary feature sets, model adaptation, and joint optimization strategies to maintain verification performance in heterogeneous real-world scenarios.

i-Vector-Based Speaker Verification on Limited Data Using Fusion Techniques

by Jayanthi Kumari

2023, Journal of Intelligent Systems

Key finding: Demonstrates that combining vocal tract features (MFCC, LPCC) with excitation source features (LPR, LPRP) using feature- and score-level fusion significantly reduces equal error rate (EER) in i-vector based speaker... Read more
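
For readers unfamiliar with the equal error rate (EER) mentioned above, the following sketch shows one simple way to estimate it from a set of genuine-trial and impostor-trial scores. The function name, the threshold sweep over observed scores, and the toy scores are assumptions made for illustration; they are not the evaluation procedure of the paper listed here.

```python
def equal_error_rate(genuine, impostor):
    """Approximate the EER by trying every observed score as a threshold.

    A trial is accepted when its score is >= the threshold. The EER is taken
    at the threshold where the false acceptance rate (FAR) and the false
    rejection rate (FRR) are closest, as the mean of the two rates.
    """
    best_gap, eer = float("inf"), None
    for threshold in sorted(set(genuine) | set(impostor)):
        frr = sum(s < threshold for s in genuine) / len(genuine)
        far = sum(s >= threshold for s in impostor) / len(impostor)
        gap = abs(far - frr)
        if gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer

# Toy scores: one genuine trial scores low and one impostor trial scores high.
genuine_scores = [0.9, 0.8, 0.7, 0.4]
impostor_scores = [0.6, 0.3, 0.2, 0.1]
eer = equal_error_rate(genuine_scores, impostor_scores)
```

With few trials this estimate is coarse; evaluation toolkits typically interpolate between operating points instead of picking the nearest one.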

The impact of mismatched recordings on an automatic-speaker-recognition system and human listeners

by Radek Skarnitzl

2024, Acta Universitatis Carolinae. Philologica

Key finding: Empirically shows that both automatic speaker recognition systems based on i-vectors/x-vectors and human listeners experience performance degradation when comparing recordings that differ in language and recording time. The... Read more

Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation

by Man-wai Mak

2024, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

Key finding: Proposes a novel data-dependent score fusion algorithm that computes adaptive weights for fusing multiple utterance scores in GSM-transcoded speech speaker verification, using prior knowledge from enrollment scores. This... Read more

Speaker Verification Based on Single Channel Speech Separation

by Mijit Ablimit

2025, IEEE Access

Key finding: Introduces an integrated approach combining feature-scale single-channel speech separation with back-end speaker verification, using neural network-based separation models and MFCC-T features. The proposed method trains both... Read more

Comparison of Text Independent Speaker Identification Systems using GMM and i-Vector Methods

by ab kh

2019

Key finding: Finds that i-vector-based speaker identification systems outperform Gaussian mixture model (GMM) methods, especially when combined with PLDA classifiers and features like PNCC and RASTA-PLP, and that augmenting features with... Read more

3. How can speaker verification fairness across demographic and language groups be improved without requiring subgroup labels or creating reliance on balanced data samples?

This research area addresses performance disparities in speaker verification systems arising from imbalanced representation of demographic groups such as gender and nationality, or language variability. The focus is on algorithmic fairness approaches that automatically identify underperforming groups without explicit annotations, using adversarial learning, group-adapted embeddings, fusion networks, and reweighting schemes. This direction is crucial for equitable deployment of speaker verification in diverse real-world populations and for mitigating biases inherent in training data.

Adversarial Reweighting for Speaker Verification Fairness

by Andreas Stolcke

2024, arXiv (Cornell University)

Key finding: Reformulates adversarial reweighting (ARW) for speaker verification with metric learning, enabling the adversarial network to assign higher weights to poorly performing instances without subgroup annotations. Demonstrates... Read more

Improving Fairness in Speaker Verification via Group-Adapted Fusion Network

by Andreas Stolcke

2024, ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Key finding: Proposes a modular network architecture combining group-specific embedding adaptation and score fusion to mitigate model unfairness caused by imbalanced gender representation during training. Experiments show that this... Read more

Enhancing speaker verification accuracy with deep ensemble learning and inclusion of multifaceted demographic factors

by International Journal of Electrical and Computer Engineering (IJECE) and

2023, International Journal of Electrical and Computer Engineering (IJECE)

Key finding: Develops an ensemble-based deep learning framework integrating gender and ethnicity classifiers with a Siamese verification network, and demonstrates improved equal error rates and decision cost functions on the large-scale... Read more

All papers in Speaker Verification

Using Reversed MFCC and IT-EM for Automatic Speaker Verification

by Tariq Jamil Saifullah Khanzada

2025, Mehran University Research Journal of Engineering and Technology

This paper proposes a text-independent automatic speaker verification system using IMFCC (Inverse/Reverse Mel Frequency Coefficients) and IT-EM (Information Theoretic Expectation Maximization). To perform speaker verification, feature... more

Study of Speaker’s Emotion Identification for Hindi Speech

by Sushma Bahuguna

2025

An emotion-based speaker identification system automatically identifies a speaker’s emotion from features extracted from speech waves. This paper presents experiments on building and testing a speaker’s emotion... more

Limited-data automatic speaker verification algorithm using band-limited phase-only correlation function

by Angel David Pedroza Ramírez

2025, TURKISH JOURNAL OF ELECTRICAL ENGINEERING & COMPUTER SCIENCES

In this paper, a new method for automatic speaker verification based on band-limited phase-only correlation (BLPOC) is proposed. The aim of this study is to validate the use of the BLPOC function as a new limited-data automatic... more

Saudi Accented Arabic Voice Bank

by Mansour Alghamdi

2025, Journal of King Saud University - Computer and Information Sciences

The aim of this paper is to present an Arabic speech database that represents Arabic native speakers from all the cities of Saudi Arabia. The database is called the Saudi Accented Arabic Voice Bank (SAAVB). Preparing the prompt sheets,... more

Noise robust LVCSR feature extraction based on stabilized weighted linear prediction

by Kalle Palomäki

2025

In this paper, we evaluate a recently proposed spectral envelope estimation method, stabilized weighted linear prediction (SWLP), in the feature extraction stage of a large vocabulary continuous speech recognizer (LVCSR) system. Using... more

Support Vector Machine Based Approaches For Real Time Automatic Speaker Recognition System

by Satyanand Singh

2025

It is known that the Percentage of Identification Accuracy (PIA) of Automatic Speaker Recognition (ASR) systems is increasingly vulnerable to factors such as noise and channel degradation in real-time operation. This study presents a novel class SVM and... more

Combining face and voice modalities for person verification from video sequences

by Aytul Ercil

2025, Proceedings of the IEEE 12th Signal Processing and Communications Applications Conference, 2004.

In this paper, a multimodal person verification system is presented. The system is based on face and voice modalities. Fusion of information derived from each modality is performed at the matching score level using sum rule. For face... more
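
Score-level fusion with the sum rule, as mentioned in the abstract above, can be sketched as follows. The min-max normalization step, the score ranges, and the acceptance threshold are assumptions introduced for this example; the abstract does not say how the paper normalizes or thresholds its scores.

```python
def min_max_normalize(score, low, high):
    """Map a raw matcher score onto [0, 1] given its expected range."""
    return (score - low) / (high - low)

def sum_rule_fusion(face_score, voice_score, face_range, voice_range):
    """Sum rule: add the normalized matching scores of the two modalities."""
    face = min_max_normalize(face_score, *face_range)
    voice = min_max_normalize(voice_score, *voice_range)
    return face + voice

def accept(fused_score, threshold=1.0):
    """Accept the claim when the fused score reaches a (hypothetical) threshold."""
    return fused_score >= threshold

# Hypothetical raw scores: the face matcher outputs values in [0, 1],
# the voice matcher outputs values in [0, 50].
fused = sum_rule_fusion(0.8, 30.0, (0.0, 1.0), (0.0, 50.0))
decision = accept(fused)
```

Normalizing first keeps the modality with the larger raw score range from dominating the sum.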

Face Recognition using Simplified Probabilistic Linear Discriminant Analysis

by Jerneja Gros

2025, International Journal of Advanced Robotic Systems

remains an open problem that has not been satisfactorily solved by existing recognition techniques. In this paper, we tackle this problem using a variant of the recently proposed Probabilistic Linear Discriminant Analysis (PLDA). We show... more

The effect of removing semantic information upon the impact of a voice imitation

by Jan Van Doorn

2025, Proceedings of SST2002 (Melbourne, Australia) pp

Previous research has shown both that listeners' ability to detect high quality voice imitation results in judicially worrying misidentification rates (Schlichting & Sullivan, 1997) and that the semantic expectation of the... more

Comparative Study on Spoken Language Identification Based on Deep Learning

by Panikos Heracleous

2025, 2018 26th European Signal Processing Conference (EUSIPCO)

Spoken language identification is the process by which the language in a spoken utterance is recognized automatically. Spoken language identification is commonly used in speech translation systems, in multi-lingual speech recognition, and... more

Analysing Forensic Speaker Verification by Utilizing Artificial Neural Network

by Susanto Susanto

2025, Advances in Social Science, Education and Humanities Research

In this paper, we describe the use of Artificial Neural Network (ANN) to compute the acoustic features in analysing forensic speaker verification. In the computation, there are two datasets derived from speech recording of a simulated... more

MFCC Based Text-Dependent Speaker Identification Using BPNN

by Dr SUVARNA NANDYAL

2025, International Journal of Signal Processing Systems

Speech processing has emerged as one of the important application areas of digital signal processing. Research fields in speech processing include speech recognition, speaker recognition, speech synthesis, speech coding, etc.... more

Factors Affecting the Performance of Automated Speaker Verification in Alzheimer’s Disease Clinical Trials

by Jekaterina Novikova

2025

Detecting duplicate patient participation in clinical trials is a major challenge because repeated patients can undermine the credibility and accuracy of the trial's findings and result in significant health and financial risks.... more

Large Margin Learning of Bayesian Classifiers Based on Gaussian Mixture Models

by Franz Pernkopf

2025, Springer eBooks

We present a discriminative learning framework for Gaussian mixture models (GMMs) used for classification based on the extended Baum-Welch (EBW) algorithm. We suggest two criteria for discriminative optimization, namely the class... more

Voices Obscured in Complex Environmental Settings (VOiCES) Corpus

by Maria Barrios

2025, Interspeech 2018

This paper introduces the Voices Obscured in Complex Environmental Settings (VOiCES) corpus, a freely available dataset under Creative Commons BY 4.0. This dataset will promote speech and signal processing research of speech recorded by... more

Detecting audio-visual synchrony using deep neural networks

by Josef Vopicka

2025, Interspeech 2015

In this paper, we address the problem of automatically detecting whether the audio and visual speech modalities in frontal pose videos are synchronous or not. This is of interest in a wide range of applications, for example spoof... more

Increasing the Robustness of i-vectors with Model Compensated First Order Statistics

by Zekeriya Tufekci

2025, Afyon Kocatepe University Journal of Sciences and Engineering

Speaker recognition systems achieved significant improvements over the last decade, especially due to the performance of the i-vectors. Despite the achievements, mismatch between training and test data affects the recognition performance... more

A Review on Feature Extraction for Speaker Recognition under Degraded Conditions

by Zekeriya Tufekci

2025, IETE Technical Review

Speaker Recognition systems exhibit a decrease in performance when the input speech is not in optimal circumstances, for example when the user is under emotional or stress conditions. The objective of this paper is measuring the effects... more

Acoustic Feature Learning for Robust Speaker Verification Under Mismatched Noise and Recording Conditions

by QIT Press

2025, Quality Institute of Technology Press (QIT Press)

Speaker verification is crucial in biometric security systems, but performance degradation occurs under mismatched noise and recording conditions. This paper explores acoustic feature learning techniques to enhance robustness in speaker... more

Preliminary intelligibility tests of a monaural speech segregation system

by Pierre Divenyi

2025

Human listeners are able to understand speech in the presence of a noisy background. How to simulate this perceptual ability remains a great challenge. This paper describes a preliminary evaluation of intelligibility of the output of a... more

Influence of the attack conditions on countermeasures for Automatic Speaker Verification

by Raphaël GREFF

2025

The goal of the ASVspoof challenges is to evaluate countermeasures to spoofing attacks on automatic speaker verification systems. We first analyze in more detail the results of the baseline systems provided by the organization and unveil several... more

BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020

by Alicia Lozano-Diez

2025, Interspeech 2020

In this paper, we present the winning BUT submission for the text-dependent task of the SdSV challenge 2020. Given the large amount of training data available in this challenge, we explore successful techniques from text-independent... more

Using Triplet Loss for Bird Species Recognition on BirdCLEF 2020

by Juan Colonna

2025

This paper presents the approach used in the BirdCLEF 2020 Competition. The objective of the competition is to recognize bird species, out of 960 candidates, from their songs and calls in soundscapes. We use a MultiScale CNN + Triplet... more

I-MSV 2022: Indic-Multilingual and Multi-sensor Speaker Verification Challenge

by Jagabandhu Mishra

2025, arXiv (Cornell University)

Speaker Verification (SV) is a task to verify the claimed identity of the claimant using his/her voice sample. Though there exists an ample amount of research in SV technologies, the development concerning a multilingual conversation is... more

Optimizing a-DCF for Spoofing-Robust Speaker Verification

by Jagabandhu Mishra

2025, arXiv (Cornell University)

Automatic speaker verification (ASV) systems are vulnerable to spoofing attacks. We propose a spoofing-robust ASV system optimized directly for the recently introduced architecture-agnostic detection cost function (a-DCF), which allows... more

Divergence-based out-of-class rejection for telephone handset identification

by Sun-yuan Kung

2025, 7th International Conference on Spoken Language Processing (ICSLP 2002)

Research has shown that handset selectors can be used to assist telephone-based speech/speaker recognition. Most handset selectors, however, simply select the most likely handset from a set of known handsets even for speech coming from an... more

Adaptive decision fusion for multi-sample speaker verification over GSM networks

by Sun-yuan Kung

2025, 8th European Conference on Speech Communication and Technology (Eurospeech 2003)

In speaker verification, a claimant may produce two or more utterances. In our previous study, we proposed to compute the optimal weights for fusing the scores of these utterances based on their score distribution and our prior knowledge... more

Channel robust speaker verification via Bayesian blind stochastic feature transformation

by Sun-yuan Kung

2025, Interspeech 2005

In telephone-based speaker verification, the channel conditions can vary significantly from session to session. Therefore, it is desirable to estimate the channel conditions online and compensate the acoustic distortion without... more

Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation

by Sun-yuan Kung

2025, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03).

In speaker verification, a claimant may produce two or more utterances. Typically, the scores of the speech patterns extracted from these utterances are averaged and the resulting mean score is compared with a decision threshold. Rather... more

Probabilistic Fusion of Sorted Score Sequences for Robust Speaker Verification

by Sun-yuan Kung

2025, Studies in Fuzziness and Soft Computing

Fusion techniques have been widely used in multi-modal biometric authentication systems. While these techniques are mainly applied to combine the outputs of modality-dependent classifiers, they can also be applied to fuse the decisions or... more

Multi-sample data-dependent fusion of sorted score sequences for biometric verification

by Sun-yuan Kung

2025, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing

In many biometric systems, the scores of multiple samples (e.g. utterances) are averaged and the average score is compared against a decision threshold for decision making. The average score, however, may not be optimal because the... more

Adaptive conditional pronunciation modeling using articulatory features for speaker verification

by Sun-yuan Kung

2025, SympoTIC '04. Joint 1st Workshop on Mobile Future & Symposium on Trends In Communications (IEEE Cat. No.04EX877)

This paper proposes an articulatory feature-based conditional pronunciation modeling (AFCPM) technique for speaker verification. The technique models the pronunciation behaviors of speakers by creating a link between the actual phones... more

Blind stochastic feature transformation for speaker verification over cellular networks

by Sun-yuan Kung

2025, Proceedings of 2004 International Symposium on Intelligent Multimedia, Video and Speech Processing, 2004.

Acoustic mismatch between the training and recognition conditions presents one of the serious challenges faced by speaker recognition researchers today. The goal of channel compensation is to achieve performance approaching that of a... more

Speaker verification using adapted articulatory feature-based conditional pronunciation modeling

by Sun-yuan Kung

2025, ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

This paper proposes a speaker verification system based on articulatory feature-based conditional pronunciation modeling (AFCPM). The system captures the pronunciation characteristics of speakers by modeling the linkage between the actual... more

Applying articulatory features to telephone-based speaker verification

by Sun-yuan Kung

2025, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing

This paper presents an approach that uses articulatory features (AFs) derived from spectral features for telephone-based speaker verification. To minimize the acoustic mismatch caused by different handsets, handset-specific normalization... more

Adaptive articulatory feature-based conditional pronunciation modeling for speaker verification

by Sun-yuan Kung

2025, Speech Communication

Because of the differences in education background, accents, etc., different persons have their unique way of pronunciation. This paper exploits the pronunciation characteristics of speakers and proposes a new conditional pronunciation... more

Blind Stochastic Feature Transformation for Channel Robust Speaker Verification

by Sun-yuan Kung

2025, Journal of VLSI signal processing systems for signal, image and video technology

To improve the reliability of telephone-based speaker verification systems, channel compensation is indispensable. However, it is also important to ensure that the channel compensation algorithms in these systems suppress channel... more

Extraction of Speaker Features from Different Stages of DSR Front-Ends for Distributed Speaker Verification

by Sun-yuan Kung

2025, Genetic Resources and Crop Evolution

The ETSI has recently published a front-end processing standard for distributed speech recognition systems. The key idea of the standard is to extract the spectral features of speech signals at the front-end terminals so that acoustic... more

A Hybrid Approach to Enhance the Security of Automated Teller Machine

by Sabarna Choudhury

2025

With the rise in crimes at Automated Teller Machines (ATMs), ATM security is at stake. Traditional security methods such as passwords or PINs have always been a cause of worry to users because they can be lost, stolen or... more

Speaker Verification Using Support Vector Machines and High-Level Features

by Douglas Reynolds

2025, IEEE Transactions on Audio, Speech and Language Processing

High-level characteristics such as word usage, pronunciation, phonotactics, prosody, etc., have seen a resurgence for automatic speaker recognition over the last several years. With the availability of many conversation sides per speaker... more

Prediction of Waveforms Under the Variation of Input Parameters Using Neural Networks

by Alex Prodan

2025

This paper introduces a modeling flow for predicting waveforms as a function of parameters, variables in the system generating the waveforms. In order to achieve this goal, a neural network is involved. The model is developed using... more

Authorship Verification based on Linguistic Features

by Charitha Dissanayake

2025

This thesis addresses the problem of authorship verification. Authorship verification is a subdomain of authorship analysis, and its origins lie in stylometry. However, most research in authorship analysis concerns authorship identification, while authorship verification remains rather unexplored. As the number of digital documents and authors grows, authorship identification solutions become very difficult to employ, and authorship verification solutions become necessary. This research uses digital documents of 1000 words, written in English, to address authorship verification: reaching a conclusion about the authorship of a disputed text by analyzing texts written by a candidate author. To solve this problem, three machine learning models were designed using two sets of linguistic features that are suggested to characterize a person's writing style, one comprising stylometric features and the other word-frequency-based features. One-class and two-class support vector machines are used as the machine learning models. Results suggest that a one-class support vector machine with selected stylometric features does not handle the problem well, while a two-class classification model with stylometric features, trained on a known-author class and an unknown-author class, shows potential if the unknown-author class can be properly represented. A one-class support vector machine with word-frequency-based features shows promising results in solving the authorship verification problem.

By conducting this research, I have developed an immense interest in stylometry analysis and natural language processing and gained my first experience of the research world. I would like to thank my supervisor, Dr. A. R. Weerasinghe, for introducing me to the project, advising me whenever needed, and trusting my capabilities. I would also like to thank Dr. H. Ekanayake for effectively coordinating the final year project in computer science and helping students in problematic situations. I am immensely grateful to my batchmates, who helped me in many ways and gave suggestions to improve the project. I am also thankful to my family, who gave me great support and courage to carry out the research and provided a suitable academic environment. Lastly, my appreciation goes to everyone who helped during this attempt.

Document: A digital file containing text entirely written by one person.
Known document: A document with prior knowledge of the person who has written it.
Unknown document: A document with no knowledge of the person who has written it.
Known author: The author who is suspected to have written the unknown document.
Unknown author: The author of the unknown document, if the unknown document is written by a different person than the known author.
Target class: The class that contains all documents with known authorship from a suspected author.
Outlier class: The class that contains all documents from authors other than the suspected author.

Multitaper Estimation of Frequency-Warped Cepstra With Application to Speaker Verification

by Patrick Flandrin

2025, IEEE Signal Processing Letters

Usually the mel-frequency cepstral coefficients are estimated either from a periodogram or from a windowed periodogram. We state a general estimator which also includes multitaper estimators. We propose approximations of the variance and... more

Presenting a New Text-Independent Speaker Verification System Based on Multi Model GMM

by Mohammad Mosleh

2025, Journal of Advances in Computer Research

Speaker verification is the process of accepting or rejecting a claimed identity based on the speaker's voice features. A speaker verification system can be used in numerous security systems, including bank account access, getting to security... more

Confidence measures for speaker segmentation and their relation to speaker verification

by Antonio Miguel

2025, Interspeech 2010

This paper addresses the problem of speaker verification in two speaker conversations, proposing a set of confidence measures to assess the quality of a given speaker segmentation. In addition we study how these measures can be used to... more

A Deep Learning Fusion Model Leveraging Spectral Features for Audio Deepfake Detection

by International Journal of Advanced Networking and Applications (IJANA)

2025, IJANA

Audio deepfakes, a subset of deepfake technology, employ machine learning or deep learning to create deceptive audio content by synthesizing authentic recordings. Such deepfakes not only foster the dissemination of misinformation but... more

Dealing with additive noise in speaker recognition systems based on i-vector approach

by D. Matrouf

2025, 2015 23rd European Signal Processing Conference (EUSIPCO)

In recent years, the i-vector approach has become the state of the art in speaker recognition systems. As in previous approaches, i-vector-based systems suffer greatly in the presence of additive noise, especially in low SNR cases. In this... more

Vérification du locuteur : variations de performance (Speaker verification: results variation) [in French]

by Solange Rossato

2025, Proceedings of the Joint Conference JEP-TALN-RECITAL 2012, volume 1: JEP

The performance gains in speaker verification over the last fifteen years are indisputable. Systems are increasingly reliable in the sense that EER or DCF rates decrease year after year. Nevertheless, it is necessary to... more
