This paper describes the use of multi-way decomposition methods to efficiently summarize electroencephalographic (EEG) data. A space-frequency-time atomic decomposition was applied to EEG data recorded while subjects performed tasks... more
In this study, we propose an unsupervised classification scheme based on the Dempster-Shafer Theory (TDS) and the Dezert-Smarandache Theory (DSmT) to characterize vegetated, aquatic and mineral surfaces. From pre-processed ASTER satellite... more
Speaker verification using limited data is always a challenge for practical implementation as an application. An analysis on speaker verification studies for an i-vector based method using Mel-Frequency Cepstral Coefficient (MFCC) feature... more
The Quetta Syntaxis in western Baluchistan, Pakistan, is the result of an oroclinal bend of the western mountain belt and serves as a junction for different faults. As this area also lies close to the left-lateral strike-slip Chaman... more
In this paper we report on a number of speaker identification experiments that assume a phonetic-oriented segmentation scheme exists such as to motivate the extraction of psychoacoustically-motivated phase and pitch related features. MFCC... more
On 18 January 2017, the 2016–2017 central Italy seismic sequence reached the Campotosto area with four events with magnitude larger than 5 in three hours (major event MW 5.5). To study the slip behavior on the causative fault/faults we... more
BOLLETTINO SISMICO ITALIANO: ANALISYS OF EARLY AFTERSHOCKS OF THE 2016 MW 6.0 AMATRICE, MW 5.9 VISSO AND MW 6.5 NORCIA EARTHQUAKES IN CENTRAL ITALY B. Castello e Gruppo di Lavoro Bollettino Sismico Italiano (A. Nardi, A. Marchetti, F.M.... more
Source and excitation modeling in FDTD formulation has a significant impact on the method performance and the required simulation time. Since the abrupt source introduction yields intensive numerical variations in whole computational... more
In this paper, it is proposed to apply the Dempster-Shafer Theory (DST) or the theory of evidence to map vegetation, aquatic and mineral surfaces with a view to detecting potential areas of observation of outcrops of geological formations... more
This paper examines the association between the variability of the speech signal inside an analysis frame and the relative difficulty of classifying that frame. We introduce a novel measure of speech frame variability and show through... more
I wish to thank my primary supervisor Prof. Michael Wagner for his introducing me to speech as a biometric, and for his support, suggestions and guidance throughout the learning process that has been my doctoral studies. Thank you also to... more
In this paper, it is proposed to apply the Dempster-Shafer Theory (DST) or the theory of evidence to map vegetation, aquatic and mineral surfaces with a view to detecting potential areas of observation of outcrops of geological formations... more
Linked data continues to grow at a rapid rate, but a limitation of a lot of the data that is being published is the lack of a semantic description. There are tools, such as D2R, that allow a user to quickly convert a database into RDF,... more
Automatic voice pathology detection enables objective assessment of pathologies that affect the voice production mechanism. Detection systems have been developed using the traditional pipeline approach (consisting of the feature... more
Speaker Recognition is a multi-disciplinary branch of biometrics that may be used for identification, verification, and classification of individual speakers, with the capability of tracking, detection, and segmentation by extension.... more
In this paper, it is proposed to apply the Dempster-Shafer Theory (DST) or the theory of evidence to map vegetation, aquatic and mineral surfaces with a view to detecting potential areas of observation of outcrops of geological formations... more
In this paper, a novel excitation source-related feature set, viz., Teager Energy-based Mel Frequency Cepstral Coefficients (T-MFCC) is proposed for the task of spoken keyword detection. Experiments are carried out on TIMIT database for... more
In this paper, we elaborate on mobile phone identification from recorded speech signals. The goal is to extract intrinsic traces related to the mobile phone used to record a speech signal. Mel frequency cepstral coefficients (MFCCs) are... more
Motivated by the speaker-specificity and stationarity of subglottal acoustics, this paper investigates the utility of subglottal cepstral coefficients (SGCCs) for speaker identification (SID) and verification (SV). SGCCs can be computed... more
In this paper, it is proposed to apply the Dempster-Shafer Theory (DST) or the theory of evidence to map vegetation, aquatic and mineral surfaces with a view to detecting potential areas of observation of outcrops of geological formations... more
Semantic models of data sources and services provide support to automate many tasks such as source discovery, data integration, and service composition, but writing these semantic descriptions by hand is a tedious and time-consuming task.... more
On 18 January 2017, the 2016–2017 central Italy seismic sequence reached the Campotosto area with four events with magnitude larger than 5 in three hours (major event MW 5.5). To study the slip behavior on the causative fault/faults we... more
We propose a practical, feature-level fusion approach for combining acoustic and articulatory information in speaker verification task. We find that concatenating articulation features obtained from the measured speech production data... more
Speaker identification is a well-established research problem but has not been a major application used in gaming scenarios. In this paper, we propose a new algorithm for the open-set, text-independent, speaker ID problem, applied as an... more
In this paper, a novel excitation source-related feature set, viz., Teager Energy-based Mel Frequency Cepstral Coefficients (T-MFCC) is proposed for the task of spoken keyword detection. Experiments are carried out on TIMIT database for... more
In this paper, it is proposed to apply the Dempster-Shafer Theory (DST) or the theory of evidence to map vegetation, aquatic and mineral surfaces with a view to detecting potential areas of observation of outcrops of geological formations... more
In this paper, it is proposed to apply the Dempster-Shafer Theory (DST) or the theory of evidence to map vegetation, aquatic and mineral surfaces with a view to detecting potential areas of observation of outcrops of geological formations... more
Efficient computation of scalar optical diffraction field due to an object is an essential issue in holographic 3D television systems. The first step in the computation process is to construct an object. As a solution for this step, we... more
This paper describes an approach of speech recognition by using the Mel-Scale Frequency Cepstral Coefficients (MFCC) extracted from speech signal of spoken words. Principal Component Analysis is employed as the supplement in feature... more
Usually, speaker recognition systems do not take into account the short-term dependence between the vocal source and the vocal tract. A feasibility study that retains this dependence is presented here. A model of joint probability... more
Automatic speaker recognition system is used to recognize an unknown speaker among several reference speakers by making use of speaker-specific information from their speech. In this paper, we introduce a novel, hierarchical,... more
Our initial speaker verification study exploring the impact of mismatch in training and test conditions finds that the mismatch in sensor and acoustic environment results in significant performance degradation compared to other mismatches... more
Abstract—The crosslingual voice conversion problem refers to the replacement of a speaker’s timbre or vocal identity in a recorded sentence, assuming that the source speaker and target speaker use different languages. This problem differs... more
This study proposes a new method of fitting a glottal model to the glottal flow estimate using system identification (SI) algorithms. Each period of the glottal estimate is split into open and closed phases and each phase is modelled as... more
The possibility to discriminate between speech and music signals by using a feature based on low frequency modulation has been investigated. Three different low frequency modulation parameters have been extracted and tested concerning the... more
Various algorithms for text-independent speaker recognition have been developed through the decades, aiming to improve both accuracy and efficiency. This paper presents a novel PCA/LDA-based approach that is faster than traditional... more
The cross lingual voice conversion problem refers to the replacement of a speaker's timbre or vocal identity in a recorded sentence, assuming that the source speaker and target speaker use different languages. This problem differs... more
Various algorithms for text-independent speaker recognition have been developed through the decades, aiming to improve both accuracy and efficiency. This paper presents a novel PCA/LDA-based approach that is faster than traditional... more
VAD is a reason for the trouble of discrimination between external noise and voice. VAD is an issue and for that reason various techniques have been suggested. Some are based upon power spectral density derived characteristics, and others... more
We study robust pitch synchronous parameters that are derived from envelope and instantaneous frequencies estimated via a bank of cochlear filters. Closed set Speaker Identification experiments are performed on the SPIDRE corpus with... more
In the source-filter model of speech production, physiologically, the source corresponds to the vocal fold vibrations and the filter corresponds to the spectrum-shaping vocal tract. Vocal tract-based features like the mel-frequency... more
Additional information is available at the end of the chapter
ABSTRACT This paper addresses the problem of pitch modification, as an important module for an efficient voice transformation system. The Deterministic plus Stochastic Model of the residual signal we proposed in a previous work is... more
Abstract Voice source analysis and modelling has played a key role in important speech applications such as speech recognition, speech synthesis and speaker recognition. This work presents a robust algorithm for glottal closure detection... more
Abstract This paper presents a data-driven approach to the modelling of voice source waveforms. The voice source is a signal that is estimated by inverse-filtering speech signals with an estimate of the vocal tract filter. It is used in... more
Recent years have seen an explosion of interest in using neural oscillations to characterize the mechanisms supporting cognition and emotion. Oftentimes, oscillatory activity is indexed by mean power density in predefined frequency bands.... more