Music Emotion Classification Research Papers

Does Always the Phrygian Mode Elicit Responses of Negative Valence?

2025

In this paper the question of whether the Phrygian mode is always associated with perceived emotional responses of negative valence is looked into. To this end, we carried out a series of experiments. Music from two musical traditions... more

descriptionView Paper arrow_downwardDownload

Semantic structures of timbre emerging from social and acoustic descriptions of music

by Tuomas Eerola

2025, EURASIP Journal on Audio, Speech, and Music Processing

The perceptual attributes of timbre have inspired a considerable amount of multidisciplinary research, but because of the complexity of the phenomena, the approach has traditionally been confined to laboratory conditions, much to the... more

descriptionView Paper arrow_downwardDownload

Predicting Emotional Prosody of Music with High-Level Acoustic Features

by Tuomas Eerola

2025, nics.unicamp.br

BACKGROUND The automatic prediction of emotional content in music is nowadays a growing area of interest. Several algorithms have been developed to retrieve music features and computational models using these features are continuously... more

descriptionView Paper arrow_downwardDownload

Automatic music emotion classification using artificial neural network based on vocal and instrumental sound timbres

by Nurlaila Binti Rosli

2025, New Trends in Software Methodologies, Tools and Techniques

Detecting emotion features in a song remains as a challenge in various area of research especially in Music Emotion Classification (MEC). In order to classify selected song with certain mood or emotion, the algorithms of the machine... more

descriptionView Paper arrow_downwardDownload

A facial repertoire for avatars

by Zsófia Ruttkay

2025, Proceedings of the …

Facial expressions are becoming more and more important in today's computer systems with humanoid user interfaces. Avatars have become popular, however their facial communication is usually limited. This is partly due to the fact that... more

descriptionView Paper arrow_downwardDownload

Does Always the Phrygian Mode Elicit Responses of Negative Valence?

by Manuel Tizón Díaz

2025

In this paper the question of whether the Phrygian mode is always associated with perceived emotional responses of negative valence is looked into. To this end, we carried out a series of experiments. Music from two musical traditions... more

descriptionView Paper arrow_downwardDownload

Dimensional emotion driven facial expression synthesis based on the multi-stream DBN model

by Hichem Sahli

2024, Proceedings of the 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference

This paper proposes a dynamic Bayesian network (DBN) based MPEG-4 compliant 3D facial animation synthesis method driven by the (Evaluation, Activation) values in the continuous emotion space. For each emotion, a state synchronous DBN... more

descriptionView Paper arrow_downwardDownload

Predicting Emotional Prosody of Music with High-Level Acoustic Features

by José Fornari

2024, nics.unicamp.br

BACKGROUND The automatic prediction of emotional content in music is nowadays a growing area of interest. Several algorithms have been developed to retrieve music features and computational models using these features are continuously... more

descriptionView Paper arrow_downwardDownload

Emotional state recognition in speech signal

by Dawid Krawczyk

2024, Advances in Science, Technology and Engineering Systems Journal

The matters regarding speech signal processing and analyzing in terms of emotional states recognition were presented in this paper. An experiment was conducted to perform both objective and subjective emotional states recognition tests... more

descriptionView Paper arrow_downwardDownload

Application of Deep Learning Using Convolutional Neural Network (CNN) Method For Women’s Skin Classification

by Nilam cahya

2024, Scientific Journal of Informatics

Facial skin is skin that protects the inside of the face such as the eyes, nose, mouth, and others. Facial skin consists of several types, including normal skin, oily skin, dry skin, and combination skin. This is a problem for women... more

descriptionView Paper arrow_downwardDownload

Emotion Recognition From Different Types of Music From Different Cultures

by Zekeriya Tüfekci

2024, Çukurova Üniversitesi Mühendislik-Mimarlık Fakültesi Dergisi

Bu çalışmada, klasik makine öğrenme yöntemleri farklı kültürlere ait farklı türdeki müziklerden oluşmuş veri tabanları üzerinde duygu tanıması yapmak için kullanılmışlardır. Bu veri tabanlarında bulunan müziklerden öznitelik çıkarmak için... more

descriptionView Paper arrow_downwardDownload

Novelty and Cultural Evolution in Modern Popular Music

by Katherine O'Toole

2024, arXiv (Cornell University)

The ubiquity of digital music consumption has made it possible to extract information about modern music that allows us to perform large scale analysis of stylistic change over time. In order to uncover underlying patterns in cultural... more

descriptionView Paper arrow_downwardDownload

Speech Emotion Recognition of Sanskrit Language using Machine Learning

by Sujay Kakodkar

2024, International Journal of Computer Applications

A modern development in technology is Speech Emotion Recognition (SER). SER in partnership with Humane-Machine interaction (HMI) has advanced machine intelligence. An emotion precise HMI is designed by integrating speech processing and... more

descriptionView Paper arrow_downwardDownload

Subjective Emotional Responses to Musical Structure, Expression and Timbre Features: A Synthetic Approach

by Sylvain Le Groux

2024

Music appears to deeply affect emotional, cerebral and physiological states, and its effect on stress and anxiety has been established using a variety of self-report, physiological, and observational means. Yet, the relationship between... more

descriptionView Paper arrow_downwardDownload

Detection of Happiness Emotion on Images

by Beyza Akca

2024, Academic Perspective Procedia

Günümüzde bilgisayar kullanımı yaygınlaştıkça insan-bilgisayar etkileşimi üzerine yenilikçi çalışmalar hız kazanmıştır. Bu yeniliklerden biri, insanların duygusal durumlarının bilgisayarlı sistemler tarafından belirlenmesidir. Bu... more

descriptionView Paper arrow_downwardDownload

Automated Extraction of Features from Arabic Emotional Speech Corpus

by mohamed meddeb

2024

This paper presents the principal phase of extraction and recognition of the basic emotions in the Arabic speech applied to five emotional states were taken into effect; neutral, sadness, fear, anger and happiness. Emotional speech... more

descriptionView Paper arrow_downwardDownload

Quantitative comparison of motion history image variants for video-based depression assessment

by Awais Awais

2024, EURASIP Journal on Image and Video Processing

Depression is the most prevalent mood disorder and a leading cause of disability worldwide. Automated video-based analyses may afford objective measures to support clinical judgments. In the present paper, categorical depression... more

descriptionView Paper arrow_downwardDownload

On Constrained Local Model Feature Normalization for Facial Expression Recognition

by Christine Lisetti

2024, Lecture Notes in Computer Science

Real time user independent facial expression recognition is important for virtual agents but challenging. However, since in real time recognition users are not necessarily presenting all the emotions, some proposed methods are not... more

descriptionView Paper arrow_downwardDownload

Impression Determination of Batik Image Cloth by Multilabel Ensemble Classification Using Color Difference Histogram Feature Extraction

by Anny Yuniarti

2024, Jurnal Ilmiah Kursor

Hampir setiap orang akan memperhatikan impresi busana yang dipakai, termasuk busana dengan motif batik. Namun, perpaduan berbagai motif dan warna batik memberikan impresi yang beragam. Sehingga, penentuan impresi dari satu kain batik... more

descriptionView Paper arrow_downwardDownload

A Classifier Model based on the Features Quantitative Analysis for Facial Expression Recognition

by Md Jan Nordin

2024, International Journal on Advanced Science, Engineering and Information Technology

In recent decades computer technology has considerable developed in use of intelligent systems for classification. The development of HCI systems is highly depended on accurate understanding of emotions. However, facial expressions are... more

descriptionView Paper arrow_downwardDownload

Regularization and kernel parameters optimization based on PSO algorithm in EEG signals classification with SVM

by ismail gökhan gürsoy

2023, 2011 IEEE 19th Signal Processing and Communications Applications Conference (SIU)

Beyin fonksiyonları ile ilgili olarak EEG işaretleri birçok bilgi içermektedir. EEG işaretlerinin dalga biçimleri diğer beyin işaretleri ile benzerlik göstermektedir. Bu çalışmada sunulan yöntemde, önce EEG işaretlerine öz bağlanımlı... more

descriptionView Paper arrow_downwardDownload

Nöromüsküler hastalıkların yapay zeka yöntemleri ile sınıflandırılması

by Hanife Küçük

2023, Journal of The Faculty of Engineering and Architecture of Gazi University

• MUAP (Motor Unit Action Potential) clustering with hybrid structure • Use of multiple attribute vectors • Classification of neuromuscular diseases by artificial intelligence methods In this study, a classification structure consisting... more

descriptionView Paper arrow_downwardDownload

Music emotion classification for Turkish songs using lyrics

by Hanife Kebapci

2023, Pamukkale University Journal of Engineering Sciences

Music has grown into an important part of people's daily lives. As we move further into the digital age in which a large collection of music is being created daily and becomes easily accessible renders people to spend more time on... more

Music has grown into an important part of people's daily lives. As we move further into the digital age in which a large collection of music is being created daily and becomes easily accessible renders people to spend more time on activities that involve music. Consequently, the form of music retrieval is changed from catalogue based searches to searches made based on emotion tags in order for easy and effective musical information access. In this study, it is aimed to generate a model for automatic recognition of the perceived emotion of songs with the help of their lyrics and machine learning algorithms. For this purpose, first 300 songs are selected and annotated by human taggers with respect to their perceived emotions. Thereafter, Unigram, Bigram and Trigram word features are extracted from song lyrics after performing text preprocessing where stemming of the Turkish words is an essential part. Then, term by document matrices are created where term frequencies and tf-idf scores are considered as representations for the indices. Five different classification algorithms are fed with these matrices in order to find the best combination that achieves the highest accuracy results where recall and precision values are used as comparison metrics. As a result, best accuracy results are obtained by using Multinomial Naïve Bayes classifier where Unigram features are used to create the term by document matrix. In this setting, Unigram features are stemmed by Zemberek Long stemming method, and the index representation is chosen as term frequency. For this combination, obtained recall and precision values are 43.7 and 46.9, respectively. Müzik insanlık tarihinde önemli bir yere sahiptir. Özellikle dijital çağda kişiler tarafından her gün yaratılan ve ulaşılan müzik koleksiyonlarının büyüklüğü ile müziğin önemi daha da artmış ve insanlar müzik içeren aktivitelere daha fazla zaman ayırmaya başlamışlardır. Bununla birlikte, müziğe bilgi geri getirim sürecini kolay ve etkin hale getirmek için yapılan katalog bazlı aramalar duygu tabanlı etiketlere göre aramalara dönüşmüştür. Bu araştırmada amacımız şarkı sözlerine göre bir şarkıdan algılanan duygunun otomatik olarak çıkarıldığı bir model geliştirmektir. Model metin bazlı sınıflandırma için kullanılan makina öğrenmesi algoritmaları ile oluşturulmuştur. Bu amaçla araştırmada 300 şarkı seçilmiş ve bu şarkılar kişiler tarafından hissedilen duygularına göre etiketlenmiştir. Devamında metin ön analizi ile şarkı sözleri Türkçe köklerine ayrıştırılarak Unigram, Bigram ve Trigram kelime özellikleri çıkartılmıştır. Ardından endeksleri terim sıklığı ve tf-idf değerleri olan doküman bazında terim matrisleri yaratılmıştır. Bu matris değerleri 5 farklı sınıflandırma algoritmasına girdi olarak verilerek en yüksek doğruluk sonuçları, hatırlama ve kesinlik metrikleri üzerinden araştırılmıştır. Araştırmanın sonucunda en yüksek kesinlik değeri Zemberek Uzun Kök Ayıştırma Metodu ile Unigram kelime özelliklerine göre ayrıştırılmış ve endeksi terim sıklığına göre belirlenmiş terim bazlı doküman matrisinin Katlıterim Naïve Bayes kümeleyicisinde verdiği görülmüştür. Bu kombinasyonda hatırlama metriği değeri 43.7 iken kesinlik metriği değeri 46.9'dur.

descriptionView Paper arrow_downwardDownload

Music emotion classification for Turkish songs using lyrics

by Hanife Kebapci

2023, Pamukkale University Journal of Engineering Sciences

Music has grown into an important part of people's daily lives. As we move further into the digital age in which a large collection of music is being created daily and becomes easily accessible renders people to spend more time on... more

Music has grown into an important part of people's daily lives. As we move further into the digital age in which a large collection of music is being created daily and becomes easily accessible renders people to spend more time on activities that involve music. Consequently, the form of music retrieval is changed from catalogue based searches to searches made based on emotion tags in order for easy and effective musical information access. In this study, it is aimed to generate a model for automatic recognition of the perceived emotion of songs with the help of their lyrics and machine learning algorithms. For this purpose, first 300 songs are selected and annotated by human taggers with respect to their perceived emotions. Thereafter, Unigram, Bigram and Trigram word features are extracted from song lyrics after performing text preprocessing where stemming of the Turkish words is an essential part. Then, term by document matrices are created where term frequencies and tf-idf scores are considered as representations for the indices. Five different classification algorithms are fed with these matrices in order to find the best combination that achieves the highest accuracy results where recall and precision values are used as comparison metrics. As a result, best accuracy results are obtained by using Multinomial Naïve Bayes classifier where Unigram features are used to create the term by document matrix. In this setting, Unigram features are stemmed by Zemberek Long stemming method, and the index representation is chosen as term frequency. For this combination, obtained recall and precision values are 43.7 and 46.9, respectively. Müzik insanlık tarihinde önemli bir yere sahiptir. Özellikle dijital çağda kişiler tarafından her gün yaratılan ve ulaşılan müzik koleksiyonlarının büyüklüğü ile müziğin önemi daha da artmış ve insanlar müzik içeren aktivitelere daha fazla zaman ayırmaya başlamışlardır. Bununla birlikte, müziğe bilgi geri getirim sürecini kolay ve etkin hale getirmek için yapılan katalog bazlı aramalar duygu tabanlı etiketlere göre aramalara dönüşmüştür. Bu araştırmada amacımız şarkı sözlerine göre bir şarkıdan algılanan duygunun otomatik olarak çıkarıldığı bir model geliştirmektir. Model metin bazlı sınıflandırma için kullanılan makina öğrenmesi algoritmaları ile oluşturulmuştur. Bu amaçla araştırmada 300 şarkı seçilmiş ve bu şarkılar kişiler tarafından hissedilen duygularına göre etiketlenmiştir. Devamında metin ön analizi ile şarkı sözleri Türkçe köklerine ayrıştırılarak Unigram, Bigram ve Trigram kelime özellikleri çıkartılmıştır. Ardından endeksleri terim sıklığı ve tf-idf değerleri olan doküman bazında terim matrisleri yaratılmıştır. Bu matris değerleri 5 farklı sınıflandırma algoritmasına girdi olarak verilerek en yüksek doğruluk sonuçları, hatırlama ve kesinlik metrikleri üzerinden araştırılmıştır. Araştırmanın sonucunda en yüksek kesinlik değeri Zemberek Uzun Kök Ayıştırma Metodu ile Unigram kelime özelliklerine göre ayrıştırılmış ve endeksi terim sıklığına göre belirlenmiş terim bazlı doküman matrisinin Katlıterim Naïve Bayes kümeleyicisinde verdiği görülmüştür. Bu kombinasyonda hatırlama metriği değeri 43.7 iken kesinlik metriği değeri 46.9'dur.

descriptionView Paper arrow_downwardDownload

Music-induced emotions can be predicted from a combination of brain activity and acoustic features

by Prof Eduardo R. Miranda

2023, Brain and cognition

It is widely acknowledged that music can communicate and induce a wide range of emotions in the listener. However, music is a highly-complex audio signal composed of a wide range of complex time- and frequency-varying components.... more

descriptionView Paper arrow_downwardDownload

Children’s prototypic facial expressions during emotion-eliciting conversations with their mothers

by Amy Halberstadt

2023, Emotion

Despite theoretical claims that emotions are primarily communicated through prototypic facial expressions, empirical evidence is surprisingly scarce. This study aimed to: (1) test whether children produced more components of a prototypic... more

descriptionView Paper arrow_downwardDownload

Salp Sürü Algoritması ile Öznitelik Seçimi ve Sınıflandırıcı Performans Değerlendirmesi

by Celal Can

2023, European Journal of Science and Technology

Öz Son yıllarda doğadan esinlenen sürü tabanlı algoritmalar arasında yer alan Salp Sürü Algoritması oldukça popüler olmuştur. Bu çalışmada, Salp Sürü Algoritması kullanılarak farklı veri setleri üzerinde öznitelik seçimi yapılmış, farklı... more

descriptionView Paper arrow_downwardDownload

LSTM Network based Sentiment Analysis for Customer Reviews

by Fahrettin Horasan

2023, Politeknik dergisi

Provision of a new dataset to this field to work on it.  Showing the effects of noise normalization and preprocessing on the classification accuracy.  Representation of texts in vector form in various ways to be able to work on them. ... more

descriptionView Paper arrow_downwardDownload

Yapay sinir ağları ile görüntü işlemeye dayalı uzaklıktan bağımsız ağırlık tahmin sistemi: yumurta ve portakal örnekleri

by Umut Engin Ayten

2023, Journal of Geodesy and Geoinformation

Öz: Endüstriyel ve akademik çalışmalarda objelerin ağırlıklarının ölçülmesi oldukça önemli bir yere sahiptir. Bu nedenle gerçekleştirilmiş olan bu çalışmada yapay sinir ağları (YSA) kullanılarak görüntü işlemeye dayalı uzaklıktan ve... more

Öz: Endüstriyel ve akademik çalışmalarda objelerin ağırlıklarının ölçülmesi oldukça önemli bir yere sahiptir. Bu nedenle gerçekleştirilmiş olan bu çalışmada yapay sinir ağları (YSA) kullanılarak görüntü işlemeye dayalı uzaklıktan ve kamera açısından bağımsız ağırlık tahmini yapılması amaçlanmıştır. Yapay sinir ağı yapısı olarak ileri beslemeli çok katmanlı algılayıcı (multi-layer perceptron-MLP) ve radyal tabanlı fonksiyon (radial basis function-RBF) ağı kullanılmıştır. Ağırlığı tahmin edilecek obje olarak da portakal ve yumurta örnekleri belirlenmiştir. Bu örnekler ile sistemin eğitilmesi ve test edilmesi için; 4 farklı marka ve 4 farklı sınıf (çok büyükbüyük-orta-küçük) olacak şekilde 250 adet yumurta örneği ve farklı boyutlarda 150 adet portakal örneği seçilmiştir. Bu örnekler kullanılarak; yumurta için dik açı, pozitif açı ve negatif açı ile elde edilmiş 750 adet görüntü içeren, portakal için de dik açı, pozitif açı ve negatif açı ile elde edilmiş 450 adet görüntü içeren bir veri tabanı oluşturulmuştur. Oluşturulan bu ağırlık tahmin sistemi; bir adet kamera, yapay aydınlatma sistemi, yansıtıcılar ve referans görüntüden oluşmaktadır ve ayrıca ağırlık tahmin işlemi sırasında MATLAB programı ve araç kutuları kullanılmıştır. Bu çalışmada farklı öznitelik vektörleri, farklı açılardan çekilmiş görüntüler ve farklı YSA parametreleri test edilerek başarımı en yüksek olan sistemin kurulması hedeflenmiştir. Her bir değişiklik sonucu oluşturulan sistem beşer kez çalıştırılarak sonuçların aritmetik ortalaması alınmıştır. Ayrıca başarımı en yüksek olan denemenin, k-katlı çapraz doğrulama yöntemi ile de başarımı hesaplanmıştır. Hassas tartı ile yapılan ölçümlerde, Türk Gıda Kodeksi Yumurta Tebliği'ne göre belirlenmiş ve yumurta kutularının üzerinde yazan sınıflandırma değerlerine göre doğruluk oranı %47 iken, gerçekleştirilen bu çalışma sonucunda bu oran MLP'de %

descriptionView Paper arrow_downwardDownload

Soft Biometrics: Gender Recognition from Unconstrained Face Images using Local Feature Descriptor

by Salman Yussof

2023, arXiv (Cornell University)

Gender recognition from unconstrained face images is a challenging task due to the high degree of misalignment, pose, expression, and illumination variation. In previous works, the recognition of gender from unconstrained face images is... more

descriptionView Paper arrow_downwardDownload

Exploring Machine Learning Techniques for Music Emotion Recognition

by Bezal J Benny

2023

Music can be used to express a wide range of human emotions, from basic (e.g., pleasantness or unpleasantness dichotomies) to more complex emotions (e.g., transcendence or nostalgia). These emotions can be quantified by examining... more

Dimens factors ional psychometrics represent emotional perception by numerical fundamental plotted against emotion description axes. In 1980, Russell proposed an emotion model using two dimensions of fundamental factors, Valence (pleasantness, positive and negative affective states) and Arousal (activation, energy and stimulation levels) called the Circum of Arou for neut Arousa plex model of affect. Therefore, emotional states could be explained with the level sal and Valence in a circular plane. And if it is at the center of the graph, it stands ral level of Arousal or Valence and also both of them. The resulting VA (Valence- ) plane allows the placement of eight emotional adjectives as shown in Figure 1. The applications of this model are especially for affective states, emotional facial expressions and even music genre classification based on emotion. The model is one of the most efi ficient methods for quantifying emotions. Fig. 1 — Circumplex Model of Affect [2]

Chroma Energy Normalized (CENS) - A 12-element representation of the spectral energy where the bins represent the 12 equal-tempered pitch classes. Tempo - Average tempo of each song was used as a feature with frames separated by hop length of 512 samples.

Fig. 3 - Chroma Energy Normalized Mel-frequency Cepstral Coefficients (MFCCs) - Coefficients that collectively make up an MFC. They are derived from a type of cepstral representation of the audio clip (a nonlinear “spectrum-of- a-spectrum’’)

Spectral Centroid - A measure used in digital signal processing to characterize a spectrum. It indicates where the “center of mass” of the spectrum is.

Spectral Contrast - Developed to represent the spectral characteristics of a music piece. I considers the spectral peak and valley in each sub-band separately.

Spectral Roll-off - The Nth percentile of the power spectral distribution, where N is usually 85. Zero-Crossing Rate - A point in a digital audio file where the sample is at zero amplitude.

All the extracted features were normalized between 0 and | to avoid problems while combining the values, since the raw data is comprised of features with varying scales. The tempo feature of 6 instances were missing, so they were re-placed with the average tempo. All the features were collected, cleaned and stored in a CSV file.

where Nc is the number of examples where C = c and N is the number of total examples used for training. Calculating P(C = c) for all classes is easy using relative frequencies such that Gaussian Naive Bayes - One method to work with continuous attributes in the Naive Bayes classification is to use Gaussian distributions [14] to represent the likelihoods of the features conditioned on the classes. The parameters of Gaussian distributions can be obtained with

Fig. 13 — Feature Relevance Logistic Regression - In the classification experiments, the Multinomial Logistic Regression model is used to predict categorical placement in or the probability of category membership on a dependent variable based on multiple independent variables. The independent variables can be either dichotomous (i.e., binary) or continuous (1.e., interval or ratio in scale). Multinomial logistic regression can be described as an extension of binary logistic regression by allowing for more than two categories of the dependent variable. Like binary logistic regression, multinomial logistic regression uses maximum likelihood estimation to evaluate the probability of categorical membership.

The AdaBoost model used a Decision Tree Classifier (mac depth = 1) as its base estimator and produced a CA of 55%, which is the lowest accuracy out of the five tested classifiers. The LDA classifier produced a slightly higher accuracy of 57% on using the Singular Value Decomposition (SVD) solver. Using Least squares (LSqr) and eigen decomposition solvers produced significantly lower classification accuracies as expected. The Gaussian Naive Bayes classifier produced the same accuracies with the complete feature set as well as the selected feature subset with an accuracy of 61% and Table 2 shows its confusion matrix. The confusion matrices obtained from all the classification models consistently showed higher misclassifications for emotions falling into Quadrant 2 and Quadrant 4 and Figure 14 shows a similar trend where the two most relevant features, Chroma Mean and MFCC Mean, computed using Logistic Regression showed a high level of ambiguity. Meanwhile, Quadrants | and 3 which comprised of pieces with the highest and lowest Valence-Arousal annotations were easier to classify as shown in Figure 15.

When K value is equal to 11 or 12 and the split ratio for the training dataset is 0.9, the accuracy for K-Nearest Neighbor reached 71%. There is a big difference between the accuracy with K-Fold cross validation (57%) and the accuracy with the aforementioned conditions (71%).

Table. 1 — Confusion Matrix (MLP) In this section, the effectiveness of the emotion recognition models are evaluated in terms of accuracy. Coefficients o of relevant features ranked using Logistic Regression. Features were recursively removed with the RFE model. RFE was then used with the Logistic Regression model to classify every new feature- set. T he model was regu arized f all the models are very critical to performance. Figure 13 shows the set by using the L-BFGS solver, a standard quasi-Newton procedure to optimize smooth functions of many variables. The solvers began to converge after 200 iterations, giving a classification accuracy of 62%, which was the second highest of the five classi the ot! fiers that were used and t her classifiers as well. The Stochastic Gradient Descent model on the other hand produced a he new feature subset obtained was used to make predictions with different feature-set but was comparatively not as effective after using them on the classification mode classi fication. s. Both models showed t hat the Chroma Means were the most re evant feature for

Table. 3 — Classification Accuracies (CAs) for all classifiers For classification, all the models were trained and tested on the dataset using the Logistic Regression feature set and the results were evaluated using confusion matrices. The MLP classifier had 100 hidden layers and used a Rectified Linear Unit (ReLU) activation function returns (f(x) = max(0, x) for the hidden layers. Stochastic gradient-based optimizer was used to optimize the weights and finally the solver showed convergence after 600 iterations, giving the highest classification accuracy (CA) of 65%. Table 1 shows the confusion matrix obtained for the MLP classifier and Table 3 shows the CAs for all the classifiers that were tested.

descriptionView Paper arrow_downwardDownload

Emotica.AI - A Customer feedback system using AI

by Avijit Chaudhuri

2023, International Research Journal on Advanced Science Hub

Our lives are being significantly impacted by the rapid development of wireless technology and mobile gadgets on this day. The digital economy demands that services be developed almost instantly while also paying close attention to client... more

Emotica.AI - A Customer feedback system using AI Email: ayush.kumar.bar.official @ gmail.com Abstract Received: 14 February 2023 Accepted: 19 March 2023

of a rectangular neighbourhood is taken and used as the representative value for that neighbourhood. This helps to reduce the spatial dimensionality of the data and extract the most important features while preserving the most prominent patterns. In the cur- rent model, MaxPooling is being used with a win- dow size of 2x2 and 2x2 strides to further reduce the size of the features extracted by the convolutional layer. connected layers are some examples of the layers that may be used while building a CNN. Moreover, each layer’s characteristics, including the quantity and size of filters, must be supplied. The general architecture of the CNN is built using these layers and parameters, which affects how well it completes the task at hand.

perfectly or not. Through this rigorous training and

perform against same classes of emotion, we want tions in terms of similarities and differences or how

descriptionView Paper arrow_downwardDownload

Author Identification with Machine Learning Algorithms

by Feriştah Dalkılıç

2023, International Journal of Multidisciplinary Studies and Innovative Technologies

Author identification is one of the application areas of text mining. It deals with the automatic prediction of the potential author of an electronic text among predefined author candidates by using author specific writing styles. In this... more

descriptionView Paper arrow_downwardDownload

A novel robust feature extraction with GSO-optimized extreme learning for age-invariant face recognition

by Mr. Sonu Agrawal

2023, The Imaging Science Journal

This paper presents a novel age function modelling technique on the basis of the fusion of local features obtained by local texture descriptors. Initially, image normalization is performed and a feature extraction process is carried out.... more

descriptionView Paper arrow_downwardDownload

An Affective BCI Driven by Self-induced Emotions for People with Severe Neurological Disorders

by daniela iacoviello

2023, New Trends in Image Analysis and Processing – ICIAP 2017

Conditions of extreme neurological disability prevent any form of communication, even to show the emotional state. Brain Computer Interfaces (BCI) often use Electro-encephalography (EEG) measurements of the voluntary brain activity for... more

descriptionView Paper arrow_downwardDownload

Facial Signs and Psycho-physical Status Estimation for Well-being Assessment

by Dimitris Manousos

2023, Proceedings of the International Conference on Health Informatics

Stress and anxiety act as psycho-physical factors that increase the risk of developing several chronic diseases. Since they appear as early indicators, it is very important to be able to perform their evaluation in a contactless and... more

descriptionView Paper arrow_downwardDownload

Data analysis on music classification system and creating a sentiment word dictionary for Kokborok language

by Swapan Debbarma

2023, Journal of Ambient Intelligence and Humanized Computing

This work shows the development of a lexicon for a poorly resourced language, namely Kokborok. Kokborok is a regional language of North East India and offers an entirely new base for research in music information retrieval (MIR) field. We... more

Fig. 2. Havner’s mood taxonomy Fig. 1 Russell’s mood taxonomy

Fig. 3 Flow chart of proposed method The polarity information of any given word of the dictionary used as a feature set. After calculating the system perfor- mances, computational analysis is done on the results. We performed linear extrapolation of the data taken by both the feature set and found that for the dictionary, and TS features seem to converge, at 52% and 39% respectively in the limits of the number of songs goes to infinity. It has been shown in Fig. 3. The objective of this paper is to start this work and expand slowly to build a more extensive database for build- ing the sentimental dictionary since we just discovered in this paper that at present dictionary gives better performance than TS feature.

Fig.4 Creation of sentimental word dictionary from the holy bible

Fig.5 Creation of polarity annotated dataset for classification

Fig.6 Features taken from the sentimental word dictionary In this section of the paper, we developed a classification system for Kokborok song. In the system, lyrics are clas- sified by three types of polarities (positive or negative or neutral) that were assigned to a song after the interpretation of its consequent lyric. We create two classification systems separately, one is using only TS Features, and another is by mapping the dictionary words to a particular song. Process of features taken from dictionary is shown in Fig. 6. For

Fig.8 Comparison of accuracy rate by different algorithm for dic- tionary features

Fig. 7 Comparison of accuracy rate by different algorithm for TS fea- ture

Fig.9 Graphical representation of Linear Extrapolation of the data with both the Feature set It looks like all these methods have some limitations irrespective of different languages and may come from the technique used to input the data from the annotators.

Average accuracy rate 41% Table 2 Performance Evaluation for TS Feature

Table 3. Performance Evaluation for Dictionary Features Average accuracy rate 56%

Table 4 Confusion matrix for TS features

Table 5 Confusion matrix for Dictionary feature

descriptionView Paper arrow_downwardDownload

Quantitative comparison of motion history image variants for video-based depression assessment

by Muhammad Awais

2023, EURASIP Journal on Image and Video Processing

Depression is the most prevalent mood disorder and a leading cause of disability worldwide. Automated video-based analyses may afford objective measures to support clinical judgments. In the present paper, categorical depression... more

descriptionView Paper arrow_downwardDownload

Speech-based emotion classification using multiclass SVM with hybrid kernel and thresholding fusion

by Rajani Muraleedharan

2023, 2012 IEEE Spoken Language Technology Workshop (SLT)

Emotion classification is essential for understanding human interactions and hence is a vital component of behavioral studies. Although numerous algorithms have been developed, the emotion classification accuracy is still short of what is... more

Fig. 1. The proposed emotion classification approach using OAA SVM with hybrid kernel and thresholding fusion.

Table 1. Hybrid kernel selection based on classifier-level recall (%). Table | shows the CL-recall values for the different ker- nels. For each OAA classifier, we choose the kernel with the highest CL-recall (numbers in bold in Table 1). For exam- ple, we choose the polynomial kernel for the ‘Happy or Not’ OAA classifier in our hybrid kernel approach. Note that when training individual SVM OAA classifiers, we make sure that there are a comparable number of training utterances for both classes, otherwise, the trained model will provide biased clas- sification results. For example, when training the ‘Happy or Not’ OAA classifier, ‘not happy’ utterances consist of all of the other five emotions. Therefore, we need to sample utter- ances from the five emotions with the same number as that of happy utterances. The selected kernel functions for the indi- vidual OAA classifiers are shown in Table 1.

Table 2. a) Comparison of classifier-level accuracy (%) of the results in [2] and the proposed approach; b) decision-level recall (%) of the pro- posed approach.

descriptionView Paper arrow_downwardDownload

Parametrisation and correlation analysis applied to music mood classification

by Bożena Kostek

2023, International Journal of Computational Intelligence Studies

The paper presents a study on music mood categorisation. First, a review of music mood models is presented. Then, the preparation of a set of music excerpts to be used in the experiments and music parametrisation is described. Next, some... more

igure 5 Expressions given by listeners to describe mood of a music track (see online version for colours) Notes: The last position in this graph represents the amount of expressions, which occurred only once for a given song. Example No. 24, genre: rock, artist: Within Temptation, album: The Silent Force, title: Destroyed.

Figure 1 Mood representation in Thayer’s model

Source: Russel (1980) Figure 2 Russell’s model of music mood presented on valence/arousal plane

sure 4 Expressions given by listeners to describe the mood of a music track (see online version for colours) Notes: The last position in this graph represents the amount of other expressions, which occurred only once for a given song. Example No. 28, genre: classical, artist: pearl jam, album: Big Fish-music from the Motion Picture, title: Man of the Hour.

gure 6 Expressions given by listeners to describe the mood of a music track (see online version for colours) Notes: The last position in this graph represents the amount of expressions, which occurred only once for a given song. Example No. 27, genre: opera and vocal, artist: Linda Eder, album: Soundtrack, title: Falling Slowly.

ure 7 Expressions given by listeners to describe the mood of a music track (see online version for colours) Notes: The last position in this graph represents the amount of expressions, which occurred only once for a given song. Example No. 17, genre: alternative rock, artist: Kings of Leon, album: Come Around Sundown, title: The End.

figure 8 Results of part B averaged for all subjects (see online version for colours) Note: Labels are marked in accordance with Table 2.

Figure 9 Music samples presented on energy/arousal plane with assigned genre

Xu and Wunsch (2009) showed that MIREX clusters might not be appropriate due to some semantic overlap between categories. Moreover, they have shown that both Hevner’s and MIREX representations have advantages and limitations when evaluated in a semantic mood space. The authors also found that basic emotions: happy, sad, angry and tender, are very relevant to social networks. Laurier et al. proposed a folskonomy representation with four clusters each containing 15 adjectives (Table 2). The adjectives are very similar to the categories proposed in the main emotion theories (Thayer, 1989). The theory is strongly related to the two-dimensional model of Russell’s concept. Clusters represent the four quadrants of the classical Valence-Arousal representation: The nine emotion clusters proposed by Schubert (2003)

‘able 2 Clusters of mood tags proposed by Laurier et al. (2009)

ble 3 List of the music tracks used in the experiment (continued)

The overall quantity of the most frequent adjectives in part A Examples of the four tracks, which are representatives of the above four trends, are presented in Figures 4 to 7.

Note: Mood is assigned in accordance to the Thayer’s energy/arousal model. Table 5 Results of the part B averaged for all of the subjects

Note: Mood is assigned in accordance to the Thayer’s energy/arousal model. Results of the part B averaged for all of the subjects (continued)

ble 6 Adjectives obtained during part A, grouped by part B classification (Thayer’s model)

Notes: Parameters are ordered according to correlation coefficient (from higher to lower values). The last presented values in the table responc to the least significantly correlated parameters according to t-student statistics.

Notes: Parameters are ordered according to the correlation coefficient (from higher to lower values). The last presented values in table respond to the least significantly correlated parameters according to t-student statistics.

able 9 Interclass inertia for longer and shorter vectors of parameters

descriptionView Paper arrow_downwardDownload

Facial emotion recognition with expression energy

by Bir Bhanu

2023, Proceedings of the 14th ACM international conference on Multimodal interaction

descriptionView Paper arrow_downwardDownload

Within and Across-Language Comparison of Vocal Emotions in Mandarin and English

by Yong-cheol Lee

2023, Applied Sciences

This study reports experimental results on whether the acoustic realization of vocal emotions differs between Mandarin and English. Prosodic cues, spectral cues and articulatory cues generated by electroglottograph (EGG) of five emotions... more

descriptionView Paper arrow_downwardDownload

Quantitative comparison of motion history image variants for video-based depression assessment

by Awais Awais

2023, EURASIP Journal on Image and Video Processing

Depression is the most prevalent mood disorder and a leading cause of disability worldwide. Automated video-based analyses may afford objective measures to support clinical judgments. In the present paper, categorical depression... more

descriptionView Paper arrow_downwardDownload

Running head : Emotional state classification in PD 1 Optimal set of EEG features for emotional state classification and trajectory visualization in Parkinson s disease

by ye htut

2023

In addition to classic motor signs and symptoms, individuals with Parkinsons disease (PD) are characterized by emotional deficits. Ongoing brain activity can be recorded as electroencephalograph (EEG) to discover the links between... more

descriptionView Paper arrow_downwardDownload

Salp Sürü Algoritması ile Öznitelik Seçimi ve Sınıflandırıcı Performans Değerlendirmesi

by Yasin KAYA

2023, European Journal of Science and Technology

Öz Son yıllarda doğadan esinlenen sürü tabanlı algoritmalar arasında yer alan Salp Sürü Algoritması oldukça popüler olmuştur. Bu çalışmada, Salp Sürü Algoritması kullanılarak farklı veri setleri üzerinde öznitelik seçimi yapılmış, farklı... more

descriptionView Paper arrow_downwardDownload

Detection of Happiness Emotion on Images

by Beyza Akca

2023, Academic Perspective Procedia

Günümüzde bilgisayar kullanımı yaygınlaştıkça insan-bilgisayar etkileşimi üzerine yenilikçi çalışmalar hız kazanmıştır. Bu yeniliklerden biri, insanların duygusal durumlarının bilgisayarlı sistemler tarafından belirlenmesidir. Bu... more

descriptionView Paper arrow_downwardDownload

Quantitative comparison of motion history image variants for video-based depression assessment

by awais awais

2023, EURASIP Journal on Image and Video Processing

Depression is the most prevalent mood disorder and a leading cause of disability worldwide. Automated video-based analyses may afford objective measures to support clinical judgments. In the present paper, categorical depression... more

descriptionView Paper arrow_downwardDownload

Application of Deep Learning Using Convolutional Neural Network (CNN) Method For Women’s Skin Classification

by Nilam Cahya

2023, Scientific Journal of Informatics

Facial skin is skin that protects the inside of the face such as the eyes, nose, mouth, and others. Facial skin consists of several types, including normal skin, oily skin, dry skin, and combination skin. This is a problem for women... more

descriptionView Paper arrow_downwardDownload

Editorial: Recent advances in EEG (non-invasive) based BCI applications

by Md Kafiul Islam

2023, Frontiers in Computational Neuroscience

Editorial on the Research Topic Recent advances in EEG (non-invasive) based BCI applications

descriptionView Paper arrow_downwardDownload

UvA-DARE (Digital Academic Repository) Multi-Emotion Detection in User-Generated Reviews

by Ed Tan

2023

Abstract. Expressions of emotion abound in user-generated content, whether it be in blogs, reviews, or on social media. Much work has been devoted to detecting and classifying these emotions, but little of it has acknowledged the fact... more

descriptionView Paper arrow_downwardDownload

Music Emotion Classification

Key research themes

1. How can dynamic, dimensionally-annotated datasets and benchmarks advance the evaluation and development of music emotion recognition systems?

2. What are the effective machine learning approaches for multilabel and multimodal classification of emotions induced or perceived in music?

3. How do music structural elements and compositional eras influence perceived musical emotions, and how can computational models incorporate these insights?

All papers in Music Emotion Classification

Music Emotion Classification

Key research themes

1. How can dynamic, dimensionally-annotated datasets and benchmarks advance the evaluation and development of music emotion recognition systems?

2. What are the effective machine learning approaches for multilabel and multimodal classification of emotions induced or perceived in music?

3. How do music structural elements and compositional eras influence perceived musical emotions, and how can computational models incorporate these insights?

Related Topics

All papers in Music Emotion Classification