Please use this identifier to cite or link to this item:
Title: Multi-Modal Music Emotion Recognition: A New Dataset, Methodology and Comparative Analysis
Authors: Panda, Renato Eduardo Silva 
Malheiro, Ricardo 
Rocha, Bruno 
Oliveira, António Pedro
Paiva, Rui Pedro 
Keywords: music emotion recognition; machine learning; multi-modal analysis
Issue Date: 2013
Project: info:eu-repo/grantAgreement/FCT/5876-PPCDTI/102185/PT/MOODetector - A System for Mood-based Classification and Retrieval of Audio Music 
Serial title, monograph or event: 10th International Symposium on Computer Music Multidisciplinary Research (CMMR 2013)
Place of publication or event: Marseille, France
Abstract: We propose a multi-modal approach to the music emotion recognition (MER) problem, combining information from distinct sources, namely audio, MIDI and lyrics. We introduce a methodology for the automatic creation of a multi-modal music emotion dataset resorting to the AllMusic database, based on the emotion tags used in the MIREX Mood Classification Task. Then, MIDI files and lyrics corresponding to a sub-set of the obtained audio samples were gathered. The dataset was organized into the same 5 emotion clusters defined in MIREX. From the audio data, 177 standard features and 98 melodic features were extracted. As for MIDI, 320 features were collected. Finally, 26 lyrical features were extracted. We experimented with several supervised learning and feature selection strategies to evaluate the proposed multi-modal approach. Employing only standard audio features, the best attained performance was 44.3% (F-measure). With the multi-modal approach, results improved to 61.1%, using only 19 multi-modal features. Melodic audio features were particularly important to this improvement.
Rights: openAccess
Appears in Collections:I&D CISUC - Artigos em Livros de Actas

Show full item record

Page view(s)

checked on May 21, 2024


checked on May 21, 2024

Google ScholarTM


This item is licensed under a Creative Commons License Creative Commons