Paper Information - Additional Evidence That Common Low-level Features Of Individual Audio Frames Are Not Representative Of Music Genre

The Bag-of-Frames (BoF) approach has been widely used in music genre classification. In this approach, music genres are represented by statistical models of low-level features computed on short frames (e.g. on the order of tens of milliseconds) of the audio signal. In designing such models, a common procedure is to represent each music genre by sets of instances (i.e. frame-based feature vectors) inferred from training data. The common underlying assumption is that the majority of such instances somehow capture the (musical) specificities of each genre, and that obtaining good classification performance is a matter of training dataset size and of fine-tuning the feature extraction and learning algorithm parameters. We report on extensive tests on two music databases that contradict this assumption. We show that there is little or no benefit in seeking a thorough representation of the feature vectors for each class. In particular, we show that genre classification performance is similar whether music pieces from a number of different genres are represented with a set of symbols derived from a single genre or from all the genres. We conclude that our experiments provide additional evidence for the hypothesis that common low-level features of isolated audio frames are not representative of music genres.
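As a rough illustration of the representation the abstract describes, the sketch below maps the frames of one piece onto a set of symbols drawn from a single genre and summarizes the piece as a histogram of symbol counts. It is not the authors' implementation: the random sampling of symbols, the Euclidean nearest-symbol assignment, the 13-dimensional synthetic features, and the names `build_codebook` and `bof_histogram` are all assumptions made for this example.

```python
import numpy as np

def build_codebook(frames, k, seed=0):
    """Pick k frame-level feature vectors at random to serve as symbols (assumed strategy)."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(frames), size=k, replace=False)
    return frames[idx]

def bof_histogram(frames, codebook):
    """Assign each frame to its nearest symbol and return normalized symbol counts."""
    # Squared Euclidean distance between every frame and every symbol.
    d2 = ((frames[:, None, :] - codebook[None, :, :]) ** 2).sum(axis=2)
    nearest = d2.argmin(axis=1)
    counts = np.bincount(nearest, minlength=len(codebook))
    return counts / counts.sum()

# Synthetic stand-ins for frame-level features (hypothetical: 13 values per frame).
rng = np.random.default_rng(42)
genre_a_frames = rng.normal(0.0, 1.0, size=(500, 13))  # training frames from one genre
piece_frames = rng.normal(0.5, 1.0, size=(200, 13))    # frames of a piece from another genre

# Symbols derived from a single genre, then used to describe a piece of any genre.
codebook_single = build_codebook(genre_a_frames, k=16)
hist = bof_histogram(piece_frames, codebook_single)
```

Building a second codebook from frames pooled over all genres and comparing classifiers trained on the two sets of histograms would mirror, in spirit, the single-genre versus all-genres comparison mentioned above.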

Fabien Gouyon | Thibault Langlois | Miguel Lopes | Mohamed Sordo | Gonçalo Marques
