Datasets


I have collected several music datasets, especially for Chinese music:

  • MGD: Large Scale Chinese Folk Songs Dataset (2014-2023)

    Over 31000 Chinese folk songs in symbolic format.

  • MGDplus (2021-2022)

    1214 Selected symbolic Han Chinese folk songs from seven regions with five annotations, partially audio-score aligned. Some examples can be found here.

  • Music Emotion Dataset (2021-2022)

    Music Emotion Recognition (MER) has recently received considerable attention. We developed the MER1101 Music Emotion Dataset to support the MER research requiring large music content libraries. It is a dataset that explores the relationship between music and human emotion. It is based on Russell's valence-arousal emotion model and contains 1101 music audio gathered from the internet, each ranging from 16.5 seconds to 125.5 seconds.

  • Singing Quality Evaluation Dataset (2023-present, ongoing)