Attention-based CNN-BiGRU for Bengali Music Emotion Classification

Authors

  • Subhasish Ghosh Lecturer, Department of Computer Science and Engineering, BGC Trust University Bangladesh
  • Md. Omar Faruk Riad Student, Department of Computer science and engineering, BGC Trust University Bangladesh

DOI:

https://doi.org/10.33022/ijcs.v11i3.3111

Keywords:

Conv1D, BiGRU, MFCCs, Bahdanau attention mechanism, Bengali music classification

Abstract

For Bengali music emotion classification, deep learning models, particularly CNN and RNN are frequently used. To extract meaningful knowledge, however, past studies' shortcomings of low accuracy and overfitting have to be addressed. We have proposed a model combining Conv1D, Bi-GRU and the Bahdanau attention mechanism for music emotion classification of our Bengali music dataset. The model integrates distinct MFCCs wav preprocessing methods with deep learning methods and attention-based methods. The attention mechanism has increased the accuracy of the proposed classification model. The music is finally classified into one of the four emotion classes: Angry, Happy, Relax, Sad. The proposed Conv1D+BiGRU+Attention model is validated as more effective and efficient at classifying emotions in the Bengali music dataset than baseline methods, according to comparisons with baseline models. For our Bengali music dataset, the performance of our proposed model is 95%.

Downloads

Published

31-12-2022