Singer identification for Indian singers using convolutional neural networks

dc.contributor.authorVishnu Srinivasa Murthy, Y.V.S.
dc.contributor.authorKoolagudi, S.G.
dc.contributor.authorJeshventh Raja, T.K.
dc.date.accessioned2026-02-05T09:26:52Z
dc.date.issued2021
dc.description.abstractSinger identification is one of the important aspects of music information retrieval (MIR). In this work, traditional feature-based and trending convolutional neural network (CNN) based approaches are considered and compared for identifying singers. Two different datasets, namely artist20 and the Indian popular singers’ database with 20 singers are used in this work to evaluate proposed approaches. Cepstral features such as Mel-frequency cepstral coefficients (MFCCs) and linear prediction cepstral coefficients (LPCCs) are considered to represent timbre information. Shifted delta cepstral (SDC) features are also computed beside the cepstral coefficients to capture temporal information. In addition, chroma features are computed from 12 semitones of a musical octave, overall forming a 46-dimensional feature vector. Experiments are conducted with different feature combinations, and suitable features are selected using the genetic algorithm-based feature selection (GAFS) approach. Two different classification techniques, namely artificial neural networks (ANNs) and random forest (RF), are considered on the features mentioned above. Further, spectrograms and chromagrams of audio clips are directly fed to CNN for classification. The singer identification results obtained using CNNs seem to be better than the traditional isolated and ensemble classifiers. Average accuracy of around 75% is observed with CNN in the case of Indian popular singers database. Whereas, on artist20 dataset, the proposed configuration of feature-based approach and CNN could not give better than 60% accuracy. © 2021, The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature.
dc.identifier.citationInternational Journal of Speech Technology, 2021, 24, 3, pp. 781-796
dc.identifier.issn13812416
dc.identifier.urihttps://doi.org/10.1007/s10772-021-09849-5
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/23113
dc.publisherSpringer
dc.subjectConvolution
dc.subjectDecision trees
dc.subjectGenetic algorithms
dc.subjectCepstral coefficients
dc.subjectClassification technique
dc.subjectEnsemble classifiers
dc.subjectFeature based approaches
dc.subjectLinear prediction cepstral coefficient (LPCCs)
dc.subjectMel-frequency cepstral coefficients
dc.subjectMusic information retrieval
dc.subjectTemporal information
dc.subjectConvolutional neural networks
dc.titleSinger identification for Indian singers using convolutional neural networks

Files

Collections