Crossover based technique for data augmentation

dc.contributor.authorRaj, R.
dc.contributor.authorMathew, J.
dc.contributor.authorKannath, S.K.
dc.contributor.authorRajan, J.
dc.date.accessioned2026-02-04T12:28:03Z
dc.date.issued2022
dc.description.abstractBackground and Objective: Medical image classification problems are frequently constrained by the availability of datasets. “Data augmentation” has come as a data enhancement and data enrichment solution to the challenge of limited data. Traditionally data augmentation techniques are based on linear and label preserving transformations; however, recent works have demonstrated that even non-linear, non-label preserving techniques can be unexpectedly effective. This paper proposes a non-linear data augmentation technique for the medical domain and explores its results. Methods: This paper introduces “Crossover technique”, a new data augmentation technique for Convolutional Neural Networks in Medical Image Classification problems. Our technique synthesizes a pair of samples by applying two-point crossover on the already available training dataset. By this technique, we create N new samples from N training samples. The proposed crossover based data augmentation technique, although non-label preserving, has performed significantly better in terms of increased accuracy and reduced loss for all the tested datasets over varied architectures. Results: The proposed method was tested on three publicly available medical datasets with various network architectures. For the mini-MIAS database of mammograms, our method improved the accuracy by 1.47%, achieving 80.15% using VGG-16 architecture. Our method works fine for both gray-scale as well as RGB images, as on the PH2 database for Skin Cancer, it improved the accuracy by 3.57%, achieving 85.71% using VGG-19 architecture. In addition, our technique improved accuracy on the brain tumor dataset by 0.40%, achieving 97.97% using VGG-16 architecture. Conclusion: The proposed novel crossover technique for training the Convolutional Neural Network (CNN) is painless to implement by applying two-point crossover on two images to form new images. The method would go a long way in tackling the challenges of limited datasets and problems of class imbalances in medical image analysis. Our code is available at https://github.com/rishiraj-cs/Crossover-augmentation © 2022
dc.identifier.citationComputer Methods and Programs in Biomedicine, 2022, 218, , pp. -
dc.identifier.issn1692607
dc.identifier.urihttps://doi.org/10.1016/j.cmpb.2022.106716
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/22583
dc.publisherElsevier Ireland Ltd
dc.subjectClassification (of information)
dc.subjectConvolution
dc.subjectConvolutional neural networks
dc.subjectImage enhancement
dc.subjectLinear transformations
dc.subjectMedical imaging
dc.subjectNetwork architecture
dc.subjectAugmentation techniques
dc.subjectConvolutional neural network
dc.subjectCrossover
dc.subjectData augmentation
dc.subjectData enhancement
dc.subjectData enrichments
dc.subjectImages classification
dc.subjectMedical image classification
dc.subjectNon linear
dc.subjectTwo-point
dc.subjectImage classification
dc.subjectArticle
dc.subjectbrain tumor
dc.subjectcontrolled study
dc.subjectconvolutional neural network
dc.subjectdiagnostic accuracy
dc.subjectimage analysis
dc.subjectinformation processing
dc.subjectmammography
dc.subjectskin cancer
dc.subjectfactual database
dc.subjecthuman
dc.subjectBrain Neoplasms
dc.subjectData Management
dc.subjectDatabases, Factual
dc.subjectHumans
dc.subjectMammography
dc.subjectNeural Networks, Computer
dc.titleCrossover based technique for data augmentation

Files

Collections