Crossover based technique for data augmentation
| dc.contributor.author | Raj, R. | |
| dc.contributor.author | Mathew, J. | |
| dc.contributor.author | Kannath, S.K. | |
| dc.contributor.author | Rajan, J. | |
| dc.date.accessioned | 2026-02-04T12:28:03Z | |
| dc.date.issued | 2022 | |
| dc.description.abstract | Background and Objective: Medical image classification problems are frequently constrained by the availability of datasets. “Data augmentation” has come as a data enhancement and data enrichment solution to the challenge of limited data. Traditionally data augmentation techniques are based on linear and label preserving transformations; however, recent works have demonstrated that even non-linear, non-label preserving techniques can be unexpectedly effective. This paper proposes a non-linear data augmentation technique for the medical domain and explores its results. Methods: This paper introduces “Crossover technique”, a new data augmentation technique for Convolutional Neural Networks in Medical Image Classification problems. Our technique synthesizes a pair of samples by applying two-point crossover on the already available training dataset. By this technique, we create N new samples from N training samples. The proposed crossover based data augmentation technique, although non-label preserving, has performed significantly better in terms of increased accuracy and reduced loss for all the tested datasets over varied architectures. Results: The proposed method was tested on three publicly available medical datasets with various network architectures. For the mini-MIAS database of mammograms, our method improved the accuracy by 1.47%, achieving 80.15% using VGG-16 architecture. Our method works fine for both gray-scale as well as RGB images, as on the PH2 database for Skin Cancer, it improved the accuracy by 3.57%, achieving 85.71% using VGG-19 architecture. In addition, our technique improved accuracy on the brain tumor dataset by 0.40%, achieving 97.97% using VGG-16 architecture. Conclusion: The proposed novel crossover technique for training the Convolutional Neural Network (CNN) is painless to implement by applying two-point crossover on two images to form new images. The method would go a long way in tackling the challenges of limited datasets and problems of class imbalances in medical image analysis. Our code is available at https://github.com/rishiraj-cs/Crossover-augmentation © 2022 | |
| dc.identifier.citation | Computer Methods and Programs in Biomedicine, 2022, 218, , pp. - | |
| dc.identifier.issn | 1692607 | |
| dc.identifier.uri | https://doi.org/10.1016/j.cmpb.2022.106716 | |
| dc.identifier.uri | https://idr.nitk.ac.in/handle/123456789/22583 | |
| dc.publisher | Elsevier Ireland Ltd | |
| dc.subject | Classification (of information) | |
| dc.subject | Convolution | |
| dc.subject | Convolutional neural networks | |
| dc.subject | Image enhancement | |
| dc.subject | Linear transformations | |
| dc.subject | Medical imaging | |
| dc.subject | Network architecture | |
| dc.subject | Augmentation techniques | |
| dc.subject | Convolutional neural network | |
| dc.subject | Crossover | |
| dc.subject | Data augmentation | |
| dc.subject | Data enhancement | |
| dc.subject | Data enrichments | |
| dc.subject | Images classification | |
| dc.subject | Medical image classification | |
| dc.subject | Non linear | |
| dc.subject | Two-point | |
| dc.subject | Image classification | |
| dc.subject | Article | |
| dc.subject | brain tumor | |
| dc.subject | controlled study | |
| dc.subject | convolutional neural network | |
| dc.subject | diagnostic accuracy | |
| dc.subject | image analysis | |
| dc.subject | information processing | |
| dc.subject | mammography | |
| dc.subject | skin cancer | |
| dc.subject | factual database | |
| dc.subject | human | |
| dc.subject | Brain Neoplasms | |
| dc.subject | Data Management | |
| dc.subject | Databases, Factual | |
| dc.subject | Humans | |
| dc.subject | Mammography | |
| dc.subject | Neural Networks, Computer | |
| dc.title | Crossover based technique for data augmentation |
