An Effective Multi-Label Protein Sub-Chloroplast Localization Prediction by Skipped-Grams of Evolutionary Profiles Using Deep Neural Network

dc.contributor.authorBankapur, S.
dc.contributor.authorPatil, N.
dc.date.accessioned2026-02-05T09:26:22Z
dc.date.issued2022
dc.description.abstractChloroplast is one of the most classic organelles in algae and plant cells. Identifying the locations of chloroplast proteins in the chloroplast organelle is an important as well as a challenging task in deciphering their functions. Biological-based experiments to identify the Protein Sub-Chloroplast Localization (PSCL) is time-consuming and cost-intensive. Over the last decade, a few computational methods have been developed to predict PSCL in which earlier works assumed to predict only single-location; whereas, recent works are able to predict multiple-locations of chloroplast organelle. However, the performances of all the state-of-the-art predictors are poor. This article proposes a novel skip-gram technique to extract highly discriminating patterns from evolutionary profiles and a multi-label deep neural network to predict the PSCL. The proposed model is assessed on two publicly available datasets, i.e., Benchmark and Novel. Experimental results demonstrate that the proposed work outperforms significantly when compared to the state-of-the-art multi-label PSCL predictors. A multi-label prediction accuracy (i.e., Overall Actual Accuracy) of the proposed model is enhanced by an absolute minimum margin of 6.7 percent on Benchmark dataset and 7.9 percent on Novel dataset when compared to the best PSCL predictor from the literature. Further, result of statistical t-test concludes that the performance of the proposed work is significantly improved and thus, the proposed work is an effective computational model to solve multi-label PSCL prediction. The proposed prediction model is hosted on web-server and available at https://nitkit-vgst727-nppsa.nitk.ac.in/deeplocpred/. © 2004-2012 IEEE.
dc.identifier.citationIEEE/ACM Transactions on Computational Biology and Bioinformatics, 2022, 19, 3, pp. 1449-1458
dc.identifier.issn15455963
dc.identifier.urihttps://doi.org/10.1109/TCBB.2020.3037465
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/22899
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.subjectCell proliferation
dc.subjectDeep neural networks
dc.subjectForecasting
dc.subjectLocation
dc.subjectProteins
dc.subjectBiological experiments
dc.subjectChloroplast proteins
dc.subjectCost-intensive
dc.subjectEvaluation metrics
dc.subjectMulti-label proteins
dc.subjectPlant cells
dc.subjectSingle location
dc.subjectState of the art
dc.subjectNeural networks
dc.subjectprotein
dc.subjectbiology
dc.subjectchloroplast
dc.subjectevolution
dc.subjectmetabolism
dc.subjectprocedures
dc.subjectBiological Evolution
dc.subjectChloroplasts
dc.subjectComputational Biology
dc.subjectNeural Networks, Computer
dc.titleAn Effective Multi-Label Protein Sub-Chloroplast Localization Prediction by Skipped-Grams of Evolutionary Profiles Using Deep Neural Network

Files

Collections