An Enhanced Protein Fold Recognition for Low Similarity Datasets Using Convolutional and Skip-Gram Features with Deep Neural Network

dc.contributor.author: Bankapur, S.
dc.contributor.author: Patil, N.
dc.date.accessioned: 2026-02-05T09:27:43Z
dc.date.issued: 2021
dc.description.abstract: Protein fold recognition is an important task in structural biology that helps address further challenges such as predicting protein tertiary structures and their functions. Many machine learning approaches have been published to identify protein folds effectively. However, very few have reported fold recognition accuracy above 80% on benchmark datasets. In this study, an effective set of global and local features is extracted using the proposed Convolutional (Conv) and SkipXGram bi-gram (SXGbg) techniques, and fold recognition is performed using the proposed deep neural network. The proposed model achieved 91.4% fold accuracy on one of the low-similarity (< 25%) datasets derived from the latest extended version of SCOPe_2.07. The model was further evaluated on three popular, publicly available benchmark datasets (DD, EDD, and TG) and obtained fold accuracies of 85.9%, 95.8%, and 88.8%, respectively. This work is the first to report fold recognition accuracy above 85% on all of these benchmark datasets. The proposed model outperformed the best state-of-the-art models by 5% to 23% on DD, 2% to 19% on EDD, and 3% to 30% on TG. © 2002-2011 IEEE.
dc.identifier.citation: IEEE Transactions on Nanobioscience, 2021, 20, 1, pp. 42-49
dc.identifier.issn: 15361241
dc.identifier.uri: https://doi.org/10.1109/TNB.2020.3022456
dc.identifier.uri: https://idr.nitk.ac.in/handle/123456789/23496
dc.publisher: Institute of Electrical and Electronics Engineers Inc.
dc.subject: Convolution
dc.subject: Deep neural networks
dc.subject: Protein folding
dc.subject: Proteins
dc.subject: Benchmark datasets
dc.subject: Best state
dc.subject: Extended versions
dc.subject: Fold recognition
dc.subject: Local feature
dc.subject: Protein fold recognition
dc.subject: Protein tertiary structures
dc.subject: Structural biology
dc.subject: Neural networks
dc.subject: protein
dc.subject: machine learning
dc.subject: Machine Learning
dc.subject: Neural Networks, Computer
dc.title: An Enhanced Protein Fold Recognition for Low Similarity Datasets Using Convolutional and Skip-Gram Features with Deep Neural Network
