An Enhanced Protein Fold Recognition for Low Similarity Datasets Using Convolutional and Skip-Gram Features with Deep Neural Network

dc.contributor.authorBankapur S.
dc.contributor.authorPatil N.
dc.date.accessioned2021-05-05T10:31:19Z
dc.date.available2021-05-05T10:31:19Z
dc.date.issued2021
dc.description.abstractThe protein fold recognition is one of the important tasks of structural biology, which helps in addressing further challenges like predicting the protein tertiary structures and its functions. Many machine learning works are published to identify the protein folds effectively. However, very few works have reported the fold recognition accuracy above 80% on benchmark datasets. In this study, an effective set of global and local features are extracted from the proposed Convolutional (Conv) and SkipXGram bi-gram (SXGbg) techniques, and the fold recognition is performed using the proposed deep neural network. The performance of the proposed model reported 91.4% fold accuracy on one of the derived low similarity (< 25%) datasets of latest extended version of SCOPe_2.07. The proposed model is further evaluated on three popular and publicly available benchmark datasets such as DD, EDD, and TG and obtained 85.9%, 95.8%, and 88.8% fold accuracies, respectively. This work is first to report fold recognition accuracy above 85% on all the benchmark datasets. The performance of the proposed model has outperformed the best state-of-the-art models by 5% to 23% on DD, 2% to 19% on EDD, and 3% to 30% on TG dataset. © 2002-2011 IEEE.en_US
dc.identifier.citationIEEE Transactions on Nanobioscience , Vol. 20 , 1 , p. 42 - 49en_US
dc.identifier.urihttps://doi.org/10.1109/TNB.2020.3022456
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/16692
dc.titleAn Enhanced Protein Fold Recognition for Low Similarity Datasets Using Convolutional and Skip-Gram Features with Deep Neural Networken_US
dc.typeArticleen_US

Files