Protein secondary structural class prediction using effective feature modeling and machine learning techniques

dc.contributor.authorBankapur, S.
dc.contributor.authorPatil, N.
dc.date.accessioned2020-03-30T10:22:54Z
dc.date.available2020-03-30T10:22:54Z
dc.date.issued2018
dc.description.abstractProtein Secondary Structural Class (PSSC) prediction is an important step to find its further folds, tertiary structure and functions, which in turn have potential applications in drug discovery. Various computational methods have been developed to predict the PSSC, however, predicting PSSC on the basis of protein sequences is still a challenging task. In this study, we propose an effective approach to extract features using two techniques (i) SkipXGram bi-gram: in which skipped bi-gram features are extracted and (ii) Character embedded features: in which features are extracted using word embedding approach. The combined feature sets from the proposed feature modeling approach are explored using various machine learning classifiers. The best performing classifier (i.e. Random Forest) is benchmarked against state-of-the-art PSSC prediction models. The proposed model was assessed on two low sequence similarity benchmark datasets i.e. 25PDB and FC699. The performance analysis demonstrates that the proposed model consistently outperformed state-of-the-art models by a factor of 3% to 23% and 4% to 6% for 25PDB and FC699 datasets respectively. � 2018 IEEE.en_US
dc.identifier.citationProceedings - 2018 IEEE 18th International Conference on Bioinformatics and Bioengineering, BIBE 2018, 2018, Vol., , pp.18-21en_US
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/8869
dc.titleProtein secondary structural class prediction using effective feature modeling and machine learning techniquesen_US
dc.typeBook chapteren_US

Files