An effective feature extraction with deep neural network architecture for protein-secondary-structure prediction

No Thumbnail Available

Date

2021

Journal Title

Journal ISSN

Volume Title

Publisher

Springer

Abstract

With the increased importance of proteins in day-to-day life, it is imperative to know the protein functions. Deciphering protein structure elucidates protein functions. Experimental approaches for protein-structure analysis are expensive and time-consuming, and require high dexterity. Thus, finding a viable computational approach is vital. Due to the high complexity of predicting protein structure (tertiary structure) directly, research in this field aims at the protein-secondary-structure prediction which is directly related to its tertiary structure. This research aims at exploring a plethora of features, namely position-specific scoring matrices, hidden Markov model alignment matrices, and physicochemical properties, that carry rich information required to predict the secondary structure. Furthermore, it aims at exploring a suitable combination of the features which could capture diverse information about the protein secondary structure. Finally, a cascaded convolutional neural network and bidirectional long short-term memory architecture is fit on the models, and two evaluation metrics, namely, Q8 score and segment overlap score, are benchmarked on various datasets. Our proposed model trained on data of CB6133 dataset and tested on CB513 dataset beats the benchmark models by a minimum of 2.9%. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature.

Description

Keywords

Brain, Convolution, Convolutional neural networks, Deep neural networks, Forecasting, Hidden Markov models, Memory architecture, Network architecture, Physicochemical properties, Proteins, Bidirectional long short-term memory, Convolutional neural network, Hidden-Markov models, Physicochemical property, Protein functions, Protein secondary-structure prediction, Proteins structures, Q8 score, Segment overlap score, Tertiary structures, Long short-term memory, article, convolutional neural network, feature extraction, hidden Markov model, physical chemistry, position weight matrix, protein secondary structure, protein tertiary structure, short term memory

Citation

Network Modeling Analysis in Health Informatics and Bioinformatics, 2021, 10, 1, pp. -

Collections

Endorsement

Review

Supplemented By

Referenced By