A Hybrid Weighted Loss Function for Enhanced Protein Interaction Site Prediction

dc.contributor.authorBhat, P.
dc.contributor.authorPatil, N.
dc.date.accessioned2026-02-06T06:33:18Z
dc.date.issued2025
dc.description.abstractAccurately predicting protein interaction sites is crucial for applications such as protein design, drug discovery, and functional protein analysis. However, a significant challenge in this task arises from the inherent class imbalance between interacting and non-interacting sites in protein datasets. While data augmentation techniques are commonly used to mitigate this imbalance, they often introduce noise, potentially reducing prediction accuracy. In this study, we present a novel approach to improve protein interaction site prediction by developing a customized loss function that combines focal loss and cost-sensitive loss, specifically designed to address class imbalance without relying on data augmentation. Our model, which integrates graph convolutional networks (GCNs) to process evolutionary and structural features of proteins, is evaluated using robust performance metrics suited for imbalanced data: Matthews Correlation Coefficient (MCC) and Area Under Precision-Recall Curve (AUPRC). We evaluate the proposed method on the Test_60 dataset, achieving an MCC of 0.342 and an AUPRC of 0.425, providing a modest improvement over the standard cross-entropy loss function. These findings highlight the effectiveness of our tailored loss function in handling class imbalance and improving prediction performance in protein interaction site prediction. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
dc.identifier.citationLecture Notes in Networks and Systems, 2025, Vol.1371 LNNS, , p. 111-123
dc.identifier.issn23673370
dc.identifier.urihttps://doi.org/10.1007/978-981-96-5723-0_8
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/28588
dc.publisherSpringer Science and Business Media Deutschland GmbH
dc.subjectAUPRC
dc.subjectClass-imbalance
dc.subjectCost-sensitive loss
dc.subjectFocal loss
dc.subjectMCC
dc.subjectProtein-interaction site
dc.titleA Hybrid Weighted Loss Function for Enhanced Protein Interaction Site Prediction

Files