NITK SCaLAR Lab at the CLEF 2025 SimpleText Track: Transformer-Based Models for Biomedical Sentence Simplification (Task 1.1)

Date

2025

Publisher

CEUR-WS

Abstract

This paper presents the participation of the SCaLAR Lab from the National Institute of Technology Karnataka, Surathkal (India), in the CLEF 2025 SimpleText Lab. Biomedical texts are often difficult to understand because of their complex vocabulary and sentence structures, which limits access to crucial scientific information for non-expert audiences. To make biomedical literature more accessible, we propose two transformer-based simplification pipelines: one combining BioBERT and BioBART with prompts that provide definitions, and another using a fine-tuned GPT-2 Medium model for direct simplification. Our dual approach effectively reduces lexical and syntactic complexity while preserving medical accuracy, supporting clearer communication and laying the foundation for future work on multilingual and hybrid simplification systems. © 2025 Copyright for this paper by its authors.

Keywords

BioBART, BioBERT, Biomedical, GPT-2, Text Simplification, Transformers

Citation

CEUR Workshop Proceedings, 2025, Vol. 4038, p. 4240-4254
