Disaster Classification Using Multimodality Techniques by Integrating Images and Text
| dc.contributor.author | Medapati, B.M.R. | |
| dc.contributor.author | Pais, S.M. | |
| dc.contributor.author | Bhattacharjee, S. | |
| dc.date.accessioned | 2026-02-06T06:33:26Z | |
| dc.date.issued | 2025 | |
| dc.description.abstract | In today’s digital era, the abundance of information shared on social media platforms during disasters has become a crucial asset for disaster response operations. This research aims to improve disaster categorization by integrating image and tweet text data. The proposed model comprises two modules. The first module extracts features from text and images independently, using VGG-16 for images and a Bidirectional Long Short-Term Memory network combined with a Convolutional Neural Network for text, and then performs classification. The second module learns the relationship between textual content and images using Contrastive Language-Image Pre-training (CLIP). Late fusion then combines the outputs of the two modules, and a softmax classifier assigns the incident to one of seven humanitarian categories, improving the precision and efficacy of disaster classification. The model achieves an accuracy of 75% without data or image augmentation, which improves to 93% with different combinations of augmentations. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2025. | |
| dc.identifier.citation | Communications in Computer and Information Science, 2025, Vol. 2461 CCIS, p. 183-201 | |
| dc.identifier.issn | 18650929 | |
| dc.identifier.uri | https://doi.org/10.1007/978-3-031-96473-2_13 | |
| dc.identifier.uri | https://idr.nitk.ac.in/handle/123456789/28641 | |
| dc.publisher | Springer Science and Business Media Deutschland GmbH | |
| dc.subject | CLIP | |
| dc.subject | Cross attention | |
| dc.subject | Disaster management | |
| dc.subject | FastText | |
| dc.subject | Multimodal techniques | |
| dc.subject | SMOTE | |
| dc.subject | VGG-16 | |
| dc.title | Disaster Classification Using Multimodality Techniques by Integrating Images and Text |
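
The late-fusion step described in the abstract can be illustrated with a minimal sketch. The abstract does not state the fusion rule, so the weighted average of logits, the `weight` parameter, and the function names `softmax` and `late_fusion` are all assumptions made for illustration; random values stand in for the outputs of the unimodal (VGG-16 plus BiLSTM-CNN) module and the CLIP module.

```python
import numpy as np

NUM_CLASSES = 7  # seven humanitarian categories, per the abstract


def softmax(z):
    """Numerically stable softmax over the last axis."""
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def late_fusion(unimodal_logits, clip_logits, weight=0.5):
    """Hypothetical fusion rule: weighted average of the two modules'
    logits, followed by softmax. The paper's actual rule may differ."""
    fused = weight * unimodal_logits + (1.0 - weight) * clip_logits
    return softmax(fused)


# Placeholder logits for a batch of 4 posts (not real model outputs).
rng = np.random.default_rng(0)
unimodal_logits = rng.normal(size=(4, NUM_CLASSES))
clip_logits = rng.normal(size=(4, NUM_CLASSES))

probs = late_fusion(unimodal_logits, clip_logits)
predicted = probs.argmax(axis=1)  # index of the predicted category per post
```

Each row of `probs` is a probability distribution over the seven categories, and `predicted` holds the highest-scoring category for each post. Changing `weight` shifts the balance between the two modules.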
