Crime base: Towards building a knowledge base for crime entities and their relationships from online news papers

dc.contributor.authorSrinivasa, S.
dc.contributor.authorSanthi Thilagam, P.
dc.date.accessioned2026-02-05T09:29:31Z
dc.date.issued2019
dc.description.abstractIn the current era of internet, information related to crime is scattered across many sources namely news media, social networks, blogs, and video repositories, etc. Crime reports published in online newspapers are often considered as reliable compared to crowdsourced data like social media and contain crime information not only in the form of unstructured text but also in the form of images. Given the volume and availability of crime-related information present in online newspapers, gathering and integrating crime entities from multiple modalities and representing them as a knowledge base in machine-readable form will be useful for any law enforcement agencies to analyze and prevent criminal activities. Extant research works to generate the crime knowledge base, does not address extraction of all non-redundant entities from text and image data present in multiple newspapers. Hence, this work proposes Crime Base, an entity relationship based system to extract and integrate crime related text and image data from online newspapers with a focus towards reducing duplicity and loss of information in the knowledge base. The proposed system uses a rule-based approach to extract the entities from text and image captions. The entities extracted from text data are correlated using contextual as-well-as semantic similarity measures and image entities are correlated using low-level and high-level image features. The proposed system also presents an integrated view of these entities and their relations in the form of a knowledge base using OWL. The system is tested for a collection of crime related articles from popular Indian online newspapers. © 2019 Elsevier Ltd
dc.identifier.citationInformation Processing and Management, 2019, 56, 6, pp. -
dc.identifier.issn3064573
dc.identifier.urihttps://doi.org/10.1016/j.ipm.2019.102059
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/24322
dc.publisherElsevier Ltd
dc.subjectCrime
dc.subjectData mining
dc.subjectImage processing
dc.subjectInformation retrieval
dc.subjectIntegration
dc.subjectKnowledge based systems
dc.subjectKnowledge representation
dc.subjectNatural language processing systems
dc.subjectNewsprint
dc.subjectOntology
dc.subjectSemantics
dc.subjectWebsites
dc.subjectCriminal activities
dc.subjectEntity-relationship
dc.subjectLaw-enforcement agencies
dc.subjectMachine readable form
dc.subjectMultiple modalities
dc.subjectNAtural language processing
dc.subjectRule-based approach
dc.subjectSemantic similarity measures
dc.subjectComputer crime
dc.titleCrime base: Towards building a knowledge base for crime entities and their relationships from online news papers

Files

Collections