Profile generation from web sources: an information extraction system

dc.contributor.authorRanjan, R.
dc.contributor.authorVathsala, H.
dc.contributor.authorKoolagudi, S.G.
dc.date.accessioned2026-02-04T12:27:30Z
dc.date.issued2022
dc.description.abstractThe Internet space has a vast collection of information which is not always structured. These sources of information such as social media, news articles, blogs, speeches and videos often contain information that could be utilized to generate decision making tools such as reports about events and individuals. Using this information is a long and tedious process if done manually. Over the years, a lot of research has been done in data mining and natural language processing techniques to facilitate the consumption of this vast amount of data. The current work describes ProfileGen, an information extraction system that uses a variety of these data sources to form a profile of a given person. There are two parts to this application: The first part uses information publicly available on social media sites, news articles on news websites and blogs and compiles this information to form a corpus about the given person, and in the second part, the information is ranked using machine learning techniques, so as to provide information in the order of importance. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Austria, part of Springer Nature.
dc.identifier.citationSocial Network Analysis and Mining, 2022, 12, 1, pp. -
dc.identifier.issn18695450
dc.identifier.urihttps://doi.org/10.1007/s13278-021-00827-y
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/22308
dc.publisherSpringer
dc.subjectData mining
dc.subjectDecision making
dc.subjectInformation retrieval
dc.subjectInformation retrieval systems
dc.subjectInformation use
dc.subjectLearning algorithms
dc.subjectSentiment analysis
dc.subjectSocial networking (online)
dc.subjectBiography generation
dc.subjectDecision making tool
dc.subjectInformation extraction
dc.subjectInformation extraction systems
dc.subjectNews articles
dc.subjectNews video
dc.subjectProfilegen
dc.subjectSocial media
dc.subjectSources of informations
dc.subjectWeb sources
dc.subjectRecurrent neural networks
dc.titleProfile generation from web sources: an information extraction system

Files

Collections