TORA: Text Summarization Using Optical Character Recognition and Attention Neural Networks
No Thumbnail Available
Date
2022
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Springer Science and Business Media Deutschland GmbH
Abstract
Text Summarization is the process of creating a short and coherent version of a longer document that holds the same meaning as that of the original data. This article illustrates the technique to read the text in a printed document (such as newspaper, brochure, web document, etc.) and generate a summary of text. The method proposed is named Text Summarization using Optical Character Recognition and Attention Neural Networks (TORA). TORA can perform extractive summarization of a news article with the aid of Recurrent Neural Networks, Bidirectional Long Short-Term Memory, and Bahadanu Attention Network. The experimental results of the proposed method are promising. The experimental results have shown 80% accuracy in producing the summary from the large text document. © 2022, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
Description
Keywords
Attention network, Bahadanu attention network, Neural network, Optical character recognition, Text summarization
Citation
Lecture Notes in Electrical Engineering, 2022, Vol.789, , p. 243-255
