Unsupervised Abstractive Text Summarization with Length Controlled Autoencoder

Date

2022

Publisher

Institute of Electrical and Electronics Engineers Inc.

Abstract

This work takes an unsupervised approach to abstractive text summarization, in which a large set of sentences is converted into a concise summary that highlights the essential details. This is achieved with an adversarial autoencoder (AAE) model. The encoder maps the input to a smaller latent vector, and the decoder decodes this latent code to generate the higher-dimensional output with some loss. Unlike variational autoencoders, AAEs use discriminators to learn with an adversarial loss. K-Means clustering and language models are used to produce the final summary. The model has been tested on several datasets, including the Amazon, Rotten Tomatoes, and Yelp review datasets, essentially as an opinion summarization task, and is evaluated using ROUGE-1, ROUGE-2, ROUGE-L, and BLEU scores. The same task is also conducted on a dataset in Hindi. We obtain a ROUGE-1 score of around 24% for the Amazon, Yelp, and CNN/Daily Mail datasets and 12% for Rotten Tomatoes, while the score obtained for the Hindi news articles dataset is only 8%. © 2022 IEEE.
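
The headline metric reported in the abstract measures unigram overlap between a generated summary and a reference summary. The following is a minimal sketch of that computation, assuming lowercasing and whitespace tokenization; the function name `rouge_1` and the example sentences are hypothetical and are not taken from the paper, whose exact evaluation setup the abstract does not specify.

```python
from collections import Counter


def rouge_1(candidate: str, reference: str) -> dict:
    """Unigram-overlap precision, recall, and F1 (illustrative sketch only)."""
    # Assumption: simple lowercase + whitespace tokenization.
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()

    # Clipped overlap: each unigram counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    precision = overlap / len(cand_tokens) if cand_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


# Hypothetical review-style example, not data from the paper.
scores = rouge_1("the food was great",
                 "the food was really great and cheap")
print(scores)
```

Here all four candidate words appear in the seven-word reference, so precision is 1.0 and recall is 4/7. Published ROUGE figures usually come from a standard toolkit that may also apply stemming, so values from this sketch are not directly comparable to the scores above.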

Keywords

Abstractive Text Summarization, Adversarial Autoencoder, Unsupervised Learning

Citation

INDICON 2022 - 2022 IEEE 19th India Council International Conference, 2022
