Query-Based Extractive Multi-Document Summarization Using Paraphrasing and Textual Entailment

One of the most common problems with computer networks is the amount of information in these networks. Meanwhile searching and getting inform about content of textual document, as the most widespread forms of information on such networks, is difficult and sometimes impossible. The goal of multi-docu...

Full description

Saved in:
Bibliographic Details
Main Author: Ali Naserasadi
Format: Article
Language:fas
Published: University of Qom 2020-09-01
Series:مدیریت مهندسی و رایانش نرم
Subjects:
Online Access:https://jemsc.qom.ac.ir/article_1270_d874d641d3de3efad886d09ba7e820fa.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:One of the most common problems with computer networks is the amount of information in these networks. Meanwhile searching and getting inform about content of textual document, as the most widespread forms of information on such networks, is difficult and sometimes impossible. The goal of multi-document textual summarization is to produce a pre-defined length summary from input textual documents while maximizing documents’ content coverage. This paper presents a new approach for textual document summarization based on paraphrasing and textual entailment relations and formulating the problem as an optimization problem. In this approach the sentences of input documents are clustered according to paraphrasing relation and then the entailment score and final score of a fraction of the header sentences of clusters which have the best score according to the user query is calculated. Finally, the optimization problem is solved via greedy and dynamic programming approaches and while selecting the best sentences, the final summary is generated. The results of implementing the proposed system on standard datasets and evaluation via ROUGE system show that the proposed system outperforms the state-of-the-art systems at least by 2.5% in average.
ISSN:2538-6239
2538-2675