An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy

Nowadays, automatic multidocument text summarization systems can successfully retrieve the summary sentences from the input documents. But, it has many limitations such as inaccurate extraction to essential sentences, low coverage, poor coherence among the sentences, and redundancy. This paper intro...

Full description

Saved in:
Bibliographic Details
Main Authors: Nedunchelian Ramanujam, Manivannan Kaliappan
Format: Article
Language:English
Published: Wiley 2016-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1155/2016/1784827
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832548110541258752
author Nedunchelian Ramanujam
Manivannan Kaliappan
author_facet Nedunchelian Ramanujam
Manivannan Kaliappan
author_sort Nedunchelian Ramanujam
collection DOAJ
description Nowadays, automatic multidocument text summarization systems can successfully retrieve the summary sentences from the input documents. But, it has many limitations such as inaccurate extraction to essential sentences, low coverage, poor coherence among the sentences, and redundancy. This paper introduces a new concept of timestamp approach with Naïve Bayesian Classification approach for multidocument text summarization. The timestamp provides the summary an ordered look, which achieves the coherent looking summary. It extracts the more relevant information from the multiple documents. Here, scoring strategy is also used to calculate the score for the words to obtain the word frequency. The higher linguistic quality is estimated in terms of readability and comprehensibility. In order to show the efficiency of the proposed method, this paper presents the comparison between the proposed methods with the existing MEAD algorithm. The timestamp procedure is also applied on the MEAD algorithm and the results are examined with the proposed method. The results show that the proposed method results in lesser time than the existing MEAD algorithm to execute the summarization process. Moreover, the proposed method results in better precision, recall, and F-score than the existing clustering with lexical chaining approach.
format Article
id doaj-art-3e37174a53824911881ca47f9604d384
institution Kabale University
issn 2356-6140
1537-744X
language English
publishDate 2016-01-01
publisher Wiley
record_format Article
series The Scientific World Journal
spelling doaj-art-3e37174a53824911881ca47f9604d3842025-02-03T06:42:09ZengWileyThe Scientific World Journal2356-61401537-744X2016-01-01201610.1155/2016/17848271784827An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp StrategyNedunchelian Ramanujam0Manivannan Kaliappan1Department of Computer Science and Engineering, Sri Venkateswara College of Engineering, Pennalur, Sriperumbudur TK 602117, IndiaDepartment of Information Technology, RMK Engineering College, Kavaraipettai 601206, IndiaNowadays, automatic multidocument text summarization systems can successfully retrieve the summary sentences from the input documents. But, it has many limitations such as inaccurate extraction to essential sentences, low coverage, poor coherence among the sentences, and redundancy. This paper introduces a new concept of timestamp approach with Naïve Bayesian Classification approach for multidocument text summarization. The timestamp provides the summary an ordered look, which achieves the coherent looking summary. It extracts the more relevant information from the multiple documents. Here, scoring strategy is also used to calculate the score for the words to obtain the word frequency. The higher linguistic quality is estimated in terms of readability and comprehensibility. In order to show the efficiency of the proposed method, this paper presents the comparison between the proposed methods with the existing MEAD algorithm. The timestamp procedure is also applied on the MEAD algorithm and the results are examined with the proposed method. The results show that the proposed method results in lesser time than the existing MEAD algorithm to execute the summarization process. Moreover, the proposed method results in better precision, recall, and F-score than the existing clustering with lexical chaining approach.http://dx.doi.org/10.1155/2016/1784827
spellingShingle Nedunchelian Ramanujam
Manivannan Kaliappan
An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy
The Scientific World Journal
title An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy
title_full An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy
title_fullStr An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy
title_full_unstemmed An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy
title_short An Automatic Multidocument Text Summarization Approach Based on Naïve Bayesian Classifier Using Timestamp Strategy
title_sort automatic multidocument text summarization approach based on naive bayesian classifier using timestamp strategy
url http://dx.doi.org/10.1155/2016/1784827
work_keys_str_mv AT nedunchelianramanujam anautomaticmultidocumenttextsummarizationapproachbasedonnaivebayesianclassifierusingtimestampstrategy
AT manivannankaliappan anautomaticmultidocumenttextsummarizationapproachbasedonnaivebayesianclassifierusingtimestampstrategy
AT nedunchelianramanujam automaticmultidocumenttextsummarizationapproachbasedonnaivebayesianclassifierusingtimestampstrategy
AT manivannankaliappan automaticmultidocumenttextsummarizationapproachbasedonnaivebayesianclassifierusingtimestampstrategy