4.6 Article

A Novel Approach for Semantic Extractive Text Summarization

Journal

APPLIED SCIENCES-BASEL
Volume 12, Issue 9, Pages -

Publisher

MDPI
DOI: 10.3390/app12094479

Keywords

text mining; text summarization; text extraction; semantic text extraction

Ask authors/readers for more resources

Text summarization is a technique for shortening long texts or documents. Manual summarization can be costly and time-consuming, while an extractive summarization model balances compression and retention ratios by preserving meaningful sentences and filtering out redundant information.
Text summarization is a technique for shortening down or exacting a long text or document. It becomes critical when someone needs a quick and accurate summary of very long content. Manual text summarization can be expensive and time-consuming. While summarizing, some important content, such as information, concepts, and features of the document, can be lost; therefore, the retention ratio, which contains informative sentences, is lost, and if more information is added, then lengthy texts can be produced, increasing the compression ratio. Therefore, there is a tradeoff between two ratios (compression and retention). The model preserves or collects all the informative sentences by taking only the long sentences and removing the short sentences with less of a compression ratio. It tries to balance the retention ratio by avoiding text redundancies and also filters irrelevant information from the text by removing outliers. It generates sentences in chronological order as the sentences are mentioned in the original document. It also uses a heuristic approach for selecting the best cluster or group, which contains more meaningful sentences that are present in the topmost sentences of the summary. Our proposed model extractive summarizer overcomes these deficiencies and tries to balance between compression and retention ratios.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available