Journal
INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING
Volume 4, Issue 1, Pages 141-152Publisher
WORLD SCIENTIFIC PUBL CO PTE LTD
DOI: 10.1142/S0219622005001428
Keywords
automatic extraction of important sentence; statistical information; structural feature; Web
Ask authors/readers for more resources
Being increasingly popular, the Internet greatly changes our lives. We can conveniently receive and send information via the Internet. With the information explosion on the Web, it is becoming crucial to develop means to automatically extract important sentences from the Web articles. In this paper, we propose a method which uses both statistical and structural information for sentence extraction. In addition, following the analysis of human's extractions, several heuristic rules axe added to filter out non-important sentences and to prevent similar sentences from being extracted. Our experimental results proved the effectiveness of these means. In particular, once the heuristic rules being added, a significant improvement has been observed.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available