Information Ordering with an Event-Enriched Vector Space Model for Multi-Document News Summarization

Article ID: iaor20161491
Volume: 32
Issue: 2
Start Page Number: 323
End Page Number: 351
Publication Date: May 2016
Journal: Computational Intelligence
Authors: , , ,
Keywords: information, networks, statistics: regression
Abstract:

Information ordering is a nontrivial task in multi‐document summarization (MDS), which typically relies on the traditional vector space model (VSM), a representation notorious for its semantic deficiency. In this article, we propose a novel event‐enriched VSM to alleviate the problem by building event semantics into sentence representations. The mediation of event information between sentence and term, especially in the news domain, has intuitive appeal as well as a technical advantage in common sentence‐level operations such as sentence similarity computation. Inspired by the block‐style writing of humans, we base the sentence ordering algorithm on sentence clustering. To accommodate the complexity introduced by event information, we adopt a soft‐to‐hard clustering strategy on the event and sentence levels, using expectation–maximization clustering and K‐means, respectively. For the purpose of cluster‐based sentence ordering, the event‐enriched VSM enables us to design an ordering algorithm that enhances event coherence computed between sentence and sentence–context pairs. Drawing on the findings of earlier research, we also incorporate topic continuity measures and time information into the scheme. We evaluate the performance of the model and its variants automatically and manually, with experimental results showing a clear advantage of the event‐based model over the baseline and non‐event‐based models in information ordering for multi‐document news summarization. We are confident that the event‐enriched VSM has even greater potential in summarization and beyond, which awaits further research.
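
The abstract does not give the model's formulas, so the following is only a minimal sketch of the general idea that event features can raise the similarity of sentences sharing no surface terms. The function names `cosine` and `enrich`, the event label `E_attack`, and the `event_weight` mixing parameter are all illustrative assumptions, not the article's actual formulation.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(k, 0.0) for k, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def enrich(terms, events, event_weight=1.0):
    """Build a sparse sentence vector from term counts plus event features.

    `events` is a list of hypothetical event labels attached to the
    sentence (for example, event-cluster ids). `event_weight` is an
    assumed mixing parameter, not a value taken from the article.
    """
    vec = {("term", t): float(c) for t, c in Counter(terms).items()}
    for e, c in Counter(events).items():
        vec[("event", e)] = event_weight * c
    return vec


# Two sentences with no shared terms but the same (hypothetical) event.
terms_a = ["troops", "entered", "city"]
terms_b = ["soldiers", "stormed", "town"]

plain_sim = cosine(enrich(terms_a, []), enrich(terms_b, []))
event_sim = cosine(enrich(terms_a, ["E_attack"]), enrich(terms_b, ["E_attack"]))

print(plain_sim)  # term-only vectors do not overlap
print(event_sim)  # the shared event dimension adds overlap
```

In this toy setup the term-only vectors are orthogonal, while the shared event dimension yields a nonzero similarity. The article's model presumably weights and clusters events in a more elaborate way than this sketch.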
