Summarization Evaluation in the Absence of Human Model Summaries Using the Compositionality of Word Embeddings

We present a new summary evaluation approach that does not require human model summaries. Our approach exploits the compositional capabilities of corpus-based and lexical resource-based word embeddings to derive features reflecting the coverage, diversity, informativeness, and coherence of a summary. These features are then used to train a learning model that predicts summary content quality in the absence of gold-standard models. We evaluate the proposed metric on how well it replicates human-assigned scores, both for summarization systems and for individual summaries, using data from the query-focused and update summarization tasks of TAC 2008 and 2009. The results show that our feature combination provides reliable estimates of summary content quality when model summaries are not available.
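The abstract does not spell out how the embedding-based features are computed, so the following is only a minimal sketch of one plausible coverage-style feature, assuming averaged word embeddings as the composition function and cosine similarity as the scoring function. The embedding table and the `compose` and `coverage_feature` helpers are hypothetical names introduced for illustration, not the paper's implementation.

```python
import numpy as np

# Toy stand-in for pretrained word embeddings (e.g., word2vec/GloVe).
# In practice these would be loaded from a corpus-based or lexical
# resource-based embedding file; random vectors are used here only
# to keep the sketch self-contained.
rng = np.random.default_rng(0)
VOCAB = ["economy", "growth", "market", "policy", "inflation", "trade"]
EMB = {w: rng.standard_normal(50) for w in VOCAB}

def compose(text):
    """Compose a text vector by averaging its word embeddings
    (one simple compositionality scheme; the paper may use others)."""
    vecs = [EMB[w] for w in text.lower().split() if w in EMB]
    return np.mean(vecs, axis=0) if vecs else np.zeros(50)

def cosine(a, b):
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom else 0.0

def coverage_feature(summary, source_docs):
    """Coverage-style feature: similarity between the composed summary
    vector and the composed vector of the source documents."""
    return cosine(compose(summary), compose(" ".join(source_docs)))

docs = ["the economy shows strong growth as trade expands",
        "policy makers worry about inflation in the market"]
summary = "economy growth and inflation policy"
print(coverage_feature(summary, docs))
```

In the setting the abstract describes, features of this kind (alongside diversity, informativeness, and coherence features) would then be fed to a supervised model trained against human-assigned content scores.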

Published at COLING 2018.

Datasets

TAC 2008 and TAC 2009 (query-focused and update summarization tasks).
