Skip to topic | Skip to bottom
Home
Main
Main.ATask-BasedEvaluationOfMulti-DocumentSummarizationr1.1 - 07 Dec 2007 - 02:54 - ElifAktolgatopic end

Start of topic | Skip to actions
-- ElifAktolga - 07 Dec 2007

Do Summaries Help? A Task-Based Evaluation of Multi-Document Summarization

Date Place Author Keyword
2005 SIGIR K. McKeown?, R. J. Passonneau, and D. K. Elson summarization, evaluation

Summary

In this paper, the usefulness of Columbia's multi-document summarization system Newsblaster is evaluated.

Design of Evaluation

Report writing Task:

  • Three questions on each of four topics
  • 30 minutes to write the report
  • Use only the provided information (4 clusters: 2 have related news, 2 have relevant news)
  • One of these conditions: no summaries; one sentence summaries; Newsblaster summaries; human summaries
  • Report content quality is measured

Method of Evaluation

Pyramid method for scoring the reports:

  • look over all available summaries
  • look for clause-length "summarization content units" (SCUs)
  • weight them by the number of summaries they occur in
  • arrange SCUs in a pyramid; fewest SCUs at the top (contained in all the summaries), most of them at the bottom (contained only in 1 summary)
  • good summaries will contain all the SCUs at the topmost tier and further below
  • pyramid score is the ratio of the summary's score (sum of SCU weights in the summary) to the optimal report score

Results

  • reports are significantly better if the Newsblaster summaries were provided than when no summary was provided
  • user satisfaction: higher with multi-document summaries (human or Newsblaster) than without any summaries
  • the better the summary was, the more it was used by the users

Reference


to top

You are here: Main > Fall2007ReadingGroup > ATask-BasedEvaluationOfMulti-DocumentSummarization

to top

Copyright © 1999-2008 by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding TWiki? Send feedback