creators_name: Li, Liangda creators_name: Zhou, Ke creators_name: Xue, Gui-Rong creators_name: Zha, Hongyuan creators_name: Yu, Yong type: conference_item datestamp: 2009-04-06 19:08:42 lastmod: 2009-04-22 22:08:03 metadata_visibility: show title: Enhancing Diversity, Coverage and Balance for Summarization through Structure Learning ispublished: pub full_text_status: public pres_type: paper abstract: Document summarization plays an increasingly important role with the exponential growth of documents on the Web. Many supervised and unsupervised approaches have been proposed to generate summaries from documents. However, these approaches seldom simultaneously consider summary diversity, coverage, and balance issues which to a large extent determine the quality of summaries. In this paper, we consider extract-based summarization emphasizing the following three requirements: 1) diversity in summarization, which seeks to reduce redundancy among sentences in the summary; 2) sufficient coverage, which focuses on avoiding the loss of the document’s main information when generating the summary; and 3) balance, which demands that different aspects of the document need to have about the same relative importance in the summary. We formulate the extract-based summarization problem as learning a mapping from a set of sentences of a given document to a subset of the sentences that satisfies the above three requirements. The mapping is learned by incorporating several constraints in a structure learning framework, and we explore the graph structure of the output variables and employ structural SVM for solving the resulted optimization problem. Experiments on the DUC2001 data sets demonstrate significant performance improvements in terms of F1 and ROUGE metrics. date: 2009-04 pagerange: 71-71 event_title: 18th International World Wide Web Conference event_location: Madrid, Spain event_dates: April 20th-24th, 2009 event_type: conference refereed: TRUE citation: Li, Liangda and Zhou, Ke and Xue, Gui-Rong and Zha, Hongyuan and Yu, Yong (2009) Enhancing Diversity, Coverage and Balance for Summarization through Structure Learning. In: 18th International World Wide Web Conference, April 20th-24th, 2009, Madrid, Spain. document_url: http://www2009.eprints.org/8/1/p71.pdf document_url: http://www2009.eprints.org/8/2/fp739-li.ppt