TU Darmstadt / ULB / TUprints

Live blog summarization

Avinesh, P. V. S. ; Peyrard, Maxime ; Meyer, Christian M. (2024)
Live blog summarization.
In: Language Resources and Evaluation, 2021, 55 (1)
doi: 10.26083/tuprints-00023525
Article, Secondary publication, Publisher's Version

[img] Text
s10579-020-09513-5.pdf
Copyright Information: CC BY 4.0 International - Creative Commons, Attribution.

Download (1MB)
Item Type: Article
Type of entry: Secondary publication
Title: Live blog summarization
Language: English
Date: 10 December 2024
Place of Publication: Darmstadt
Year of primary publication: March 2021
Place of primary publication: Dordrecht
Publisher: Springer
Journal or Publication Title: Language Resources and Evaluation
Volume of the journal: 55
Issue Number: 1
DOI: 10.26083/tuprints-00023525
Corresponding Links:
Origin: Secondary publication DeepGreen
Abstract:

Live blogs are an increasingly popular news format to cover breaking news and live events in online journalism. Online news websites around the world are using this medium to give their readers a minute by minute update on an event. Good summaries enhance the value of the live blogs for a reader, but are often not available. In this article, (a) we first define the task of summarizing a live blog, (b) study ways of automatically collecting corpora for live blog summarization, and (c) understand the complexity of the task by empirically evaluating well-known state-of-the-art unsupervised and supervised summarization systems on our new corpus. We show that live blog summarization poses new challenges in the field of news summarization, since frequency and positional signals cannot be used. We make our tools publicly available to reconstruct the corpus and to conduct our empirical experiments. This encourages the research community to build upon and replicate our results.

Uncontrolled Keywords: Live blog summarization, Corpus construction, Focused crawling, Online journalism
Status: Publisher's Version
URN: urn:nbn:de:tuda-tuprints-235256
Classification DDC: 000 Generalities, computers, information > 004 Computer science
000 Generalities, computers, information > 070 News media, journalism, publishing
Divisions: 20 Department of Computer Science > Ubiquitous Knowledge Processing
DFG-Graduiertenkollegs > Research Training Group 1994 Adaptive Preparation of Information from Heterogeneous Sources
Date Deposited: 10 Dec 2024 13:19
Last Modified: 13 Dec 2024 10:49
SWORD Depositor: Deep Green
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/23525
PPN: 524551820
Export:
Actions (login required)
View Item View Item