TU Darmstadt / ULB / TUprints

Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Revision

Ruan, Qian ; Kuznetsov, Ilia ; Gurevych, Iryna
eds.: Ku, Lun-Wei ; Martins, Andre ; Srikumar, Vivek (2024)
Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Revision.
The 62nd Annual Meeting of the Association for Computational Linguistics. Bangkok, Thailand (11.08.2024-16.08.2024)
doi: 10.26083/tuprints-00028923
Conference or Workshop Item, Secondary publication, Publisher's Version

[img] Text
2024.acl-long.255.pdf
Copyright Information: CC BY 4.0 International - Creative Commons, Attribution.

Download (2MB)
Item Type: Conference or Workshop Item
Type of entry: Secondary publication
Title: Re3: A Holistic Framework and Dataset for Modeling Collaborative Document Revision
Language: English
Date: 17 December 2024
Place of Publication: Darmstadt
Year of primary publication: August 2024
Place of primary publication: Kerrville, TX, USA
Publisher: ACL
Book Title: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Event Title: The 62nd Annual Meeting of the Association for Computational Linguistics
Event Location: Bangkok, Thailand
Event Dates: 11.08.2024-16.08.2024
DOI: 10.26083/tuprints-00028923
Corresponding Links:
Origin: Secondary publication service
Abstract:

Collaborative review and revision of textual documents is the core of knowledge work and a promising target for empirical analysis and NLP assistance. Yet, a holistic framework that would allow modeling complex relationships between document revisions, reviews and author responses is lacking. To address this gap, we introduce Re3, a framework for joint analysis of collaborative document revision. We instantiate this framework in the scholarly domain, and present Re3-Sci, a large corpus of aligned scientific paper revisions manually labeled according to their action and intent, and supplemented with the respective peer reviews and human-written edit summaries. We use the new data to provide first empirical insights into collaborative document revision in the academic domain, and to assess the capabilities of state-of-the-art LLMs at automating edit analysis and facilitating text-based collaboration. We make our annotation environment and protocols, the resulting data and experimental code publicly available.

Status: Publisher's Version
URN: urn:nbn:de:tuda-tuprints-289236
Classification DDC: 000 Generalities, computers, information > 004 Computer science
Divisions: 20 Department of Computer Science > Ubiquitous Knowledge Processing
Zentrale Einrichtungen > hessian.AI - The Hessian Center for Artificial Intelligence
Date Deposited: 17 Dec 2024 16:37
Last Modified: 17 Dec 2024 16:37
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/28923
PPN:
Export:
Actions (login required)
View Item View Item