Saved in:
Bibliographic Details
Main Authors: Bindal, Purnima, Kumar, Vikas, Rathore, Sagar, Bhatnagar, Vasudha
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.12290
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918203489255424
author Bindal, Purnima
Kumar, Vikas
Rathore, Sagar
Bhatnagar, Vasudha
author_facet Bindal, Purnima
Kumar, Vikas
Rathore, Sagar
Bhatnagar, Vasudha
contents Summarization of legal judgments poses a heavy cognitive burden on law practitioners due to the complexity of the language, context-sensitive legal jargon, and the length of the document. Therefore, the automatic summarization of legal documents has attracted serious attention from natural language processing researchers. Since the abstractive summaries of legal documents generated by deep neural methods remain prone to the risk of misrepresenting nuanced legal jargon or overlooking key contextual details, we envisage a rising trend toward the use of extractive case summarizers. Given the high cost of human annotation for gold standard extractive summaries, we engineer a light and transparent pipeline that leverages existing abstractive gold standard summaries to create the corresponding extractive gold standard versions. The approach ensures that the experts` opinions ensconced in the original gold standard abstractive summaries are carried over to the transformed extractive summaries. We aim to augment seven existing case summarization datasets, which include abstractive summaries, by incorporating corresponding extractive summaries and create an enriched data resource for case summarization research community. To ensure the quality of the augmented extractive summaries, we perform an extensive comparative evaluation with the original abstractive gold standard summaries covering structural, lexical, and semantic dimensions. We also compare the domain-level information of the two summaries. We commit to release the augmented datasets in the public domain for use by the research community and believe that the resource will offer opportunities to advance the field of automatic summarization of legal documents.
format Preprint
id arxiv_https___arxiv_org_abs_2511_12290
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle AugAbEx : Way Forward for Extractive Case Summarization
Bindal, Purnima
Kumar, Vikas
Rathore, Sagar
Bhatnagar, Vasudha
Computation and Language
Summarization of legal judgments poses a heavy cognitive burden on law practitioners due to the complexity of the language, context-sensitive legal jargon, and the length of the document. Therefore, the automatic summarization of legal documents has attracted serious attention from natural language processing researchers. Since the abstractive summaries of legal documents generated by deep neural methods remain prone to the risk of misrepresenting nuanced legal jargon or overlooking key contextual details, we envisage a rising trend toward the use of extractive case summarizers. Given the high cost of human annotation for gold standard extractive summaries, we engineer a light and transparent pipeline that leverages existing abstractive gold standard summaries to create the corresponding extractive gold standard versions. The approach ensures that the experts` opinions ensconced in the original gold standard abstractive summaries are carried over to the transformed extractive summaries. We aim to augment seven existing case summarization datasets, which include abstractive summaries, by incorporating corresponding extractive summaries and create an enriched data resource for case summarization research community. To ensure the quality of the augmented extractive summaries, we perform an extensive comparative evaluation with the original abstractive gold standard summaries covering structural, lexical, and semantic dimensions. We also compare the domain-level information of the two summaries. We commit to release the augmented datasets in the public domain for use by the research community and believe that the resource will offer opportunities to advance the field of automatic summarization of legal documents.
title AugAbEx : Way Forward for Extractive Case Summarization
topic Computation and Language
url https://arxiv.org/abs/2511.12290