Saved in:
Bibliographic Details
Main Authors: Gayen, Avijit, Chakraborty, Somyajit, Sen, Mainak, Paul, Soham, Jana, Angshuman
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.21689
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912399153430528
author Gayen, Avijit
Chakraborty, Somyajit
Sen, Mainak
Paul, Soham
Jana, Angshuman
author_facet Gayen, Avijit
Chakraborty, Somyajit
Sen, Mainak
Paul, Soham
Jana, Angshuman
contents The persistent accumulation of unresolved legal cases, especially within the Indian judiciary, significantly hampers the timely delivery of justice. Manual methods of prioritizing petitions are often prone to inefficiencies and subjective biases further exacerbating delays. To address this issue, we propose LLMPR (Large Language Model-based Petition Ranking), an automated framework that utilizes transfer learning and machine learning to assign priority rankings to legal petitions based on their contextual urgency. Leveraging the ILDC dataset comprising 7,593 annotated petitions, we process unstructured legal text and extract features through various embedding techniques, including DistilBERT, LegalBERT, and MiniLM. These textual embeddings are combined with quantitative indicators such as gap days, rank scores, and word counts to train multiple machine learning models, including Random Forest, Decision Tree, XGBoost, LightGBM, and CatBoost. Our experiments demonstrate that Random Forest and Decision Tree models yield superior performance, with accuracy exceeding 99% and a Spearman rank correlation of 0.99. Notably, models using only numerical features achieve nearly optimal ranking results (R2 = 0.988, \r{ho} = 0.998), while LLM-based embeddings offer only marginal gains. These findings suggest that automated petition ranking can effectively streamline judicial workflows, reduce case backlog, and improve fairness in legal prioritization.
format Preprint
id arxiv_https___arxiv_org_abs_2505_21689
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle LLMPR: A Novel LLM-Driven Transfer Learning based Petition Ranking Model
Gayen, Avijit
Chakraborty, Somyajit
Sen, Mainak
Paul, Soham
Jana, Angshuman
Computation and Language
Artificial Intelligence
Machine Learning
The persistent accumulation of unresolved legal cases, especially within the Indian judiciary, significantly hampers the timely delivery of justice. Manual methods of prioritizing petitions are often prone to inefficiencies and subjective biases further exacerbating delays. To address this issue, we propose LLMPR (Large Language Model-based Petition Ranking), an automated framework that utilizes transfer learning and machine learning to assign priority rankings to legal petitions based on their contextual urgency. Leveraging the ILDC dataset comprising 7,593 annotated petitions, we process unstructured legal text and extract features through various embedding techniques, including DistilBERT, LegalBERT, and MiniLM. These textual embeddings are combined with quantitative indicators such as gap days, rank scores, and word counts to train multiple machine learning models, including Random Forest, Decision Tree, XGBoost, LightGBM, and CatBoost. Our experiments demonstrate that Random Forest and Decision Tree models yield superior performance, with accuracy exceeding 99% and a Spearman rank correlation of 0.99. Notably, models using only numerical features achieve nearly optimal ranking results (R2 = 0.988, \r{ho} = 0.998), while LLM-based embeddings offer only marginal gains. These findings suggest that automated petition ranking can effectively streamline judicial workflows, reduce case backlog, and improve fairness in legal prioritization.
title LLMPR: A Novel LLM-Driven Transfer Learning based Petition Ranking Model
topic Computation and Language
Artificial Intelligence
Machine Learning
url https://arxiv.org/abs/2505.21689