Saved in:
Bibliographic Details
Main Authors: Huang, Yuting, Hu, Yinghao, Xiao, Qian, Zhong, Wenlin, Wu, Yiquan, Zhou, Taishi, Chen, Moke, Sun, Changlong, Kuang, Kun, Wu, Fei
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.17543
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914488895143936
author Huang, Yuting
Hu, Yinghao
Xiao, Qian
Zhong, Wenlin
Wu, Yiquan
Zhou, Taishi
Chen, Moke
Sun, Changlong
Kuang, Kun
Wu, Fei
author_facet Huang, Yuting
Hu, Yinghao
Xiao, Qian
Zhong, Wenlin
Wu, Yiquan
Zhou, Taishi
Chen, Moke
Sun, Changlong
Kuang, Kun
Wu, Fei
contents Large language models (LLMs) have achieved remarkable success in general-domain tasks, yet their direct application to the legal domain remains challenging due to hallucinated legal citations, incomplete knowledge coverage, and weak structured reasoning. To address these issues, we propose PoliLegalLM, a domain-specific large language model tailored for political and legal applications. Our approach adopts a unified training framework that integrates continued pretraining, progressive supervised fine-tuning, and preference-based reinforcement learning to jointly enhance legal knowledge grounding, task alignment, and reasoning capability. We construct a large-scale, high-quality legal corpus and design a structured post-training pipeline, enabling the model to effectively learn domain-specific knowledge and adapt to diverse legal tasks. We evaluate PoliLegalLM on three representative benchmarks, including LawBench, LexEval, and a real-world dataset, PoliLegal. Experimental results demonstrate that PoliLegalLM achieves strong and consistent performance, outperforming competitive models of similar scale and remaining highly competitive with significantly larger models, while achieving the best results on real-world legal scenarios. These results highlight the effectiveness of our training paradigm and the practical value of domain-specific LLMs for real-world legal applications.
format Preprint
id arxiv_https___arxiv_org_abs_2604_17543
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle PoliLegalLM: A Technical Report on a Large Language Model for Political and Legal Affairs
Huang, Yuting
Hu, Yinghao
Xiao, Qian
Zhong, Wenlin
Wu, Yiquan
Zhou, Taishi
Chen, Moke
Sun, Changlong
Kuang, Kun
Wu, Fei
Computation and Language
Large language models (LLMs) have achieved remarkable success in general-domain tasks, yet their direct application to the legal domain remains challenging due to hallucinated legal citations, incomplete knowledge coverage, and weak structured reasoning. To address these issues, we propose PoliLegalLM, a domain-specific large language model tailored for political and legal applications. Our approach adopts a unified training framework that integrates continued pretraining, progressive supervised fine-tuning, and preference-based reinforcement learning to jointly enhance legal knowledge grounding, task alignment, and reasoning capability. We construct a large-scale, high-quality legal corpus and design a structured post-training pipeline, enabling the model to effectively learn domain-specific knowledge and adapt to diverse legal tasks. We evaluate PoliLegalLM on three representative benchmarks, including LawBench, LexEval, and a real-world dataset, PoliLegal. Experimental results demonstrate that PoliLegalLM achieves strong and consistent performance, outperforming competitive models of similar scale and remaining highly competitive with significantly larger models, while achieving the best results on real-world legal scenarios. These results highlight the effectiveness of our training paradigm and the practical value of domain-specific LLMs for real-world legal applications.
title PoliLegalLM: A Technical Report on a Large Language Model for Political and Legal Affairs
topic Computation and Language
url https://arxiv.org/abs/2604.17543