Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Niklaus, Joel, Zheng, Lucia, McCarthy, Arya D., Hahn, Christopher, Rosen, Brian M., Henderson, Peter, Ho, Daniel E., Honke, Garrett, Liang, Percy, Manning, Christopher
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Artificial Intelligence Machine Learning 68T50 I.2
Online Access:	https://arxiv.org/abs/2404.02127
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912200242757632
author	Niklaus, Joel Zheng, Lucia McCarthy, Arya D. Hahn, Christopher Rosen, Brian M. Henderson, Peter Ho, Daniel E. Honke, Garrett Liang, Percy Manning, Christopher
author_facet	Niklaus, Joel Zheng, Lucia McCarthy, Arya D. Hahn, Christopher Rosen, Brian M. Henderson, Peter Ho, Daniel E. Honke, Garrett Liang, Percy Manning, Christopher
contents	Instruction tuning is an important step in making language models useful for direct user interaction. However, the legal domain is underrepresented in typical instruction datasets (e.g., only 10 out of 1600+ tasks in Super-NaturalInstructions). To study whether instruction tuning on legal datasets is necessary for strong legal reasoning, we aggregate 58 annotated legal datasets and write instructions for each, creating LawInstruct. LawInstruct covers 17 global jurisdictions, 24 languages and a total of 12M examples across diverse tasks such as legal QA, summarization of court cases, and legal argument mining. We evaluate our models on LegalBench, measuring legal reasoning across five categories in 162 challenging and realistic legal tasks, and MMLU, to measure potential drops in general reasoning capabilities. We find that legal-specific instruction tuning on Flan-T5 - yielding FLawN-T5 - improves performance on LegalBench across all model sizes, with an aggregate increase of 15 points or 50% over Flan-T5 for the base size. No model size shows performance drops in MMLU. We publish LawInstruct as a resource for further study of instruction tuning in the legal domain.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_02127
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain Niklaus, Joel Zheng, Lucia McCarthy, Arya D. Hahn, Christopher Rosen, Brian M. Henderson, Peter Ho, Daniel E. Honke, Garrett Liang, Percy Manning, Christopher Computation and Language Artificial Intelligence Machine Learning 68T50 I.2 Instruction tuning is an important step in making language models useful for direct user interaction. However, the legal domain is underrepresented in typical instruction datasets (e.g., only 10 out of 1600+ tasks in Super-NaturalInstructions). To study whether instruction tuning on legal datasets is necessary for strong legal reasoning, we aggregate 58 annotated legal datasets and write instructions for each, creating LawInstruct. LawInstruct covers 17 global jurisdictions, 24 languages and a total of 12M examples across diverse tasks such as legal QA, summarization of court cases, and legal argument mining. We evaluate our models on LegalBench, measuring legal reasoning across five categories in 162 challenging and realistic legal tasks, and MMLU, to measure potential drops in general reasoning capabilities. We find that legal-specific instruction tuning on Flan-T5 - yielding FLawN-T5 - improves performance on LegalBench across all model sizes, with an aggregate increase of 15 points or 50% over Flan-T5 for the base size. No model size shows performance drops in MMLU. We publish LawInstruct as a resource for further study of instruction tuning in the legal domain.
title	LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain
topic	Computation and Language Artificial Intelligence Machine Learning 68T50 I.2
url	https://arxiv.org/abs/2404.02127

Similar Items