Saved in:
| Main Authors: | Barale, Claire, Barrett, Leslie, Bajaj, Vikram Sunil, Rovatsos, Michael |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.04041 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
When Fairness Isn't Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning
by: Barale, Claire, et al.
Published: (2025)
by: Barale, Claire, et al.
Published: (2025)
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English
by: Santosh, T. Y. S. S., et al.
Published: (2024)
by: Santosh, T. Y. S. S., et al.
Published: (2024)
ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks
by: Santosh, T. Y. S. S, et al.
Published: (2024)
by: Santosh, T. Y. S. S, et al.
Published: (2024)
LexRel: Benchmarking Legal Relation Extraction for Chinese Civil Cases
by: Cai, Yida, et al.
Published: (2025)
by: Cai, Yida, et al.
Published: (2025)
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
by: Li, Haitao, et al.
Published: (2024)
by: Li, Haitao, et al.
Published: (2024)
Engineering Conversational Search Systems: A Review of Applications, Architectures, and Functional Components
by: Schneider, Phillip, et al.
Published: (2024)
by: Schneider, Phillip, et al.
Published: (2024)
LexGenius: An Expert-Level Benchmark for Large Language Models in Legal General Intelligence
by: Liu, Wenjin, et al.
Published: (2025)
by: Liu, Wenjin, et al.
Published: (2025)
LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation
by: Li, Haitao, et al.
Published: (2025)
by: Li, Haitao, et al.
Published: (2025)
LexAbSumm: Aspect-based Summarization of Legal Decisions
by: Santosh, T. Y. S. S, et al.
Published: (2024)
by: Santosh, T. Y. S. S, et al.
Published: (2024)
LexChain: Modeling Legal Reasoning Chains for Chinese Tort Case Analysis
by: Xie, Huiyuan, et al.
Published: (2025)
by: Xie, Huiyuan, et al.
Published: (2025)
SpeciaLex: A Benchmark for In-Context Specialized Lexicon Learning
by: Imperial, Joseph Marvin, et al.
Published: (2024)
by: Imperial, Joseph Marvin, et al.
Published: (2024)
SynLexLM: Scaling Legal LLMs with Synthetic Data and Curriculum Learning
by: Upadhyay, Ojasw, et al.
Published: (2025)
by: Upadhyay, Ojasw, et al.
Published: (2025)
ProLex: A Benchmark for Language Proficiency-oriented Lexical Substitution
by: Zhang, Xuanming, et al.
Published: (2024)
by: Zhang, Xuanming, et al.
Published: (2024)
CoCoLex: Confidence-guided Copy-based Decoding for Grounded Legal Text Generation
by: S, Santosh T. Y. S., et al.
Published: (2025)
by: S, Santosh T. Y. S., et al.
Published: (2025)
LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence
by: Tummepalli, Anka Chandrahas, et al.
Published: (2026)
by: Tummepalli, Anka Chandrahas, et al.
Published: (2026)
MultiLexNorm++: A Unified Benchmark and a Generative Model for Lexical Normalization for Asian Languages
by: Buaphet, Weerayut, et al.
Published: (2026)
by: Buaphet, Weerayut, et al.
Published: (2026)
LexDrafter: Terminology Drafting for Legislative Documents using Retrieval Augmented Generation
by: Chouhan, Ashish, et al.
Published: (2024)
by: Chouhan, Ashish, et al.
Published: (2024)
LegalRikai: Open Benchmark -- Benchmark for Complex Japanese Corporate Legal Tasks
by: Fujita, Shogo, et al.
Published: (2025)
by: Fujita, Shogo, et al.
Published: (2025)
LexPro-1.0 Technical Report
by: Chen, Haotian, et al.
Published: (2025)
by: Chen, Haotian, et al.
Published: (2025)
LegalCore: A Dataset for Event Coreference Resolution in Legal Documents
by: Wei, Kangda, et al.
Published: (2025)
by: Wei, Kangda, et al.
Published: (2025)
BriefMe: A Legal NLP Benchmark for Assisting with Legal Briefs
by: Woo, Jesse, et al.
Published: (2025)
by: Woo, Jesse, et al.
Published: (2025)
Legal-DC: Benchmarking Retrieval-Augmented Generation for Legal Documents
by: Li, Yaocong, et al.
Published: (2026)
by: Li, Yaocong, et al.
Published: (2026)
LexGen: Domain-aware Multilingual Lexicon Generation
by: Maheshwari, Ayush, et al.
Published: (2024)
by: Maheshwari, Ayush, et al.
Published: (2024)
$\left|\,\circlearrowright\,\boxed{\text{BUS}}\,\right|$: A Large and Diverse Multimodal Benchmark for evaluating the ability of Vision-Language Models to understand Rebus Puzzles
by: Das, Trishanu, et al.
Published: (2025)
by: Das, Trishanu, et al.
Published: (2025)
Analyzing Temporal Complex Events with Large Language Models? A Benchmark towards Temporal, Long Context Understanding
by: Zhang, Zhihan, et al.
Published: (2024)
by: Zhang, Zhihan, et al.
Published: (2024)
MetaGraph: A Large-Scale Meta-Analysis of GenAI in Financial NLP (2022-2025)
by: Pedinotti, Paolo, et al.
Published: (2025)
by: Pedinotti, Paolo, et al.
Published: (2025)
A Reasoning-Focused Legal Retrieval Benchmark
by: Zheng, Lucia, et al.
Published: (2025)
by: Zheng, Lucia, et al.
Published: (2025)
Lex2Sent: A bagging approach to unsupervised sentiment analysis
by: Lange, Kai-Robin, et al.
Published: (2022)
by: Lange, Kai-Robin, et al.
Published: (2022)
PsychoLex: Unveiling the Psychological Mind of Large Language Models
by: Abbasi, Mohammad Amin, et al.
Published: (2024)
by: Abbasi, Mohammad Amin, et al.
Published: (2024)
Formally Verified Linear-Time Invertible Lexing
by: Chassot, Samuel, et al.
Published: (2025)
by: Chassot, Samuel, et al.
Published: (2025)
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
by: Fatemi, Bahare, et al.
Published: (2024)
by: Fatemi, Bahare, et al.
Published: (2024)
LegalBench.PT: A Benchmark for Portuguese Law
by: Canaverde, Beatriz, et al.
Published: (2025)
by: Canaverde, Beatriz, et al.
Published: (2025)
FastLexRank: Efficient Lexical Ranking for Structuring Social Media Posts
by: Li, Mao, et al.
Published: (2024)
by: Li, Mao, et al.
Published: (2024)
ViLexNorm: A Lexical Normalization Corpus for Vietnamese Social Media Text
by: Nguyen, Thanh-Nhi, et al.
Published: (2024)
by: Nguyen, Thanh-Nhi, et al.
Published: (2024)
SCTc-TE: A Comprehensive Formulation and Benchmark for Temporal Event Forecasting
by: Ma, Yunshan, et al.
Published: (2023)
by: Ma, Yunshan, et al.
Published: (2023)
LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text
by: yunhan, Li, et al.
Published: (2025)
by: yunhan, Li, et al.
Published: (2025)
Korean Canonical Legal Benchmark: Toward Knowledge-Independent Evaluation of LLMs' Legal Reasoning Capabilities
by: Oh, Hongseok, et al.
Published: (2025)
by: Oh, Hongseok, et al.
Published: (2025)
LexMatcher: Dictionary-centric Data Collection for LLM-based Machine Translation
by: Yin, Yongjing, et al.
Published: (2024)
by: Yin, Yongjing, et al.
Published: (2024)
Temporal Dependencies in In-Context Learning: The Role of Induction Heads
by: Bajaj, Anooshka, et al.
Published: (2026)
by: Bajaj, Anooshka, et al.
Published: (2026)
TriLex: A Framework for Multilingual Sentiment Analysis in Low-Resource South African Languages
by: Nkongolo, Mike, et al.
Published: (2025)
by: Nkongolo, Mike, et al.
Published: (2025)
Similar Items
-
When Fairness Isn't Statistical: The Limits of Machine Learning in Evaluating Legal Reasoning
by: Barale, Claire, et al.
Published: (2025) -
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in English
by: Santosh, T. Y. S. S., et al.
Published: (2024) -
ChronosLex: Time-aware Incremental Training for Temporal Generalization of Legal Classification Tasks
by: Santosh, T. Y. S. S, et al.
Published: (2024) -
LexRel: Benchmarking Legal Relation Extraction for Chinese Civil Cases
by: Cai, Yida, et al.
Published: (2025) -
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
by: Li, Haitao, et al.
Published: (2024)