:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Dou, Yao, Xu, Wei
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2601.04424
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Evaluating LLMs on Chinese Idiom Translation
by: Yang, Cai, et al.
Published: (2025)

Are Long-LLMs A Necessity For Long-Context Tasks?
by: Qian, Hongjin, et al.
Published: (2024)

Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
by: Doddapaneni, Sumanth, et al.
Published: (2024)

Policies and Evaluation for Online Meeting Summarization
by: Schneider, Felix, et al.
Published: (2025)

Enabling Discriminative Reasoning in LLMs for Legal Judgment Prediction
by: Deng, Chenlong, et al.
Published: (2024)

LegalAgentBench: Evaluating LLM Agents in Legal Domain
by: Li, Haitao, et al.
Published: (2024)

Context-Aware Hierarchical Merging for Long Document Summarization
by: Ou, Litu, et al.
Published: (2025)

Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG
by: Jin, Bowen, et al.
Published: (2024)

Agent-as-Judge for Factual Summarization of Long Narratives
by: Jeong, Yeonseok, et al.
Published: (2025)

Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization
by: Qi, Siya, et al.
Published: (2025)

CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels
by: Wei, Lingxiao, et al.
Published: (2024)

Action-Item-Driven Summarization of Long Meeting Transcripts
by: Golia, Logan, et al.
Published: (2023)

Long Context vs. RAG for LLMs: An Evaluation and Revisits
by: Li, Xinze, et al.
Published: (2024)

PersonaMatrix: A Recipe for Persona-Aware Evaluation of Legal Summarization
by: Pang, Tsz Fung, et al.
Published: (2025)

Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
by: Jain, Sameer, et al.
Published: (2023)

Long-Context Long-Form Question Answering for Legal Domain
by: Kulkarni, Anagha, et al.
Published: (2026)

CTkvr: KV Cache Retrieval for Long-Context LLMs via Centroid then Token Indexing
by: Lu, Kuan, et al.
Published: (2025)

LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs
by: Wu, Yuhao, et al.
Published: (2024)

Systematic Evaluation of Long-Context LLMs on Financial Concepts
by: Gupta, Lavanya, et al.
Published: (2024)

Unstructured Evidence Attribution for Long Context Query Focused Summarization
by: Wright, Dustin, et al.
Published: (2025)

Reasoning or Not? A Comprehensive Evaluation of Reasoning LLMs for Dialogue Summarization
by: Jin, Keyan, et al.
Published: (2025)

Joint Enhancement of Relational Reasoning for Long-Context LLMs
by: Chen, Zhirui, et al.
Published: (2025)

Long$^2$RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall
by: Qi, Zehan, et al.
Published: (2024)

Automatic Legal Writing Evaluation of LLMs
by: Pires, Ramon, et al.
Published: (2025)

Evaluation Ethics of LLMs in Legal Domain
by: Zhang, Ruizhe, et al.
Published: (2024)

ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
by: Wu, Xixi, et al.
Published: (2025)

CREAM: Comparison-Based Reference-Free ELO-Ranked Automatic Evaluation for Meeting Summarization
by: Gong, Ziwei, et al.
Published: (2024)

CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization
by: Ding, Yixi, et al.
Published: (2024)

RocketEval: Efficient Automated LLM Evaluation via Grading Checklist
by: Wei, Tianjun, et al.
Published: (2025)

A Comprehensive Survey on Legal Summarization: Challenges and Future Directions
by: Akter, Mousumi, et al.
Published: (2025)

RELexED: Retrieval-Enhanced Legal Summarization with Exemplar Diversity
by: Santosh, T. Y. S. S., et al.
Published: (2025)

LexAbSumm: Aspect-based Summarization of Legal Decisions
by: Santosh, T. Y. S. S, et al.
Published: (2024)

Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding
by: Fei, Weizhi, et al.
Published: (2024)

StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs
by: Yuan, Haohan, et al.
Published: (2025)

TO-GATE: Clarifying Questions and Summarizing Responses with Trajectory Optimization for Eliciting Human Preference
by: Dou, Yulin, et al.
Published: (2025)

Korean Canonical Legal Benchmark: Toward Knowledge-Independent Evaluation of LLMs' Legal Reasoning Capabilities
by: Oh, Hongseok, et al.
Published: (2025)

LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data
by: Yang, Cehao, et al.
Published: (2025)

Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
by: Zhong, Meizhi, et al.
Published: (2024)

Improving Minimum Bayes Risk Decoding with Multi-Prompt
by: Heineman, David, et al.
Published: (2024)

Leveraging Discourse Structure for Extractive Meeting Summarization
by: Rennard, Virgile, et al.
Published: (2024)