Saved in:
| Main Authors: | Dou, Yao, Xu, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.04424 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Evaluating LLMs on Chinese Idiom Translation
by: Yang, Cai, et al.
Published: (2025)
by: Yang, Cai, et al.
Published: (2025)
Are Long-LLMs A Necessity For Long-Context Tasks?
by: Qian, Hongjin, et al.
Published: (2024)
by: Qian, Hongjin, et al.
Published: (2024)
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
by: Doddapaneni, Sumanth, et al.
Published: (2024)
by: Doddapaneni, Sumanth, et al.
Published: (2024)
Policies and Evaluation for Online Meeting Summarization
by: Schneider, Felix, et al.
Published: (2025)
by: Schneider, Felix, et al.
Published: (2025)
Enabling Discriminative Reasoning in LLMs for Legal Judgment Prediction
by: Deng, Chenlong, et al.
Published: (2024)
by: Deng, Chenlong, et al.
Published: (2024)
LegalAgentBench: Evaluating LLM Agents in Legal Domain
by: Li, Haitao, et al.
Published: (2024)
by: Li, Haitao, et al.
Published: (2024)
Context-Aware Hierarchical Merging for Long Document Summarization
by: Ou, Litu, et al.
Published: (2025)
by: Ou, Litu, et al.
Published: (2025)
Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG
by: Jin, Bowen, et al.
Published: (2024)
by: Jin, Bowen, et al.
Published: (2024)
Agent-as-Judge for Factual Summarization of Long Narratives
by: Jeong, Yeonseok, et al.
Published: (2025)
by: Jeong, Yeonseok, et al.
Published: (2025)
Evaluating LLMs' Assessment of Mixed-Context Hallucination Through the Lens of Summarization
by: Qi, Siya, et al.
Published: (2025)
by: Qi, Siya, et al.
Published: (2025)
CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels
by: Wei, Lingxiao, et al.
Published: (2024)
by: Wei, Lingxiao, et al.
Published: (2024)
Action-Item-Driven Summarization of Long Meeting Transcripts
by: Golia, Logan, et al.
Published: (2023)
by: Golia, Logan, et al.
Published: (2023)
Long Context vs. RAG for LLMs: An Evaluation and Revisits
by: Li, Xinze, et al.
Published: (2024)
by: Li, Xinze, et al.
Published: (2024)
PersonaMatrix: A Recipe for Persona-Aware Evaluation of Legal Summarization
by: Pang, Tsz Fung, et al.
Published: (2025)
by: Pang, Tsz Fung, et al.
Published: (2025)
Multi-Dimensional Evaluation of Text Summarization with In-Context Learning
by: Jain, Sameer, et al.
Published: (2023)
by: Jain, Sameer, et al.
Published: (2023)
Long-Context Long-Form Question Answering for Legal Domain
by: Kulkarni, Anagha, et al.
Published: (2026)
by: Kulkarni, Anagha, et al.
Published: (2026)
CTkvr: KV Cache Retrieval for Long-Context LLMs via Centroid then Token Indexing
by: Lu, Kuan, et al.
Published: (2025)
by: Lu, Kuan, et al.
Published: (2025)
LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs
by: Wu, Yuhao, et al.
Published: (2024)
by: Wu, Yuhao, et al.
Published: (2024)
Systematic Evaluation of Long-Context LLMs on Financial Concepts
by: Gupta, Lavanya, et al.
Published: (2024)
by: Gupta, Lavanya, et al.
Published: (2024)
Unstructured Evidence Attribution for Long Context Query Focused Summarization
by: Wright, Dustin, et al.
Published: (2025)
by: Wright, Dustin, et al.
Published: (2025)
Reasoning or Not? A Comprehensive Evaluation of Reasoning LLMs for Dialogue Summarization
by: Jin, Keyan, et al.
Published: (2025)
by: Jin, Keyan, et al.
Published: (2025)
Joint Enhancement of Relational Reasoning for Long-Context LLMs
by: Chen, Zhirui, et al.
Published: (2025)
by: Chen, Zhirui, et al.
Published: (2025)
Long$^2$RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall
by: Qi, Zehan, et al.
Published: (2024)
by: Qi, Zehan, et al.
Published: (2024)
Automatic Legal Writing Evaluation of LLMs
by: Pires, Ramon, et al.
Published: (2025)
by: Pires, Ramon, et al.
Published: (2025)
Evaluation Ethics of LLMs in Legal Domain
by: Zhang, Ruizhe, et al.
Published: (2024)
by: Zhang, Ruizhe, et al.
Published: (2024)
ReSum: Unlocking Long-Horizon Search Intelligence via Context Summarization
by: Wu, Xixi, et al.
Published: (2025)
by: Wu, Xixi, et al.
Published: (2025)
CREAM: Comparison-Based Reference-Free ELO-Ranked Automatic Evaluation for Meeting Summarization
by: Gong, Ziwei, et al.
Published: (2024)
by: Gong, Ziwei, et al.
Published: (2024)
CCSBench: Evaluating Compositional Controllability in LLMs for Scientific Document Summarization
by: Ding, Yixi, et al.
Published: (2024)
by: Ding, Yixi, et al.
Published: (2024)
RocketEval: Efficient Automated LLM Evaluation via Grading Checklist
by: Wei, Tianjun, et al.
Published: (2025)
by: Wei, Tianjun, et al.
Published: (2025)
A Comprehensive Survey on Legal Summarization: Challenges and Future Directions
by: Akter, Mousumi, et al.
Published: (2025)
by: Akter, Mousumi, et al.
Published: (2025)
RELexED: Retrieval-Enhanced Legal Summarization with Exemplar Diversity
by: Santosh, T. Y. S. S., et al.
Published: (2025)
by: Santosh, T. Y. S. S., et al.
Published: (2025)
LexAbSumm: Aspect-based Summarization of Legal Decisions
by: Santosh, T. Y. S. S, et al.
Published: (2024)
by: Santosh, T. Y. S. S, et al.
Published: (2024)
Retrieval Meets Reasoning: Dynamic In-Context Editing for Long-Text Understanding
by: Fei, Weizhi, et al.
Published: (2024)
by: Fei, Weizhi, et al.
Published: (2024)
StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs
by: Yuan, Haohan, et al.
Published: (2025)
by: Yuan, Haohan, et al.
Published: (2025)
TO-GATE: Clarifying Questions and Summarizing Responses with Trajectory Optimization for Eliciting Human Preference
by: Dou, Yulin, et al.
Published: (2025)
by: Dou, Yulin, et al.
Published: (2025)
Korean Canonical Legal Benchmark: Toward Knowledge-Independent Evaluation of LLMs' Legal Reasoning Capabilities
by: Oh, Hongseok, et al.
Published: (2025)
by: Oh, Hongseok, et al.
Published: (2025)
LongFaith: Enhancing Long-Context Reasoning in LLMs with Faithful Synthetic Data
by: Yang, Cehao, et al.
Published: (2025)
by: Yang, Cehao, et al.
Published: (2025)
Understanding the RoPE Extensions of Long-Context LLMs: An Attention Perspective
by: Zhong, Meizhi, et al.
Published: (2024)
by: Zhong, Meizhi, et al.
Published: (2024)
Improving Minimum Bayes Risk Decoding with Multi-Prompt
by: Heineman, David, et al.
Published: (2024)
by: Heineman, David, et al.
Published: (2024)
Leveraging Discourse Structure for Extractive Meeting Summarization
by: Rennard, Virgile, et al.
Published: (2024)
by: Rennard, Virgile, et al.
Published: (2024)
Similar Items
-
Evaluating LLMs on Chinese Idiom Translation
by: Yang, Cai, et al.
Published: (2025) -
Are Long-LLMs A Necessity For Long-Context Tasks?
by: Qian, Hongjin, et al.
Published: (2024) -
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
by: Doddapaneni, Sumanth, et al.
Published: (2024) -
Policies and Evaluation for Online Meeting Summarization
by: Schneider, Felix, et al.
Published: (2025) -
Enabling Discriminative Reasoning in LLMs for Legal Judgment Prediction
by: Deng, Chenlong, et al.
Published: (2024)