:: Library Catalog

Bibliographic Details
Main Authors:	Li, Haitao; Chen, Junjie; Ai, Qingyao; Chu, Zhumin; Zhou, Yujia; Dong, Qian; Liu, Yiqun
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2410.15393

Similar Items

Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task
by: Chen, Junjie, et al.
Published: (2025)

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
by: Li, Haitao, et al.
Published: (2024)

PRE: A Peer Review Based Large Language Model Evaluator
by: Chu, Zhumin, et al.
Published: (2024)

ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs
by: Li, Xuancheng, et al.
Published: (2026)

Auto-PRE: An Automatic and Cost-Efficient Peer-Review Framework for Language Generation Evaluation
by: Chen, Junjie, et al.
Published: (2024)

Benchmarking LLM-as-a-Judge for Long-Form Output Evaluation
by: Chen, Junjie, et al.
Published: (2026)

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
by: Li, Haitao, et al.
Published: (2024)

Beyond Scalar Reward Model: Learning Generative Judge from Preference Data
by: Ye, Ziyi, et al.
Published: (2024)

RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
by: Tu, Yiteng, et al.
Published: (2025)

Decoupling Reasoning and Knowledge Injection for In-Context Knowledge Editing
by: Wang, Changyue, et al.
Published: (2025)

Beyond Experience Retrieval: Learning to Generate Utility-Optimized Structured Experience for Frozen LLMs
by: Li, Xuancheng, et al.
Published: (2026)

Parametric Social Identity Injection and Diversification in Public Opinion Simulation
by: Wang, Hexi, et al.
Published: (2026)

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
by: Li, Haitao, et al.
Published: (2024)

Evaluation Ethics of LLMs in Legal Domain
by: Zhang, Ruizhe, et al.
Published: (2024)

DELTA: Pre-train a Discriminative Encoder for Legal Case Retrieval via Structural Word Alignment
by: Li, Haitao, et al.
Published: (2024)

Dynamic and Parametric Retrieval-Augmented Generation
by: Su, Weihang, et al.
Published: (2025)

MulFeRL: Enhancing Reinforcement Learning with Verbal Feedback in a Multi-turn Loop
by: Li, Xuancheng, et al.
Published: (2026)

Augmenting Multi-Agent Communication with State Delta Trajectory
by: Tang, Yichen, et al.
Published: (2025)

LegalOne: A Family of Foundation Models for Reliable Legal Reasoning
by: Li, Haitao, et al.
Published: (2026)

How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study
by: Zhu, Shuqi, et al.
Published: (2026)

Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
by: Su, Weihang, et al.
Published: (2024)

Mitigating Entity-Level Hallucination in Large Language Models
by: Su, Weihang, et al.
Published: (2024)

Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models
by: Wang, Changyue, et al.
Published: (2025)

TEC: A Collection of Human Trial-and-error Trajectories for Problem Solving
by: Zhang, Xinkai, et al.
Published: (2026)

Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation
by: Su, Weihang, et al.
Published: (2026)

CaseGen: A Benchmark for Multi-Stage Legal Case Documents Generation
by: Li, Haitao, et al.
Published: (2025)

ValueSim: Generating Backstories to Model Individual Value Systems
by: Du, Bangde, et al.
Published: (2025)

LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation
by: Li, Haitao, et al.
Published: (2025)

Enhancing LLM-Based Agents via Global Planning and Hierarchical Execution
by: Chen, Junjie, et al.
Published: (2025)

Improve Large Language Model Systems with User Logs
by: Wang, Changyue, et al.
Published: (2026)

Multi-Field Tool Retrieval
by: Tang, Yichen, et al.
Published: (2026)

Option-ID Based Elimination For Multiple Choice Questions
by: Zhu, Zhenhao, et al.
Published: (2025)

Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization
by: Su, Weihang, et al.
Published: (2026)

Knowledge Editing through Chain-of-Thought
by: Wang, Changyue, et al.
Published: (2024)

Parametric Retrieval Augmented Generation
by: Su, Weihang, et al.
Published: (2025)

DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models
by: Su, Weihang, et al.
Published: (2024)

Capability-aware Prompt Reformulation Learning for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)

Qilin: A Multimodal Information Retrieval Dataset with APP-level User Sessions
by: Chen, Jia, et al.
Published: (2025)

What Scales in Cross-Entropy Scaling Law?
by: Yan, Junxi, et al.
Published: (2025)

Analytical Search
by: Tu, Yiteng, et al.
Published: (2026)