:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Du, Bangde, Ye, Ziyi, Wu, Zhijing, Monika, Jankowska, Zhu, Shuqi, Ai, Qingyao, Zhou, Yujia, Liu, Yiqun
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.23827
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Understanding the Effect of Opinion Polarization in Short Video Browsing
by: Du, Bangde, et al.
Published: (2024)

How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study
by: Zhu, Shuqi, et al.
Published: (2026)

Parametric Social Identity Injection and Diversification in Public Opinion Simulation
by: Wang, Hexi, et al.
Published: (2026)

TwinVoice: A Multi-dimensional Benchmark Towards Digital Twins via LLM Persona Simulation
by: Du, Bangde, et al.
Published: (2025)

CrossPT-EEG: A Benchmark for Cross-Participant and Cross-Time Generalization of EEG-based Visual Decoding
by: Zhu, Shuqi, et al.
Published: (2024)

Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
by: Su, Weihang, et al.
Published: (2024)

DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models
by: Su, Weihang, et al.
Published: (2024)

Beyond Scalar Reward Model: Learning Generative Judge from Preference Data
by: Ye, Ziyi, et al.
Published: (2024)

RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
by: Tu, Yiteng, et al.
Published: (2025)

Mitigating Entity-Level Hallucination in Large Language Models
by: Su, Weihang, et al.
Published: (2024)

Parametric Retrieval Augmented Generation
by: Su, Weihang, et al.
Published: (2025)

Comparing point‐wise and pair‐wise relevance judgment with brain signals
by: Shuqi Zhu, et al.
Published: (2024)

LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
by: Li, Haitao, et al.
Published: (2024)

ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs
by: Li, Xuancheng, et al.
Published: (2026)

Decoupling Reasoning and Knowledge Injection for In-Context Knowledge Editing
by: Wang, Changyue, et al.
Published: (2025)

Improve Large Language Model Systems with User Logs
by: Wang, Changyue, et al.
Published: (2026)

BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
by: Li, Haitao, et al.
Published: (2024)

BrainLLM: Generative Language Decoding from Brain Recordings
by: Ye, Ziyi, et al.
Published: (2023)

Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation
by: Su, Weihang, et al.
Published: (2026)

Option-ID Based Elimination For Multiple Choice Questions
by: Zhu, Zhenhao, et al.
Published: (2025)

Augmenting Multi-Agent Communication with State Delta Trajectory
by: Tang, Yichen, et al.
Published: (2025)

CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges
by: Li, Haitao, et al.
Published: (2024)

Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models
by: Wang, Changyue, et al.
Published: (2025)

SurGE: A Benchmark and Evaluation Framework for Scientific Survey Generation
by: Su, Weihang, et al.
Published: (2025)

Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
by: Su, Weihang, et al.
Published: (2025)

Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization
by: Su, Weihang, et al.
Published: (2026)

TEC: A Collection of Human Trial-and-error Trajectories for Problem Solving
by: Zhang, Xinkai, et al.
Published: (2026)

Dynamic and Parametric Retrieval-Augmented Generation
by: Su, Weihang, et al.
Published: (2025)

Multi-Field Tool Retrieval
by: Tang, Yichen, et al.
Published: (2026)

Query Augmentation by Decoding Semantics from Brain Signals
by: Ye, Ziyi, et al.
Published: (2024)

LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
by: Li, Haitao, et al.
Published: (2024)

PRE: A Peer Review Based Large Language Model Evaluator
by: Chu, Zhumin, et al.
Published: (2024)

Relative-Based Scaling Law for Neural Language Models
by: Yue, Baoqing, et al.
Published: (2025)

Training-free Truthfulness Detection via Value Vectors in LLMs
by: Liu, Runheng, et al.
Published: (2025)

Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication
by: Shen, Jocelyn, et al.
Published: (2025)

Capability-aware Prompt Reformulation Learning for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)

Beyond Experience Retrieval: Learning to Generate Utility-Optimized Structured Experience for Frozen LLMs
by: Li, Xuancheng, et al.
Published: (2026)

Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task
by: Chen, Junjie, et al.
Published: (2025)

Knowledge Editing through Chain-of-Thought
by: Wang, Changyue, et al.
Published: (2024)

Virtual Personas for Language Models via an Anthology of Backstories
by: Moon, Suhong, et al.
Published: (2024)