| Main Authors: | Du, Bangde; Ye, Ziyi; Wu, Zhijing; Monika, Jankowska; Zhu, Shuqi; Ai, Qingyao; Zhou, Yujia; Liu, Yiqun |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.23827 |
Similar Items
Understanding the Effect of Opinion Polarization in Short Video Browsing
by: Du, Bangde, et al.
Published: (2024)
How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study
by: Zhu, Shuqi, et al.
Published: (2026)
Parametric Social Identity Injection and Diversification in Public Opinion Simulation
by: Wang, Hexi, et al.
Published: (2026)
TwinVoice: A Multi-dimensional Benchmark Towards Digital Twins via LLM Persona Simulation
by: Du, Bangde, et al.
Published: (2025)
CrossPT-EEG: A Benchmark for Cross-Participant and Cross-Time Generalization of EEG-based Visual Decoding
by: Zhu, Shuqi, et al.
Published: (2024)
Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models
by: Su, Weihang, et al.
Published: (2024)
DRAGIN: Dynamic Retrieval Augmented Generation based on the Information Needs of Large Language Models
by: Su, Weihang, et al.
Published: (2024)
Beyond Scalar Reward Model: Learning Generative Judge from Preference Data
by: Ye, Ziyi, et al.
Published: (2024)
RbFT: Robust Fine-tuning for Retrieval-Augmented Generation against Retrieval Defects
by: Tu, Yiteng, et al.
Published: (2025)
Mitigating Entity-Level Hallucination in Large Language Models
by: Su, Weihang, et al.
Published: (2024)
Parametric Retrieval Augmented Generation
by: Su, Weihang, et al.
Published: (2025)
Comparing point‐wise and pair‐wise relevance judgment with brain signals
by: Shuqi Zhu, et al.
Published: (2024)
LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation Methods
by: Li, Haitao, et al.
Published: (2024)
ATACompressor: Adaptive Task-Aware Compression for Efficient Long-Context Processing in LLMs
by: Li, Xuancheng, et al.
Published: (2026)
Decoupling Reasoning and Knowledge Injection for In-Context Knowledge Editing
by: Wang, Changyue, et al.
Published: (2025)
Improve Large Language Model Systems with User Logs
by: Wang, Changyue, et al.
Published: (2026)
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
by: Li, Haitao, et al.
Published: (2024)
BrainLLM: Generative Language Decoding from Brain Recordings
by: Ye, Ziyi, et al.
Published: (2023)
Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation
by: Su, Weihang, et al.
Published: (2026)
Option-ID Based Elimination For Multiple Choice Questions
by: Zhu, Zhenhao, et al.
Published: (2025)
Augmenting Multi-Agent Communication with State Delta Trajectory
by: Tang, Yichen, et al.
Published: (2025)
CalibraEval: Calibrating Prediction Distribution to Mitigate Selection Bias in LLMs-as-Judges
by: Li, Haitao, et al.
Published: (2024)
Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models
by: Wang, Changyue, et al.
Published: (2025)
SurGE: A Benchmark and Evaluation Framework for Scientific Survey Generation
by: Su, Weihang, et al.
Published: (2025)
Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
by: Su, Weihang, et al.
Published: (2025)
Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization
by: Su, Weihang, et al.
Published: (2026)
TEC: A Collection of Human Trial-and-error Trajectories for Problem Solving
by: Zhang, Xinkai, et al.
Published: (2026)
Dynamic and Parametric Retrieval-Augmented Generation
by: Su, Weihang, et al.
Published: (2025)
Multi-Field Tool Retrieval
by: Tang, Yichen, et al.
Published: (2026)
Query Augmentation by Decoding Semantics from Brain Signals
by: Ye, Ziyi, et al.
Published: (2024)
LexEval: A Comprehensive Chinese Legal Benchmark for Evaluating Large Language Models
by: Li, Haitao, et al.
Published: (2024)
PRE: A Peer Review Based Large Language Model Evaluator
by: Chu, Zhumin, et al.
Published: (2024)
Relative-Based Scaling Law for Neural Language Models
by: Yue, Baoqing, et al.
Published: (2025)
Training-free Truthfulness Detection via Value Vectors in LLMs
by: Liu, Runheng, et al.
Published: (2025)
Words Like Knives: Backstory-Personalized Modeling and Detection of Violent Communication
by: Shen, Jocelyn, et al.
Published: (2025)
Capability-aware Prompt Reformulation Learning for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)
Beyond Experience Retrieval: Learning to Generate Utility-Optimized Structured Experience for Frozen LLMs
by: Li, Xuancheng, et al.
Published: (2026)
Overview of the NTCIR-18 Automatic Evaluation of LLMs (AEOLLM) Task
by: Chen, Junjie, et al.
Published: (2025)
Knowledge Editing through Chain-of-Thought
by: Wang, Changyue, et al.
Published: (2024)
Virtual Personas for Language Models via an Anthology of Backstories
by: Moon, Suhong, et al.
Published: (2024)