:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Ningke, Song, Yahui, Wang, Kailong, Li, Yuekang, Shi, Ling, Liu, Yi, Wang, Haoyu
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.13416
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Drowzee: Metamorphic Testing for Fact-Conflicting Hallucination Detection in Large Language Models
by: Li, Ningke, et al.
Published: (2024)

Beyond Correctness: Exposing LLM-generated Logical Flaws in Reasoning via Multi-step Automated Theorem Proving
by: Zheng, Xinyi, et al.
Published: (2025)

Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection
by: Li, Yuxi, et al.
Published: (2024)

Continuous Embedding Attacks via Clipped Inputs in Jailbreaking Large Language Models
by: Xu, Zihao, et al.
Published: (2024)

Digger: Detecting Copyright Content Mis-usage in Large Language Model Training
by: Li, Haodong, et al.
Published: (2024)

Prompt Injection attack against LLM-integrated Applications
by: Liu, Yi, et al.
Published: (2023)

Circumventing Safety Alignment in Large Language Models Through Embedding Space Toxicity Attenuation
by: Zhang, Zhibo, et al.
Published: (2025)

OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models
by: Sun, Chongren, et al.
Published: (2025)

GlitchProber: Advancing Effective Detection and Mitigation of Glitch Tokens in Large Language Models
by: Zhang, Zhibo, et al.
Published: (2024)

STEAMROLLER: A Multi-Agent System for Inclusive Automatic Speech Recognition for People who Stutter
by: Xu, Ziqi, et al.
Published: (2026)

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
by: Chen, Xiang, et al.
Published: (2023)

Large Language Models are overconfident and amplify human bias
by: Sun, Fengfei, et al.
Published: (2025)

Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective
by: Sun, Zhongxiang, et al.
Published: (2025)

ChronoFact: Timeline-based Temporal Fact Verification
by: Barik, Anab Maulana, et al.
Published: (2024)

Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection
by: Zhang, Chaowei, et al.
Published: (2025)

Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study
by: Liu, Yi, et al.
Published: (2023)

Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
by: Su, Weihang, et al.
Published: (2025)

FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs
by: Alnuhait, Deema, et al.
Published: (2024)

Uncovering Logit Suppression Vulnerabilities in LLM Safety Alignment
by: Li, Yuxi, et al.
Published: (2024)

LLM Hallucination Detection: HSAD
by: Li, JinXin, et al.
Published: (2025)

MiniScope: Automated UI Exploration and Privacy Inconsistency Detection of MiniApps via Two-phase Iterative Hybrid Analysis
by: Wang, Shenao, et al.
Published: (2024)

LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
by: Li, Jinxin, et al.
Published: (2025)

Steer LLM Latents for Hallucination Detection
by: Park, Seongheon, et al.
Published: (2025)

Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models
by: Wang, Changyue, et al.
Published: (2025)

Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
by: Zhu, Fangwei, et al.
Published: (2024)

MeTMaP: Metamorphic Testing for Detecting False Vector Matching Problems in LLM Augmented Generation
by: Wang, Guanyu, et al.
Published: (2024)

FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
by: Sawczyn, Albert, et al.
Published: (2025)

LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning
by: Xing, Wang, et al.
Published: (2026)

Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking
by: Chung, Yi-Ling, et al.
Published: (2025)

Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-based Test Oracles
by: Xu, Zihao, et al.
Published: (2025)

How to Detect and Defeat Molecular Mirage: A Metric-Driven Benchmark for Hallucination in LLM-based Molecular Comprehension
by: Li, Hao, et al.
Published: (2025)

Mitigating Hallucinations in Large Vision-Language Models with Internal Fact-based Contrastive Decoding
by: Wang, Chao, et al.
Published: (2025)

VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts
by: Liu, Xin, et al.
Published: (2025)

Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering
by: Yang, Wanqi, et al.
Published: (2024)

Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
by: Gupta, Raavi, et al.
Published: (2025)

AgentHallu: Benchmarking Automated Hallucination Attribution of LLM-based Agents
by: Liu, Xuannan, et al.
Published: (2026)

HARP: Hallucination Detection via Reasoning Subspace Projection
by: Hu, Junjie, et al.
Published: (2025)

Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability
by: Wang, Ruida, et al.
Published: (2025)

IndexRAG: Bridging Facts for Cross-Document Reasoning at Index Time
by: Bao, Zhenghua, et al.
Published: (2026)

From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs
by: Joshi, Ratnesh Kumar, et al.
Published: (2024)