| Main Authors: | Li, Ningke, Song, Yahui, Wang, Kailong, Li, Yuekang, Shi, Ling, Liu, Yi, Wang, Haoyu |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.13416 |
Similar Items
Drowzee: Metamorphic Testing for Fact-Conflicting Hallucination Detection in Large Language Models
by: Li, Ningke, et al.
Published: (2024)
Beyond Correctness: Exposing LLM-generated Logical Flaws in Reasoning via Multi-step Automated Theorem Proving
by: Zheng, Xinyi, et al.
Published: (2025)
Glitch Tokens in Large Language Models: Categorization Taxonomy and Effective Detection
by: Li, Yuxi, et al.
Published: (2024)
Continuous Embedding Attacks via Clipped Inputs in Jailbreaking Large Language Models
by: Xu, Zihao, et al.
Published: (2024)
Digger: Detecting Copyright Content Mis-usage in Large Language Model Training
by: Li, Haodong, et al.
Published: (2024)
Prompt Injection attack against LLM-integrated Applications
by: Liu, Yi, et al.
Published: (2023)
Circumventing Safety Alignment in Large Language Models Through Embedding Space Toxicity Attenuation
by: Zhang, Zhibo, et al.
Published: (2025)
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models
by: Sun, Chongren, et al.
Published: (2025)
GlitchProber: Advancing Effective Detection and Mitigation of Glitch Tokens in Large Language Models
by: Zhang, Zhibo, et al.
Published: (2024)
STEAMROLLER: A Multi-Agent System for Inclusive Automatic Speech Recognition for People who Stutter
by: Xu, Ziqi, et al.
Published: (2026)
FactCHD: Benchmarking Fact-Conflicting Hallucination Detection
by: Chen, Xiang, et al.
Published: (2023)
Large Language Models are overconfident and amplify human bias
by: Sun, Fengfei, et al.
Published: (2025)
Detection and Mitigation of Hallucination in Large Reasoning Models: A Mechanistic Perspective
by: Sun, Zhongxiang, et al.
Published: (2025)
ChronoFact: Timeline-based Temporal Fact Verification
by: Barik, Anab Maulana, et al.
Published: (2024)
Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection
by: Zhang, Chaowei, et al.
Published: (2025)
Jailbreaking ChatGPT via Prompt Engineering: An Empirical Study
by: Liu, Yi, et al.
Published: (2023)
Towards Unification of Hallucination Detection and Fact Verification for Large Language Models
by: Su, Weihang, et al.
Published: (2025)
FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs
by: Alnuhait, Deema, et al.
Published: (2024)
Uncovering Logit Suppression Vulnerabilities in LLM Safety Alignment
by: Li, Yuxi, et al.
Published: (2024)
LLM Hallucination Detection: HSAD
by: Li, JinXin, et al.
Published: (2025)
MiniScope: Automated UI Exploration and Privacy Inconsistency Detection of MiniApps via Two-phase Iterative Hybrid Analysis
by: Wang, Shenao, et al.
Published: (2024)
LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
by: Li, Jinxin, et al.
Published: (2025)
Steer LLM Latents for Hallucination Detection
by: Park, Seongheon, et al.
Published: (2025)
Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models
by: Wang, Changyue, et al.
Published: (2025)
Reducing Hallucinations in Entity Abstract Summarization with Facts-Template Decomposition
by: Zhu, Fangwei, et al.
Published: (2024)
MeTMaP: Metamorphic Testing for Detecting False Vector Matching Problems in LLM Augmented Generation
by: Wang, Guanyu, et al.
Published: (2024)
FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs
by: Sawczyn, Albert, et al.
Published: (2025)
LLM-Guided Knowledge Distillation for Temporal Knowledge Graph Reasoning
by: Xing, Wang, et al.
Published: (2026)
Beyond Translation: LLM-Based Data Generation for Multilingual Fact-Checking
by: Chung, Yi-Ling, et al.
Published: (2025)
Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-based Test Oracles
by: Xu, Zihao, et al.
Published: (2025)
How to Detect and Defeat Molecular Mirage: A Metric-Driven Benchmark for Hallucination in LLM-based Molecular Comprehension
by: Li, Hao, et al.
Published: (2025)
Mitigating Hallucinations in Large Vision-Language Models with Internal Fact-based Contrastive Decoding
by: Wang, Chao, et al.
Published: (2025)
VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts
by: Liu, Xin, et al.
Published: (2025)
Enhancing Temporal Sensitivity and Reasoning for Time-Sensitive Question Answering
by: Yang, Wanqi, et al.
Published: (2024)
Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
by: Gupta, Raavi, et al.
Published: (2025)
AgentHallu: Benchmarking Automated Hallucination Attribution of LLM-based Agents
by: Liu, Xuannan, et al.
Published: (2026)
HARP: Hallucination Detection via Reasoning Subspace Projection
by: Hu, Junjie, et al.
Published: (2025)
Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's Math Capability
by: Wang, Ruida, et al.
Published: (2025)
IndexRAG: Bridging Facts for Cross-Document Reasoning at Index Time
by: Bao, Zhenghua, et al.
Published: (2026)
From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs
by: Joshi, Ratnesh Kumar, et al.
Published: (2024)