Saved in:
| Main Authors: | Shan, Alex, Bauer, John, Manning, Christopher D. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.04844 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Semgrex and Ssurgeon, Searching and Manipulating Dependency Graphs
by: Bauer, John, et al.
Published: (2024)
by: Bauer, John, et al.
Published: (2024)
Do "English" Named Entity Recognizers Work Well on Global Englishes?
by: Shan, Alexander, et al.
Published: (2024)
by: Shan, Alexander, et al.
Published: (2024)
A New Pair of GloVes
by: Carlson, Riley, et al.
Published: (2025)
by: Carlson, Riley, et al.
Published: (2025)
Drop Dropout on Single-Epoch Language Model Pretraining
by: Liu, Houjun, et al.
Published: (2025)
by: Liu, Houjun, et al.
Published: (2025)
Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
by: Laitenberger, Alex, et al.
Published: (2025)
by: Laitenberger, Alex, et al.
Published: (2025)
Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools
by: Magesh, Varun, et al.
Published: (2024)
by: Magesh, Varun, et al.
Published: (2024)
Lynx: An Open Source Hallucination Evaluation Model
by: Ravi, Selvan Sunitha, et al.
Published: (2024)
by: Ravi, Selvan Sunitha, et al.
Published: (2024)
The Case for Repeatable, Open, and Expert-Grounded Hallucination Benchmarks in Large Language Models
by: Norman, Justin D., et al.
Published: (2025)
by: Norman, Justin D., et al.
Published: (2025)
Humans and transformer LMs: Abstraction drives language learning
by: Jian, Jasper, et al.
Published: (2026)
by: Jian, Jasper, et al.
Published: (2026)
Marcel: A Lightweight and Open-Source Conversational Agent for University Student Support
by: Trienes, Jan, et al.
Published: (2025)
by: Trienes, Jan, et al.
Published: (2025)
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
by: Wu, Zhengxuan, et al.
Published: (2023)
by: Wu, Zhengxuan, et al.
Published: (2023)
Source or It Didn't Happen: A Multi-Agent Framework for Citation Hallucination Detection
by: Li, Mingzhe, et al.
Published: (2026)
by: Li, Mingzhe, et al.
Published: (2026)
Instruction Following without Instruction Tuning
by: Hewitt, John, et al.
Published: (2024)
by: Hewitt, John, et al.
Published: (2024)
Sneaking Syntax into Transformer Language Models with Tree Regularization
by: Nandi, Ananjan, et al.
Published: (2024)
by: Nandi, Ananjan, et al.
Published: (2024)
ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices
by: Vathul, Aneesh, et al.
Published: (2025)
by: Vathul, Aneesh, et al.
Published: (2025)
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
by: Hong, Giwon, et al.
Published: (2024)
by: Hong, Giwon, et al.
Published: (2024)
Hallucination Detection via Activations of Open-Weight Proxy Analyzers
by: Singh, Akshita, et al.
Published: (2026)
by: Singh, Akshita, et al.
Published: (2026)
Hallucination Detection and Hallucination Mitigation: An Investigation
by: Luo, Junliang, et al.
Published: (2024)
by: Luo, Junliang, et al.
Published: (2024)
Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models
by: Das, Anindya Bijoy, et al.
Published: (2025)
by: Das, Anindya Bijoy, et al.
Published: (2025)
HARP: Hallucination Detection via Reasoning Subspace Projection
by: Hu, Junjie, et al.
Published: (2025)
by: Hu, Junjie, et al.
Published: (2025)
HalluCiteChecker: A Lightweight Toolkit for Hallucinated Citation Detection and Verification in the Era of AI Scientists
by: Sakai, Yusuke, et al.
Published: (2026)
by: Sakai, Yusuke, et al.
Published: (2026)
HalluDetect: Detecting, Mitigating, and Benchmarking Hallucinations in Conversational Systems in the Legal Domain
by: Anaokar, Spandan, et al.
Published: (2025)
by: Anaokar, Spandan, et al.
Published: (2025)
LUMINA: Detecting Hallucinations in RAG System with Context-Knowledge Signals
by: Yeh, Samuel, et al.
Published: (2025)
by: Yeh, Samuel, et al.
Published: (2025)
MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems
by: Sok, Channdeth, et al.
Published: (2025)
by: Sok, Channdeth, et al.
Published: (2025)
Improved Representation Steering for Language Models
by: Wu, Zhengxuan, et al.
Published: (2025)
by: Wu, Zhengxuan, et al.
Published: (2025)
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
by: Zhong, Zexuan, et al.
Published: (2023)
by: Zhong, Zexuan, et al.
Published: (2023)
Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)
Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations
by: Agarwal, Ananth, et al.
Published: (2025)
by: Agarwal, Ananth, et al.
Published: (2025)
NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild
by: Murty, Shikhar, et al.
Published: (2024)
by: Murty, Shikhar, et al.
Published: (2024)
LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
by: Li, Jinxin, et al.
Published: (2025)
by: Li, Jinxin, et al.
Published: (2025)
LLM Hallucination Detection: HSAD
by: Li, JinXin, et al.
Published: (2025)
by: Li, JinXin, et al.
Published: (2025)
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation
by: Qin, Chengwei, et al.
Published: (2025)
by: Qin, Chengwei, et al.
Published: (2025)
Towards Long Context Hallucination Detection
by: Liu, Siyi, et al.
Published: (2025)
by: Liu, Siyi, et al.
Published: (2025)
Model Editing with Canonical Examples
by: Hewitt, John, et al.
Published: (2024)
by: Hewitt, John, et al.
Published: (2024)
Hallucination is Inevitable for LLMs with the Open World Assumption
by: Xu, Bowen
Published: (2025)
by: Xu, Bowen
Published: (2025)
Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems
by: Hikal, Baraa, et al.
Published: (2025)
by: Hikal, Baraa, et al.
Published: (2025)
Can LLMs Detect Their Own Hallucinations?
by: Kadotani, Sora, et al.
Published: (2025)
by: Kadotani, Sora, et al.
Published: (2025)
Statistical Uncertainty in Word Embeddings: GloVe-V
by: Vallebueno, Andrea, et al.
Published: (2024)
by: Vallebueno, Andrea, et al.
Published: (2024)
EduCoder: An Open-Source Annotation System for Education Transcript Data
by: Ashraf, Saad, et al.
Published: (2025)
by: Ashraf, Saad, et al.
Published: (2025)
Veracity: An Open-Source AI Fact-Checking System
by: Curtis, Taylor Lynn, et al.
Published: (2025)
by: Curtis, Taylor Lynn, et al.
Published: (2025)
Similar Items
-
Semgrex and Ssurgeon, Searching and Manipulating Dependency Graphs
by: Bauer, John, et al.
Published: (2024) -
Do "English" Named Entity Recognizers Work Well on Global Englishes?
by: Shan, Alexander, et al.
Published: (2024) -
A New Pair of GloVes
by: Carlson, Riley, et al.
Published: (2025) -
Drop Dropout on Single-Epoch Language Model Pretraining
by: Liu, Houjun, et al.
Published: (2025) -
Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
by: Laitenberger, Alex, et al.
Published: (2025)