:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shan, Alex, Bauer, John, Manning, Christopher D.
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2505.04844
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Semgrex and Ssurgeon, Searching and Manipulating Dependency Graphs
by: Bauer, John, et al.
Published: (2024)

Do "English" Named Entity Recognizers Work Well on Global Englishes?
by: Shan, Alexander, et al.
Published: (2024)

A New Pair of GloVes
by: Carlson, Riley, et al.
Published: (2025)

Drop Dropout on Single-Epoch Language Model Pretraining
by: Liu, Houjun, et al.
Published: (2025)

Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
by: Laitenberger, Alex, et al.
Published: (2025)

Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools
by: Magesh, Varun, et al.
Published: (2024)

Lynx: An Open Source Hallucination Evaluation Model
by: Ravi, Selvan Sunitha, et al.
Published: (2024)

The Case for Repeatable, Open, and Expert-Grounded Hallucination Benchmarks in Large Language Models
by: Norman, Justin D., et al.
Published: (2025)

Humans and transformer LMs: Abstraction drives language learning
by: Jian, Jasper, et al.
Published: (2026)

Marcel: A Lightweight and Open-Source Conversational Agent for University Student Support
by: Trienes, Jan, et al.
Published: (2025)

ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
by: Wu, Zhengxuan, et al.
Published: (2023)

Source or It Didn't Happen: A Multi-Agent Framework for Citation Hallucination Detection
by: Li, Mingzhe, et al.
Published: (2026)

Instruction Following without Instruction Tuning
by: Hewitt, John, et al.
Published: (2024)

Sneaking Syntax into Transformer Language Models with Tree Regularization
by: Nandi, Ananjan, et al.
Published: (2024)

ShED-HD: A Shannon Entropy Distribution Framework for Lightweight Hallucination Detection on Edge Devices
by: Vathul, Aneesh, et al.
Published: (2025)

The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
by: Hong, Giwon, et al.
Published: (2024)

Hallucination Detection via Activations of Open-Weight Proxy Analyzers
by: Singh, Akshita, et al.
Published: (2026)

Hallucination Detection and Hallucination Mitigation: An Investigation
by: Luo, Junliang, et al.
Published: (2024)

Hallucinations and Key Information Extraction in Medical Texts: A Comprehensive Assessment of Open-Source Large Language Models
by: Das, Anindya Bijoy, et al.
Published: (2025)

HARP: Hallucination Detection via Reasoning Subspace Projection
by: Hu, Junjie, et al.
Published: (2025)

HalluCiteChecker: A Lightweight Toolkit for Hallucinated Citation Detection and Verification in the Era of AI Scientists
by: Sakai, Yusuke, et al.
Published: (2026)

HalluDetect: Detecting, Mitigating, and Benchmarking Hallucinations in Conversational Systems in the Legal Domain
by: Anaokar, Spandan, et al.
Published: (2025)

LUMINA: Detecting Hallucinations in RAG System with Context-Knowledge Signals
by: Yeh, Samuel, et al.
Published: (2025)

MetaRAG: Metamorphic Testing for Hallucination Detection in RAG Systems
by: Sok, Channdeth, et al.
Published: (2025)

Improved Representation Steering for Language Models
by: Wu, Zhengxuan, et al.
Published: (2025)

MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
by: Zhong, Zexuan, et al.
Published: (2023)

Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)

Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations
by: Agarwal, Ananth, et al.
Published: (2025)

NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild
by: Murty, Shikhar, et al.
Published: (2024)

LLM Hallucination Detection: A Fast Fourier Transform Method Based on Hidden Layer Temporal Signals
by: Li, Jinxin, et al.
Published: (2025)

LLM Hallucination Detection: HSAD
by: Li, JinXin, et al.
Published: (2025)

Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation
by: Qin, Chengwei, et al.
Published: (2025)

Towards Long Context Hallucination Detection
by: Liu, Siyi, et al.
Published: (2025)

Model Editing with Canonical Examples
by: Hewitt, John, et al.
Published: (2024)

Hallucination is Inevitable for LLMs with the Open World Assumption
by: Xu, Bowen
Published: (2025)

Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems
by: Hikal, Baraa, et al.
Published: (2025)

Can LLMs Detect Their Own Hallucinations?
by: Kadotani, Sora, et al.
Published: (2025)

Statistical Uncertainty in Word Embeddings: GloVe-V
by: Vallebueno, Andrea, et al.
Published: (2024)

EduCoder: An Open-Source Annotation System for Education Transcript Data
by: Ashraf, Saad, et al.
Published: (2025)

Veracity: An Open-Source AI Fact-Checking System
by: Curtis, Taylor Lynn, et al.
Published: (2025)