:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhang, Zhengxuan, Liang, Zhuowen, Wu, Yin, Lin, Teng, Luo, Yuyu, Tang, Nan
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2504.10036
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs
by: Liang, Zhuowen, et al.
Published: (2026)

EXCLAIM: An Explainable Cross-Modal Agentic System for Misinformation Detection with Hierarchical Retrieval
by: Wu, Yin, et al.
Published: (2025)

Fine-Grained Knowledge Structuring and Retrieval for Visual Question Answering
by: Zhang, Zhengxuan, et al.
Published: (2025)

SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph
by: Lin, Teng, et al.
Published: (2025)

MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
by: Lin, Teng, et al.
Published: (2025)

DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering
by: Lin, Teng, et al.
Published: (2026)

In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
by: Chen, Shiqi, et al.
Published: (2024)

SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values
by: Zhang, Yunfan, et al.
Published: (2024)

TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
by: Wu, Bingheng, et al.
Published: (2025)

LLMs Encode Harmfulness and Refusal Separately
by: Zhao, Jiachen, et al.
Published: (2025)

TableTale: Reviving the Narrative Interplay Between Data Tables and Text in Scientific Papers
by: Wang, Liangwei, et al.
Published: (2026)

Are Large Language Models Good Statisticians?
by: Zhu, Yizhang, et al.
Published: (2024)

ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering
by: Wu, Yifan, et al.
Published: (2024)

EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing
by: Zhu, Yizhang, et al.
Published: (2025)

LightKGG: Simple and Efficient Knowledge Graph Generation from Textual Data
by: Lin, Teng
Published: (2025)

AnnoRetrieve: Efficient Structured Retrieval for Unstructured Document Analysis
by: Lin, Teng, et al.
Published: (2026)

Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies
by: Wu, Zhengxuan, et al.
Published: (2022)

RoleBreak: Character Hallucination as a Jailbreak Attack in Role-Playing Systems
by: Tang, Yihong, et al.
Published: (2024)

D$^2$HScore: Reasoning-Aware Hallucination Detection via Semantic Breadth and Depth Analysis in LLMs
by: Ding, Yue, et al.
Published: (2025)

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs
by: Zhou, Wei, et al.
Published: (2026)

Challenging Multilingual LLMs: A New Taxonomy and Benchmark for Unraveling Hallucination in Translation
by: Wu, Xinwei, et al.
Published: (2025)

ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
by: Wu, Zhengxuan, et al.
Published: (2023)

Shared Imagination: LLMs Hallucinate Alike
by: Zhou, Yilun, et al.
Published: (2024)

Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting
by: Tan, Chenchen, et al.
Published: (2025)

Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
by: Chen, Qi, et al.
Published: (2024)

HART: Data-Driven Hallucination Attribution and Evidence-Based Tracing for Large Language Models
by: Liang, Shize, et al.
Published: (2026)

SHARP: Unlocking Interactive Hallucination via Stance Transfer in Role-Playing LLMs
by: Kong, Chuyi, et al.
Published: (2024)

Can LLMs Generate and Solve Linguistic Olympiad Puzzles?
by: Majmudar, Neh, et al.
Published: (2025)

Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
by: Liang, Yuxin, et al.
Published: (2024)

Understanding New-Knowledge-Induced Factual Hallucinations in LLMs: Analysis and Interpretation
by: Dang, Renfei, et al.
Published: (2025)

Where Fake Citations Are Made: Tracing Field-Level Hallucination to Specific Neurons in LLMs
by: Chen, Yuefei, et al.
Published: (2026)

nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning
by: Luo, Tianqi, et al.
Published: (2025)

How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs
by: Wen-Yi, Andrea W, et al.
Published: (2024)

Atom of Thoughts for Markov LLM Test-Time Scaling
by: Teng, Fengwei, et al.
Published: (2025)

A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis
by: Gao, Xin, et al.
Published: (2025)

Geometric Uncertainty for Detecting and Correcting Hallucinations in LLMs
by: Phillips, Edward, et al.
Published: (2025)

NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification
by: Huang, Hongfei, et al.
Published: (2024)

PlotCraft: Pushing the Limits of LLMs for Complex and Interactive Data Visualization
by: Zhang, Jiajun, et al.
Published: (2025)

Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
by: Fang, Yi, et al.
Published: (2024)

DCR: Quantifying Data Contamination in LLMs Evaluation
by: Xu, Cheng, et al.
Published: (2025)