| Main Authors: | Zhang, Zhengxuan, Liang, Zhuowen, Wu, Yin, Lin, Teng, Luo, Yuyu, Tang, Nan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.10036 |
Similar Items
Long-Document QA with Chain-of-Structured-Thought and Fine-Tuned SLMs
by: Liang, Zhuowen, et al.
Published: (2026)
EXCLAIM: An Explainable Cross-Modal Agentic System for Misinformation Detection with Hierarchical Retrieval
by: Wu, Yin, et al.
Published: (2025)
Fine-Grained Knowledge Structuring and Retrieval for Visual Question Answering
by: Zhang, Zhengxuan, et al.
Published: (2025)
SRAG: Structured Retrieval-Augmented Generation for Multi-Entity Question Answering over Wikipedia Graph
by: Lin, Teng, et al.
Published: (2025)
MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
by: Lin, Teng, et al.
Published: (2025)
DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering
by: Lin, Teng, et al.
Published: (2026)
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
by: Chen, Shiqi, et al.
Published: (2024)
SketchFill: Sketch-Guided Code Generation for Imputing Derived Missing Values
by: Zhang, Yunfan, et al.
Published: (2024)
TransXSSM: A Hybrid Transformer State Space Model with Unified Rotary Position Embedding
by: Wu, Bingheng, et al.
Published: (2025)
LLMs Encode Harmfulness and Refusal Separately
by: Zhao, Jiachen, et al.
Published: (2025)
TableTale: Reviving the Narrative Interplay Between Data Tables and Text in Scientific Papers
by: Wang, Liangwei, et al.
Published: (2026)
Are Large Language Models Good Statisticians?
by: Zhu, Yizhang, et al.
Published: (2024)
ChartInsights: Evaluating Multimodal Large Language Models for Low-Level Chart Question Answering
by: Wu, Yifan, et al.
Published: (2024)
EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing
by: Zhu, Yizhang, et al.
Published: (2025)
LightKGG: Simple and Efficient Knowledge Graph Generation from Textual Data
by: Lin, Teng
Published: (2025)
AnnoRetrieve: Efficient Structured Retrieval for Unstructured Document Analysis
by: Lin, Teng, et al.
Published: (2026)
Oolong: Investigating What Makes Transfer Learning Hard with Controlled Studies
by: Wu, Zhengxuan, et al.
Published: (2022)
RoleBreak: Character Hallucination as a Jailbreak Attack in Role-Playing Systems
by: Tang, Yihong, et al.
Published: (2024)
D$^2$HScore: Reasoning-Aware Hallucination Detection via Semantic Breadth and Depth Analysis in LLMs
by: Ding, Yue, et al.
Published: (2025)
Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs
by: Zhou, Wei, et al.
Published: (2026)
Challenging Multilingual LLMs: A New Taxonomy and Benchmark for Unraveling Hallucination in Translation
by: Wu, Xinwei, et al.
Published: (2025)
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
by: Wu, Zhengxuan, et al.
Published: (2023)
Shared Imagination: LLMs Hallucinate Alike
by: Zhou, Yilun, et al.
Published: (2024)
Wisdom is Knowing What not to Say: Hallucination-Free LLMs Unlearning via Attention Shifting
by: Tan, Chenchen, et al.
Published: (2025)
Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
by: Chen, Qi, et al.
Published: (2024)
HART: Data-Driven Hallucination Attribution and Evidence-Based Tracing for Large Language Models
by: Liang, Shize, et al.
Published: (2026)
SHARP: Unlocking Interactive Hallucination via Stance Transfer in Role-Playing LLMs
by: Kong, Chuyi, et al.
Published: (2024)
Can LLMs Generate and Solve Linguistic Olympiad Puzzles?
by: Majmudar, Neh, et al.
Published: (2025)
Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation
by: Liang, Yuxin, et al.
Published: (2024)
Understanding New-Knowledge-Induced Factual Hallucinations in LLMs: Analysis and Interpretation
by: Dang, Renfei, et al.
Published: (2025)
Where Fake Citations Are Made: Tracing Field-Level Hallucination to Specific Neurons in LLMs
by: Chen, Yuefei, et al.
Published: (2026)
nvBench 2.0: Resolving Ambiguity in Text-to-Visualization through Stepwise Reasoning
by: Luo, Tianqi, et al.
Published: (2025)
How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs
by: Wen-Yi, Andrea W, et al.
Published: (2024)
Atom of Thoughts for Markov LLM Test-Time Scaling
by: Teng, Fengwei, et al.
Published: (2025)
A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis
by: Gao, Xin, et al.
Published: (2025)
Geometric Uncertainty for Detecting and Correcting Hallucinations in LLMs
by: Phillips, Edward, et al.
Published: (2025)
NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification
by: Huang, Hongfei, et al.
Published: (2024)
PlotCraft: Pushing the Limits of LLMs for Complex and Interactive Data Visualization
by: Zhang, Jiajun, et al.
Published: (2025)
Counterfactual Debating with Preset Stances for Hallucination Elimination of LLMs
by: Fang, Yi, et al.
Published: (2024)
DCR: Quantifying Data Contamination in LLMs Evaluation
by: Xu, Cheng, et al.
Published: (2025)