Saved in:
| Main Authors: | Sun, Chenghao, Zhang, Chengsheng, Qin, Guanzheng, Dai, Rui, Tian, Xinmei |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.13694 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Interpreting and Enhancing Emotional Circuits in Large Vision-Language Models via Cross-Modal Information Flow
by: Zhang, Chengsheng, et al.
Published: (2026)
by: Zhang, Chengsheng, et al.
Published: (2026)
Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
by: Zhang, Chengsheng, et al.
Published: (2026)
by: Zhang, Chengsheng, et al.
Published: (2026)
MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems
by: Cai, Qianshu, et al.
Published: (2026)
by: Cai, Qianshu, et al.
Published: (2026)
Towards Theoretical Understandings of Self-Consuming Generative Models
by: Fu, Shi, et al.
Published: (2024)
by: Fu, Shi, et al.
Published: (2024)
Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power
by: Chen, Yuzhu, et al.
Published: (2025)
by: Chen, Yuzhu, et al.
Published: (2025)
MSCR: Exploring the Vulnerability of LLMs' Mathematical Reasoning Abilities Using Multi-Source Candidate Replacement
by: Sun, Zhishen, et al.
Published: (2025)
by: Sun, Zhishen, et al.
Published: (2025)
Localized Definitions and Distributed Reasoning: A Proof-of-Concept Mechanistic Interpretability Study via Activation Patching
by: Bahador, Nooshin
Published: (2025)
by: Bahador, Nooshin
Published: (2025)
LLMIA: An Out-of-the-Box Index Advisor via In-Context Learning with LLMs
by: Zhao, Xinxin, et al.
Published: (2025)
by: Zhao, Xinxin, et al.
Published: (2025)
ME-Mamba: Multi-Expert Mamba with Efficient Knowledge Capture and Fusion for Multimodal Survival Analysis
by: Zhang, Chengsheng, et al.
Published: (2025)
by: Zhang, Chengsheng, et al.
Published: (2025)
Exploring Large Language Models for Knowledge Graph Completion
by: Yao, Liang, et al.
Published: (2023)
by: Yao, Liang, et al.
Published: (2023)
ATIR: Towards Audio-Text Interleaved Contextual Retrieval
by: Zhao, Tong, et al.
Published: (2026)
by: Zhao, Tong, et al.
Published: (2026)
PKU-SafeRLHF: Towards Multi-Level Safety Alignment for LLMs with Human Preference
by: Ji, Jiaming, et al.
Published: (2024)
by: Ji, Jiaming, et al.
Published: (2024)
A Theoretical Perspective: How to Prevent Model Collapse in Self-consuming Training Loops
by: Fu, Shi, et al.
Published: (2025)
by: Fu, Shi, et al.
Published: (2025)
PatchFinder: A Two-Phase Approach to Security Patch Tracing for Disclosed Vulnerabilities in Open-Source Software
by: Li, Kaixuan, et al.
Published: (2024)
by: Li, Kaixuan, et al.
Published: (2024)
Intrinsic Self-Correction in LLMs: Towards Explainable Prompting via Mechanistic Interpretability
by: Lee, Yu-Ting, et al.
Published: (2025)
by: Lee, Yu-Ting, et al.
Published: (2025)
UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility
by: Tian, Yonglin, et al.
Published: (2025)
by: Tian, Yonglin, et al.
Published: (2025)
SUDO: a framework for evaluating clinical artificial intelligence systems without ground-truth annotations
by: Kiyasseh, Dani, et al.
Published: (2024)
by: Kiyasseh, Dani, et al.
Published: (2024)
How Emotion Shapes the Behavior of LLMs and Agents: A Mechanistic Study
by: Sun, Moran, et al.
Published: (2026)
by: Sun, Moran, et al.
Published: (2026)
Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning
by: Pu, Rui, et al.
Published: (2025)
by: Pu, Rui, et al.
Published: (2025)
LocalBench: Benchmarking LLMs on County-Level Local Knowledge and Reasoning
by: Gao, Zihan, et al.
Published: (2025)
by: Gao, Zihan, et al.
Published: (2025)
RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty
by: Zhang, Ziqian, et al.
Published: (2026)
by: Zhang, Ziqian, et al.
Published: (2026)
HyperDAS: Towards Automating Mechanistic Interpretability with Hypernetworks
by: Sun, Jiuding, et al.
Published: (2025)
by: Sun, Jiuding, et al.
Published: (2025)
Towards Foundation Models for Zero-Shot Time Series Anomaly Detection: Leveraging Synthetic Data and Relative Context Discrepancy
by: Lan, Tian, et al.
Published: (2025)
by: Lan, Tian, et al.
Published: (2025)
Dissecting Bias in LLMs: A Mechanistic Interpretability Perspective
by: Chandna, Bhavik, et al.
Published: (2025)
by: Chandna, Bhavik, et al.
Published: (2025)
Expand Heterogeneous Learning Systems with Selective Multi-Source Knowledge Fusion
by: Dai, Gaole, et al.
Published: (2024)
by: Dai, Gaole, et al.
Published: (2024)
A Picture is Worth A Thousand Numbers: Enabling LLMs Reason about Time Series via Visualization
by: Liu, Haoxin, et al.
Published: (2024)
by: Liu, Haoxin, et al.
Published: (2024)
How Post-Training Reshapes LLMs: A Mechanistic View on Knowledge, Truthfulness, Refusal, and Confidence
by: Du, Hongzhe, et al.
Published: (2025)
by: Du, Hongzhe, et al.
Published: (2025)
APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training
by: Qin, Jiarui, et al.
Published: (2025)
by: Qin, Jiarui, et al.
Published: (2025)
Epistemic Uncertainty for Generated Image Detection
by: Nie, Jun, et al.
Published: (2024)
by: Nie, Jun, et al.
Published: (2024)
Towards Autonomous Mechanistic Reasoning in Virtual Cells
by: Jang, Yunhui, et al.
Published: (2026)
by: Jang, Yunhui, et al.
Published: (2026)
Matryoshka Pilot: Learning to Drive Black-Box LLMs with LLMs
by: Li, Changhao, et al.
Published: (2024)
by: Li, Changhao, et al.
Published: (2024)
ATime-Consistent Benchmark for Repository-Level Software Engineering Evaluation
by: Xianpeng, et al.
Published: (2026)
by: Xianpeng, et al.
Published: (2026)
AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering
by: Wang, Ziqing, et al.
Published: (2025)
by: Wang, Ziqing, et al.
Published: (2025)
D$^2$Quant: Accurate Low-bit Post-Training Weight Quantization for LLMs
by: Yan, Xianglong, et al.
Published: (2026)
by: Yan, Xianglong, et al.
Published: (2026)
Towards Verifiable Multimodal Deep Research: A Multi-Agent Harness for Interleaved Report Generation
by: Zhang, Chenghao, et al.
Published: (2026)
by: Zhang, Chenghao, et al.
Published: (2026)
Hierarchical Repository-Level Code Summarization for Business Applications Using Local LLMs
by: Dhulshette, Nilesh, et al.
Published: (2025)
by: Dhulshette, Nilesh, et al.
Published: (2025)
Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability
by: Raimondi, Bianca, et al.
Published: (2025)
by: Raimondi, Bianca, et al.
Published: (2025)
Safety at One Shot: Patching Fine-Tuned LLMs with A Single Instance
by: Zhang, Jiawen, et al.
Published: (2026)
by: Zhang, Jiawen, et al.
Published: (2026)
Towards Adapting Open-Source Large Language Models for Expert-Level Clinical Note Generation
by: Wang, Hanyin, et al.
Published: (2024)
by: Wang, Hanyin, et al.
Published: (2024)
Patched RTC: evaluating LLMs for diverse software development tasks
by: Sharma, Asankhaya
Published: (2024)
by: Sharma, Asankhaya
Published: (2024)
Similar Items
-
Interpreting and Enhancing Emotional Circuits in Large Vision-Language Models via Cross-Modal Information Flow
by: Zhang, Chengsheng, et al.
Published: (2026) -
Prefill-Time Intervention for Mitigating Hallucination in Large Vision-Language Models
by: Zhang, Chengsheng, et al.
Published: (2026) -
MOSS: Self-Evolution through Source-Level Rewriting in Autonomous Agent Systems
by: Cai, Qianshu, et al.
Published: (2026) -
Towards Theoretical Understandings of Self-Consuming Generative Models
by: Fu, Shi, et al.
Published: (2024) -
Drawback of Enforcing Equivariance and its Compensation via the Lens of Expressive Power
by: Chen, Yuzhu, et al.
Published: (2025)