| Main Authors: | Wang, Hanyu, Cao, Yuanpu, Lin, Lu, Chen, Jinghui |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.07187 |
Similar Items
The Illusion of Reasoning: Exposing Evasive Data Contamination in LLMs via Zero-CoT Truncation
by: Lan, Yifan, et al.
Published: (2026)
Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections
by: Cao, Yuanpu, et al.
Published: (2023)
TruthFlow: Truthful LLM Generation via Representation Flow Correction
by: Wang, Hanyu, et al.
Published: (2025)
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
by: Cao, Bochuan, et al.
Published: (2023)
AdvI2I: Adversarial Image Attack on Image-to-Image Diffusion models
by: Zeng, Yaopei, et al.
Published: (2024)
Restoring the Sweet Spot: Pass-Rate Weighted Self-Distillation for LLM Reasoning
by: Liu, Zehao, et al.
Published: (2026)
You Can't Steal Nothing: Mitigating Prompt Leakages in LLMs via System Vectors
by: Cao, Bochuan, et al.
Published: (2025)
In Prospect and Retrospect: Reflective Memory Management for Long-term Personalized Dialogue Agents
by: Tan, Zhen, et al.
Published: (2025)
ParaBlock: Communication-Computation Parallel Block Coordinate Federated Learning for Large Language Models
by: Wang, Yujia, et al.
Published: (2025)
Adversarially Robust Industrial Anomaly Detection Through Diffusion Model
by: Cao, Yuanpu, et al.
Published: (2024)
ReFlect: An Effective Harness System for Complex Long-Horizon LLM Reasoning
by: Huang, Fan
Published: (2026)
Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time
by: Lan, Yifan, et al.
Published: (2025)
How Large Language Models Need Symbolism
by: Deng, Xiaotie, et al.
Published: (2025)
Graph-Augmented Large Language Model Agents: Current Progress and Future Prospects
by: Liu, Yixin, et al.
Published: (2025)
Diverse Human Value Alignment for Large Language Models via Ethical Reasoning
by: Wang, Jiahao, et al.
Published: (2025)
Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models
by: Tang, Lei, et al.
Published: (2025)
Exploring Large Language Model based Intelligent Agents: Definitions, Methods, and Prospects
by: Cheng, Yuheng, et al.
Published: (2024)
Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization
by: Yao, Weiran, et al.
Published: (2023)
APD-Agents: A Large Language Model-Driven Multi-Agents Collaborative Framework for Automated Page Design
by: Chen, Xinpeng, et al.
Published: (2025)
PaDeLLM-NER: Parallel Decoding in Large Language Models for Named Entity Recognition
by: Lu, Jinghui, et al.
Published: (2024)
ShallowJail: Steering Jailbreaks against Large Language Models
by: Liu, Shang, et al.
Published: (2026)
Your Agent Can Defend Itself against Backdoor Attacks
by: Changjiang, Li, et al.
Published: (2025)
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training
by: Yuan, Siyu, et al.
Published: (2025)
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
by: Cao, Yuanpu, et al.
Published: (2024)
Efficient Reasoning for Large Reasoning Language Models via Certainty-Guided Reflection Suppression
by: Huang, Jiameng, et al.
Published: (2025)
GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models
by: Wang, Zhijie
Published: (2026)
Learning Dynamics in Continual Pre-Training for Large Language Models
by: Wang, Xingjin, et al.
Published: (2025)
Large Language Models as Theory of Mind Aware Generative Agents with Counterfactual Reflection
by: Yang, Bo, et al.
Published: (2025)
Reflection-Bench: Evaluating Epistemic Agency in Large Language Models
by: Li, Lingyu, et al.
Published: (2024)
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution
by: Ye, Haoran, et al.
Published: (2024)
Regurgitative Training: The Value of Real Data in Training Large Language Models
by: Zhang, Jinghui, et al.
Published: (2024)
A Novel Multi-Agent Architecture to Reduce Hallucinations of Large Language Models in Multi-Step Structural Modeling
by: Geng, Ziheng, et al.
Published: (2026)
A Large Language Model-Empowered Agent for Reliable and Robust Structural Analysis
by: Liu, Jiachen, et al.
Published: (2025)
Towards Robust Multimodal Large Language Models Against Jailbreak Attacks
by: Yin, Ziyi, et al.
Published: (2025)
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
by: Hu, Ming, et al.
Published: (2025)
The Forecast Critic: Leveraging Large Language Models for Poor Forecast Identification
by: Bhan, Luke, et al.
Published: (2025)
Tandem: Riding Together with Large and Small Language Models for Efficient Reasoning
by: Fu, Zichuan, et al.
Published: (2026)
Aligning Agents like Large Language Models
by: Jelley, Adam, et al.
Published: (2024)
From Correction to Mastery: Reinforced Distillation of Large Language Model Agents
by: Lyu, Yuanjie, et al.
Published: (2025)
InsurAgent: A Large Language Model-Empowered Agent for Simulating Individual Behavior in Purchasing Flood Insurance
by: Geng, Ziheng, et al.
Published: (2025)