Saved in:
| Main Authors: | Li, Zherui, Mi, Yan, Zhou, Zhenhong, Jiang, Houcheng, Zhang, Guibin, Wang, Kun, Fang, Junfeng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.00509 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reinforced Lifelong Editing for Language Models
by: Li, Zherui, et al.
Published: (2025)
by: Li, Zherui, et al.
Published: (2025)
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models
by: Zhou, Zhenhong, et al.
Published: (2025)
by: Zhou, Zhenhong, et al.
Published: (2025)
MemEvolve: Meta-Evolution of Agent Memory Systems
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
Jailbreaking Large Language Diffusion Models: Revealing Hidden Safety Flaws in Diffusion-Based Text Generation
by: Zhang, Yuanhe, et al.
Published: (2025)
by: Zhang, Yuanhe, et al.
Published: (2025)
Multi-agent Architecture Search via Agentic Supernet
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
Omni-Safety under Cross-Modality Conflict: Vulnerabilities, Dynamics Mechanisms and Efficient Alignment
by: Wang, Kun, et al.
Published: (2026)
by: Wang, Kun, et al.
Published: (2026)
Neuron-Level Sequential Editing for Large Language Models
by: Jiang, Houcheng, et al.
Published: (2024)
by: Jiang, Houcheng, et al.
Published: (2024)
G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
LatentEvolve: Self-Evolving Test-Time Scaling in Latent Space
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
Speak Out of Turn: Safety Vulnerability of Large Language Models in Multi-turn Dialogue
by: Zhou, Zhenhong, et al.
Published: (2024)
by: Zhou, Zhenhong, et al.
Published: (2024)
SOD: Step-wise On-policy Distillation for Small Language Model Agents
by: Zhong, Qiyong, et al.
Published: (2026)
by: Zhong, Qiyong, et al.
Published: (2026)
UniVLR: Unifying Text and Vision in Visual Latent Reasoning for Multimodal LLMs
by: Jiang, Houcheng, et al.
Published: (2026)
by: Jiang, Houcheng, et al.
Published: (2026)
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
by: Fang, Junfeng, et al.
Published: (2024)
by: Fang, Junfeng, et al.
Published: (2024)
Mem-W: Latent Memory-Native GUI Agents
by: Zhang, Guibin, et al.
Published: (2026)
by: Zhang, Guibin, et al.
Published: (2026)
AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
by: Zhang, Yuanhe, et al.
Published: (2025)
by: Zhang, Yuanhe, et al.
Published: (2025)
DualEdit: Mitigating Safety Fallback in LLM Backdoor Editing via Affirmation-Refusal Regulation
by: Jiang, Houcheng, et al.
Published: (2025)
by: Jiang, Houcheng, et al.
Published: (2025)
AnyEdit: Edit Any Knowledge Encoded in Language Models
by: Jiang, Houcheng, et al.
Published: (2025)
by: Jiang, Houcheng, et al.
Published: (2025)
DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models
by: Li, Zherui, et al.
Published: (2025)
by: Li, Zherui, et al.
Published: (2025)
MemGen: Weaving Generative Latent Memory for Self-Evolving Agents
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
Contrastive Weak-to-strong Generalization
by: Jiang, Houcheng, et al.
Published: (2025)
by: Jiang, Houcheng, et al.
Published: (2025)
HarnessForge: Joint Harness and Policy Evolution for Adaptive Agent Systems
by: Chen, Mingju, et al.
Published: (2026)
by: Chen, Mingju, et al.
Published: (2026)
MAS$^2$: Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems
by: Wang, Kun, et al.
Published: (2025)
by: Wang, Kun, et al.
Published: (2025)
CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns
by: Zhou, Zhenhong, et al.
Published: (2026)
by: Zhou, Zhenhong, et al.
Published: (2026)
UniErase: Towards Balanced and Precise Unlearning in Language Models
by: Yu, Miao, et al.
Published: (2025)
by: Yu, Miao, et al.
Published: (2025)
On the Role of Attention Heads in Large Language Model Safety
by: Zhou, Zhenhong, et al.
Published: (2024)
by: Zhou, Zhenhong, et al.
Published: (2024)
LIFEBench: Evaluating Length Instruction Following in Large Language Models
by: Zhang, Wei, et al.
Published: (2025)
by: Zhang, Wei, et al.
Published: (2025)
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
by: Xue, Xiangyuan, et al.
Published: (2025)
by: Xue, Xiangyuan, et al.
Published: (2025)
TodoEvolve: Learning to Architect Agent Planning Systems
by: Liu, Jiaxi, et al.
Published: (2026)
by: Liu, Jiaxi, et al.
Published: (2026)
CoTGuard: Using Chain-of-Thought Triggering for Copyright Protection in Multi-Agent LLM Systems
by: Wen, Yan, et al.
Published: (2025)
by: Wen, Yan, et al.
Published: (2025)
EvoRoute: Experience-Driven Self-Routing LLM Agent Systems
by: Zhang, Guibin, et al.
Published: (2026)
by: Zhang, Guibin, et al.
Published: (2026)
Diagnose, Localize, Align: A Full-Stack Framework for Reliable LLM Multi-Agent Systems under Instruction Conflicts
by: Wan, Guancheng, et al.
Published: (2025)
by: Wan, Guancheng, et al.
Published: (2025)
Latent Causal Void: Explicit Missing-Context Reconstruction for Misinformation Detection
by: Li, Hui, et al.
Published: (2026)
by: Li, Hui, et al.
Published: (2026)
Automotive-ENV: Benchmarking Multimodal Agents in Vehicle Interface Systems
by: Yan, Junfeng, et al.
Published: (2025)
by: Yan, Junfeng, et al.
Published: (2025)
RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-Checking
by: Yang, Shuo, et al.
Published: (2025)
by: Yang, Shuo, et al.
Published: (2025)
Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation
by: Anik, Anirban Saha, et al.
Published: (2025)
by: Anik, Anirban Saha, et al.
Published: (2025)
SpeechAgents: Human-Communication Simulation with Multi-Modal Multi-Agent Systems
by: Zhang, Dong, et al.
Published: (2024)
by: Zhang, Dong, et al.
Published: (2024)
LARFT: Closing the Cognition-Action Gap for Length Instruction Following in Large Language Models
by: Zhang, Wei, et al.
Published: (2026)
by: Zhang, Wei, et al.
Published: (2026)
MCPShield: A Security Cognition Layer for Adaptive Trust Calibration in Model Context Protocol Agents
by: Zhou, Zhenhong, et al.
Published: (2026)
by: Zhou, Zhenhong, et al.
Published: (2026)
TDAG: A Multi-Agent Framework based on Dynamic Task Decomposition and Agent Generation
by: Wang, Yaoxiang, et al.
Published: (2024)
by: Wang, Yaoxiang, et al.
Published: (2024)
Similar Items
-
Reinforced Lifelong Editing for Language Models
by: Li, Zherui, et al.
Published: (2025) -
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models
by: Zhou, Zhenhong, et al.
Published: (2025) -
MemEvolve: Meta-Evolution of Agent Memory Systems
by: Zhang, Guibin, et al.
Published: (2025) -
Jailbreaking Large Language Diffusion Models: Revealing Hidden Safety Flaws in Diffusion-Based Text Generation
by: Zhang, Yuanhe, et al.
Published: (2025) -
Multi-agent Architecture Search via Agentic Supernet
by: Zhang, Guibin, et al.
Published: (2025)