| Main authors: | Wang, Peng; Zhou, Biyu; Tang, Xuehai; Han, Jizhong; Hu, Songlin |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2505.15702 |
Similar items
FABLE: Fine-grained Fact Anchoring for Unstructured Model Editing
by: Wang, Peng, et al.
Published: (2026)
Exploiting Synergistic Cognitive Biases to Bypass Safety in LLMs
by: Yang, Xikang, et al.
Published: (2025)
Paper Summary Attack: Jailbreaking LLMs through LLM Safety Papers
by: Lin, Liang, et al.
Published: (2025)
Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM
by: Yang, Xikang, et al.
Published: (2024)
RouteGuard: Internal-Signal Detection of Skill Poisoning in LLM Agents
by: Xiao, Wenjie, et al.
Published: (2026)
The Dark Side of Trust: Authority Citation-Driven Jailbreak Attacks on Large Language Models
by: Yang, Xikang, et al.
Published: (2024)
AdaPPA: Adaptive Position Pre-Fill Jailbreak Attack Approach Targeting LLMs
by: Lv, Lijia, et al.
Published: (2024)
Enhancing Multi-Hop Fact Verification with Structured Knowledge-Augmented Large Language Models
by: Cao, Han, et al.
Published: (2025)
Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens
by: Yang, Xikang, et al.
Published: (2024)
The Labyrinth and the Thread: Rethinking Regularizations in Sequential Knowledge Editing for Large Language Models
by: Wang, Zheng, et al.
Published: (2026)
Neuron-Level Sequential Editing for Large Language Models
by: Jiang, Houcheng, et al.
Published: (2024)
On the Superimposed Noise Accumulation Problem in Sequential Knowledge Editing of Large Language Models
by: Cao, Ding, et al.
Published: (2025)
Disentangling Knowledge Representations for Large Language Model Editing
by: Zhang, Mengqi, et al.
Published: (2025)
Knowledge in Superposition: Unveiling the Failures of Lifelong Knowledge Editing for Large Language Models
by: Hu, Chenhui, et al.
Published: (2024)
Structured Security Auditing and Robustness Enhancement for Untrusted Agent Skills
by: Lv, Lijia, et al.
Published: (2026)
DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language Models
by: Zhang, Taolin, et al.
Published: (2024)
Benchmarking and Rethinking Knowledge Editing for Large Language Models
by: He, Guoxiu, et al.
Published: (2025)
Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
by: Hsueh, Cheng-Hsun, et al.
Published: (2024)
Stable Knowledge Editing in Large Language Models
by: Wei, Zihao, et al.
Published: (2024)
Fine-Grained Behavior Simulation with Role-Playing Large Language Model on Social Media
by: Li, Kun, et al.
Published: (2024)
Energy-Regularized Sequential Model Editing on Hyperspheres
by: Liu, Qingyuan, et al.
Published: (2025)
Aligning Language Models with Real-time Knowledge Editing
by: Tang, Chenming, et al.
Published: (2025)
Spectral Characterization and Mitigation of Sequential Knowledge Editing Collapse
by: Zhang, Chi, et al.
Published: (2026)
Diagnosing Retrieval Bias Under Multiple In-Context Knowledge Updates in Large Language Models
by: Qiao, Boyu, et al.
Published: (2026)
VLKEB: A Large Vision-Language Model Knowledge Editing Benchmark
by: Huang, Han, et al.
Published: (2024)
Cross-Lingual Knowledge Editing in Large Language Models
by: Wang, Jiaan, et al.
Published: (2023)
Knowledge Editing for Large Language Models: A Survey
by: Wang, Song, et al.
Published: (2023)
Knowledge Graph Enhanced Large Language Model Editing
by: Zhang, Mengqi, et al.
Published: (2024)
Neighboring Perturbations of Knowledge Editing on Large Language Models
by: Ma, Jun-Yu, et al.
Published: (2024)
GeoEdit: Geometric Knowledge Editing for Large Language Models
by: Feng, Yujie, et al.
Published: (2025)
Navigating the Dual Facets: A Comprehensive Evaluation of Sequential Memory Editing in Large Language Models
by: Lin, Zihao, et al.
Published: (2024)
Identifying Knowledge Editing Types in Large Language Models
by: Li, Xiaopeng, et al.
Published: (2024)
O-Edit: Orthogonal Subspace Editing for Language Model Sequential Editing
by: Cai, Yuchen, et al.
Published: (2024)
Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble
by: Li, Yongchang, et al.
Published: (2024)
A Comprehensive Study of Knowledge Editing for Large Language Models
by: Zhang, Ningyu, et al.
Published: (2024)
AgentAlign: Navigating Safety Alignment in the Shift from Informative to Agentic Large Language Models
by: Zhang, Jinchuan, et al.
Published: (2025)
On the Robustness of Knowledge Editing for Detoxification
by: Dong, Ming, et al.
Published: (2026)
Mechanistic Circuit-Based Knowledge Editing in Large Language Models
by: Zhao, Tianyi, et al.
Published: (2026)
UniEdit: A Unified Knowledge Editing Benchmark for Large Language Models
by: Chen, Qizhou, et al.
Published: (2025)
Knowledge Editing on Black-box Large Language Models
by: Song, Xiaoshuai, et al.
Published: (2024)