Saved in:
| Main Authors: | Duan, Zenghao, Duan, Wenbin, Yin, Zhiyi, Shen, Yinghan, Jing, Shaoling, Zhang, Jie, Shen, Huawei, Cheng, Xueqi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.06868 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
GloSS over Toxicity: Understanding and Mitigating Toxicity in LLMs via Global Toxic Subspace
by: Duan, Zenghao, et al.
Published: (2025)
by: Duan, Zenghao, et al.
Published: (2025)
Large Language Model Sourcing: A Survey
by: Pang, Liang, et al.
Published: (2025)
by: Pang, Liang, et al.
Published: (2025)
Projecting Out the Malice: A Global Subspace Approach to LLM Detoxification
by: Duan, Zenghao, et al.
Published: (2026)
by: Duan, Zenghao, et al.
Published: (2026)
Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent
by: Zhou, Junkai, et al.
Published: (2024)
by: Zhou, Junkai, et al.
Published: (2024)
Stable Knowledge Editing in Large Language Models
by: Wei, Zihao, et al.
Published: (2024)
by: Wei, Zihao, et al.
Published: (2024)
The Evolution of Thought: Tracking LLM Overthinking via Reasoning Dynamics Analysis
by: Wei, Zihao, et al.
Published: (2025)
by: Wei, Zihao, et al.
Published: (2025)
MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models
by: Wei, Zihao, et al.
Published: (2024)
by: Wei, Zihao, et al.
Published: (2024)
Circular Reasoning: Understanding Self-Reinforcing Loops in Large Reasoning Models
by: Duan, Zenghao, et al.
Published: (2026)
by: Duan, Zenghao, et al.
Published: (2026)
Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models
by: Deng, Jingcheng, et al.
Published: (2024)
by: Deng, Jingcheng, et al.
Published: (2024)
Think Before You Speak: Cultivating Communication Skills of Large Language Models via Inner Monologue
by: Zhou, Junkai, et al.
Published: (2023)
by: Zhou, Junkai, et al.
Published: (2023)
A Theory for Token-Level Harmonization in Retrieval-Augmented Generation
by: Xu, Shicheng, et al.
Published: (2024)
by: Xu, Shicheng, et al.
Published: (2024)
LLM Latent Reasoning as Chain of Superposition
by: Deng, Jingcheng, et al.
Published: (2025)
by: Deng, Jingcheng, et al.
Published: (2025)
Latent-GRPO: Group Relative Policy Optimization for Latent Reasoning
by: Deng, Jingcheng, et al.
Published: (2026)
by: Deng, Jingcheng, et al.
Published: (2026)
List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation
by: Xu, Shicheng, et al.
Published: (2024)
by: Xu, Shicheng, et al.
Published: (2024)
Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks
by: Xu, Shicheng, et al.
Published: (2023)
by: Xu, Shicheng, et al.
Published: (2023)
from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors
by: Yan, Yu, et al.
Published: (2025)
by: Yan, Yu, et al.
Published: (2025)
Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities
by: Bi, Baolong, et al.
Published: (2024)
by: Bi, Baolong, et al.
Published: (2024)
Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models
by: Xu, Shicheng, et al.
Published: (2024)
by: Xu, Shicheng, et al.
Published: (2024)
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation
by: Xu, Shicheng, et al.
Published: (2024)
by: Xu, Shicheng, et al.
Published: (2024)
StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models
by: Bi, Baolong, et al.
Published: (2024)
by: Bi, Baolong, et al.
Published: (2024)
Understanding the Collapse of LLMs in Model Editing
by: Yang, Wanli, et al.
Published: (2024)
by: Yang, Wanli, et al.
Published: (2024)
The Mirage of Model Editing: Revisiting Evaluation in the Wild
by: Yang, Wanli, et al.
Published: (2025)
by: Yang, Wanli, et al.
Published: (2025)
Reverse Physician-AI Relationship: Full-process Clinical Diagnosis Driven by a Large Language Model
by: Xu, Shicheng, et al.
Published: (2025)
by: Xu, Shicheng, et al.
Published: (2025)
Unlocking the Power of Large Language Models for Entity Alignment
by: Jiang, Xuhui, et al.
Published: (2024)
by: Jiang, Xuhui, et al.
Published: (2024)
JudgeAgent: Beyond Static Benchmarks for Knowledge-Driven and Dynamic LLM Evaluation
by: Shi, Zhichao, et al.
Published: (2025)
by: Shi, Zhichao, et al.
Published: (2025)
The Labyrinth and the Thread: Rethinking Regularizations in Sequential Knowledge Editing for Large Language Models
by: Wang, Zheng, et al.
Published: (2026)
by: Wang, Zheng, et al.
Published: (2026)
D-Models and E-Models: Diversity-Stability Trade-offs in the Sampling Behavior of Large Language Models
by: Gu, Jia, et al.
Published: (2026)
by: Gu, Jia, et al.
Published: (2026)
Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration
by: Wu, Kangxi, et al.
Published: (2024)
by: Wu, Kangxi, et al.
Published: (2024)
Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation
by: Gu, Jia, et al.
Published: (2024)
by: Gu, Jia, et al.
Published: (2024)
Inference-time Alignment in Continuous Space
by: Yuan, Yige, et al.
Published: (2025)
by: Yuan, Yige, et al.
Published: (2025)
Fact-Level Confidence Calibration and Self-Correction
by: Yuan, Yige, et al.
Published: (2024)
by: Yuan, Yige, et al.
Published: (2024)
LLM4MEA: Data-free Model Extraction Attacks on Sequential Recommenders via Large Language Models
by: Zhao, Shilong, et al.
Published: (2025)
by: Zhao, Shilong, et al.
Published: (2025)
Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models
by: Liu, Xiyu, et al.
Published: (2026)
by: Liu, Xiyu, et al.
Published: (2026)
ChainEdit: Propagating Ripple Effects in LLM Knowledge Editing through Logical Rule-Guided Chains
by: Dong, Zilu, et al.
Published: (2025)
by: Dong, Zilu, et al.
Published: (2025)
Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems
by: Wang, Zixu, et al.
Published: (2026)
by: Wang, Zixu, et al.
Published: (2026)
MEMIT-Merge: Addressing MEMIT's Key-Value Conflicts in Same-Subject Batch Editing for LLMs
by: Dong, Zilu, et al.
Published: (2025)
by: Dong, Zilu, et al.
Published: (2025)
Decoupling Reasoning and Knowledge Injection for In-Context Knowledge Editing
by: Wang, Changyue, et al.
Published: (2025)
by: Wang, Changyue, et al.
Published: (2025)
Modeling Balanced Explicit and Implicit Relations with Contrastive Learning for Knowledge Concept Recommendation in MOOCs
by: Gu, Hengnian, et al.
Published: (2024)
by: Gu, Hengnian, et al.
Published: (2024)
Editing Conceptual Knowledge for Large Language Models
by: Wang, Xiaohan, et al.
Published: (2024)
by: Wang, Xiaohan, et al.
Published: (2024)
Identifying Knowledge Editing Types in Large Language Models
by: Li, Xiaopeng, et al.
Published: (2024)
by: Li, Xiaopeng, et al.
Published: (2024)
Similar Items
-
GloSS over Toxicity: Understanding and Mitigating Toxicity in LLMs via Global Toxic Subspace
by: Duan, Zenghao, et al.
Published: (2025) -
Large Language Model Sourcing: A Survey
by: Pang, Liang, et al.
Published: (2025) -
Projecting Out the Malice: A Global Subspace Approach to LLM Detoxification
by: Duan, Zenghao, et al.
Published: (2026) -
Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent
by: Zhou, Junkai, et al.
Published: (2024) -
Stable Knowledge Editing in Large Language Models
by: Wei, Zihao, et al.
Published: (2024)