:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Duan, Zenghao, Duan, Wenbin, Yin, Zhiyi, Shen, Yinghan, Jing, Shaoling, Zhang, Jie, Shen, Huawei, Cheng, Xueqi
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.06868
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

GloSS over Toxicity: Understanding and Mitigating Toxicity in LLMs via Global Toxic Subspace
by: Duan, Zenghao, et al.
Published: (2025)

Large Language Model Sourcing: A Survey
by: Pang, Liang, et al.
Published: (2025)

Projecting Out the Malice: A Global Subspace Approach to LLM Detoxification
by: Duan, Zenghao, et al.
Published: (2026)

Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent
by: Zhou, Junkai, et al.
Published: (2024)

Stable Knowledge Editing in Large Language Models
by: Wei, Zihao, et al.
Published: (2024)

The Evolution of Thought: Tracking LLM Overthinking via Reasoning Dynamics Analysis
by: Wei, Zihao, et al.
Published: (2025)

MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models
by: Wei, Zihao, et al.
Published: (2024)

Circular Reasoning: Understanding Self-Reinforcing Loops in Large Reasoning Models
by: Duan, Zenghao, et al.
Published: (2026)

Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models
by: Deng, Jingcheng, et al.
Published: (2024)

Think Before You Speak: Cultivating Communication Skills of Large Language Models via Inner Monologue
by: Zhou, Junkai, et al.
Published: (2023)

A Theory for Token-Level Harmonization in Retrieval-Augmented Generation
by: Xu, Shicheng, et al.
Published: (2024)

LLM Latent Reasoning as Chain of Superposition
by: Deng, Jingcheng, et al.
Published: (2025)

Latent-GRPO: Group Relative Policy Optimization for Latent Reasoning
by: Deng, Jingcheng, et al.
Published: (2026)

List-aware Reranking-Truncation Joint Model for Search and Retrieval-augmented Generation
by: Xu, Shicheng, et al.
Published: (2024)

Search-in-the-Chain: Interactively Enhancing Large Language Models with Search for Knowledge-intensive Tasks
by: Xu, Shicheng, et al.
Published: (2023)

from Benign import Toxic: Jailbreaking the Language Model via Adversarial Metaphors
by: Yan, Yu, et al.
Published: (2025)

Adaptive Token Biaser: Knowledge Editing via Biasing Key Entities
by: Bi, Baolong, et al.
Published: (2024)

Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models
by: Xu, Shicheng, et al.
Published: (2024)

Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation
by: Xu, Shicheng, et al.
Published: (2024)

StruEdit: Structured Outputs Enable the Fast and Accurate Knowledge Editing for Large Language Models
by: Bi, Baolong, et al.
Published: (2024)

Understanding the Collapse of LLMs in Model Editing
by: Yang, Wanli, et al.
Published: (2024)

The Mirage of Model Editing: Revisiting Evaluation in the Wild
by: Yang, Wanli, et al.
Published: (2025)

Reverse Physician-AI Relationship: Full-process Clinical Diagnosis Driven by a Large Language Model
by: Xu, Shicheng, et al.
Published: (2025)

Unlocking the Power of Large Language Models for Entity Alignment
by: Jiang, Xuhui, et al.
Published: (2024)

JudgeAgent: Beyond Static Benchmarks for Knowledge-Driven and Dynamic LLM Evaluation
by: Shi, Zhichao, et al.
Published: (2025)

The Labyrinth and the Thread: Rethinking Regularizations in Sequential Knowledge Editing for Large Language Models
by: Wang, Zheng, et al.
Published: (2026)

D-Models and E-Models: Diversity-Stability Trade-offs in the Sampling Behavior of Large Language Models
by: Gu, Jia, et al.
Published: (2026)

Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration
by: Wu, Kangxi, et al.
Published: (2024)

Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation
by: Gu, Jia, et al.
Published: (2024)

Inference-time Alignment in Continuous Space
by: Yuan, Yige, et al.
Published: (2025)

Fact-Level Confidence Calibration and Self-Correction
by: Yuan, Yige, et al.
Published: (2024)

LLM4MEA: Data-free Model Extraction Attacks on Sequential Recommenders via Large Language Models
by: Zhao, Shilong, et al.
Published: (2025)

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models
by: Liu, Xiyu, et al.
Published: (2026)

ChainEdit: Propagating Ripple Effects in LLM Knowledge Editing through Logical Rule-Guided Chains
by: Dong, Zilu, et al.
Published: (2025)

Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems
by: Wang, Zixu, et al.
Published: (2026)

MEMIT-Merge: Addressing MEMIT's Key-Value Conflicts in Same-Subject Batch Editing for LLMs
by: Dong, Zilu, et al.
Published: (2025)

Decoupling Reasoning and Knowledge Injection for In-Context Knowledge Editing
by: Wang, Changyue, et al.
Published: (2025)

Modeling Balanced Explicit and Implicit Relations with Contrastive Learning for Knowledge Concept Recommendation in MOOCs
by: Gu, Hengnian, et al.
Published: (2024)

Editing Conceptual Knowledge for Large Language Models
by: Wang, Xiaohan, et al.
Published: (2024)

Identifying Knowledge Editing Types in Large Language Models
by: Li, Xiaopeng, et al.
Published: (2024)