| Main Authors: | Dong, Ming; Tang, Shiyi; Peng, Ziyan; Chen, Guanyi; He, Tingting |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.10504 |
Similar Items
DSCD: Large Language Model Detoxification with Self-Constrained Decoding
by: Dong, Ming, et al.
Published: (2025)
How Much Do LLMs Know About Chinese Zero Pronouns?
by: Li, Yifei, et al.
Published: (2026)
Adaptive Detoxification: Safeguarding General Capabilities of LLMs through Toxicity-Aware Knowledge Editing
by: Lu, Yifan, et al.
Published: (2025)
How Do People Quantify Naturally: Evidence from Mandarin Picture Description
by: Zhang, Yayun, et al.
Published: (2026)
Do Large Language Models Judge Error Severity Like Humans?
by: Sun, Diege, et al.
Published: (2025)
Emotional Supporters often Use Multiple Strategies in a Single Turn
by: Bai, Xin, et al.
Published: (2025)
CCNU at SemEval-2025 Task 3: Leveraging Internal and External Knowledge of Large Language Models for Multilingual Hallucination Annotation
by: Liu, Xu, et al.
Published: (2025)
Rich Semantic Knowledge Enhanced Large Language Models for Few-shot Chinese Spell Checking
by: Dong, Ming, et al.
Published: (2024)
When Seekers Are Hard to Help: Evaluating Emotional Support Dialogue Systems in Worst-Case Interactions
by: Yang, Jiajie, et al.
Published: (2026)
An Effective, Robust and Fairness-aware Hate Speech Detection Framework
by: Mou, Guanyi, et al.
Published: (2024)
Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit
by: Chen, Qizhou, et al.
Published: (2024)
Mechanistic Circuit-Based Knowledge Editing in Large Language Models
by: Zhao, Tianyi, et al.
Published: (2026)
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing
by: Wang, Peng, et al.
Published: (2025)
CMD: a framework for Context-aware Model self-Detoxification
by: Tang, Zecheng, et al.
Published: (2023)
SetKE: Knowledge Editing for Knowledge Elements Overlap
by: Wei, Yifan, et al.
Published: (2025)
Targeted Efficient Fine-tuning: Optimizing Parameter Updates with Data-Driven Sample Selection
by: Dong, Ming, et al.
Published: (2024)
FISH-Tuning: Enhancing PEFT Methods with Fisher Information
by: Xue, Kang, et al.
Published: (2025)
Knowledge Editing through Chain-of-Thought
by: Wang, Changyue, et al.
Published: (2024)
Context-Robust Knowledge Editing for Language Models
by: Park, Haewon, et al.
Published: (2025)
ylmmcl at Multilingual Text Detoxification 2025: Lexicon-Guided Detoxification and Classifier-Gated Rewriting
by: Lai-Lopez, Nicole, et al.
Published: (2025)
Detoxification for LLM: From Dataset Itself
by: Shao, Wei, et al.
Published: (2026)
DeepEdit: Knowledge Editing as Decoding with Constraints
by: Wang, Yiwei, et al.
Published: (2024)
Fine-Grained Detoxification via Instance-Level Prefixes for Large Language Models
by: Yi, Xin, et al.
Published: (2024)
Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
by: Hsueh, Cheng-Hsun, et al.
Published: (2024)
EvoEdit: Evolving Null-space Alignment for Robust and Efficient Knowledge Editing
by: Lyu, Sicheng, et al.
Published: (2025)
Breaking Bad Tokens: Detoxification of LLMs Using Sparse Autoencoders
by: Goyal, Agam, et al.
Published: (2025)
Intrinsic Task-based Evaluation for Referring Expression Generation
by: Chen, Guanyi, et al.
Published: (2024)
Computational Modelling of Plurality and Definiteness in Chinese Noun Phrases
by: Liu, Yuqi, et al.
Published: (2024)
Text Detoxification as Style Transfer in English and Hindi
by: Mukherjee, Sourabrata, et al.
Published: (2024)
Mitigating Heterogeneous Token Overfitting in LLM Knowledge Editing
by: Liu, Tianci, et al.
Published: (2025)
Event-level Knowledge Editing
by: Peng, Hao, et al.
Published: (2024)
Parameter-Efficient Detoxification with Contrastive Decoding
by: Niu, Tong, et al.
Published: (2024)
Benchmarking and Rethinking Knowledge Editing for Large Language Models
by: He, Guoxiu, et al.
Published: (2025)
Revealing the Deceptiveness of Knowledge Editing: A Mechanistic Analysis of Superficial Editing
by: Xie, Jiakuan, et al.
Published: (2025)
Beyond Local Edits: Embedding-Virtualized Knowledge for Broader Evaluation and Preservation of Model Editing
by: Liu, Shuainan, et al.
Published: (2026)
Knowledge Editing on Black-box Large Language Models
by: Song, Xiaoshuai, et al.
Published: (2024)
Knowledge Editing with Dynamic Knowledge Graphs for Multi-Hop Question Answering
by: Lu, Yifan, et al.
Published: (2024)
CKnowEdit: A New Chinese Knowledge Editing Dataset for Linguistics, Facts, and Logic Error Correction in LLMs
by: Fang, Jizhan, et al.
Published: (2024)
Aligning Language Models with Real-time Knowledge Editing
by: Tang, Chenming, et al.
Published: (2025)
DetoxLLM: A Framework for Detoxification with Explanations
by: Khondaker, Md Tawkat Islam, et al.
Published: (2024)