Saved in:
| Main Authors: | Pan, Haowen, Wang, Xiaozhi, Cao, Yixin, Shi, Zenglin, Yang, Xun, Li, Juanzi, Wang, Meng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.01090 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
by: Pan, Haowen, et al.
Published: (2023)
by: Pan, Haowen, et al.
Published: (2023)
Event-level Knowledge Editing
by: Peng, Hao, et al.
Published: (2024)
by: Peng, Hao, et al.
Published: (2024)
Auxiliary Metrics Help Decoding Skill Neurons in the Wild
by: Zhao, Yixiu, et al.
Published: (2025)
by: Zhao, Yixiu, et al.
Published: (2025)
Towards Localized and Disentangled Knowledge Editing for Multimodal Large Language Models
by: Gu, Leijiang, et al.
Published: (2026)
by: Gu, Leijiang, et al.
Published: (2026)
Towards Understanding Safety Alignment: A Mechanistic Perspective from Safety Neurons
by: Chen, Jianhui, et al.
Published: (2024)
by: Chen, Jianhui, et al.
Published: (2024)
Visual-Oriented Fine-Grained Knowledge Editing for MultiModal Large Language Models
by: Zeng, Zhen, et al.
Published: (2024)
by: Zeng, Zhen, et al.
Published: (2024)
OpenEP: Open-Ended Future Event Prediction
by: Guan, Yong, et al.
Published: (2024)
by: Guan, Yong, et al.
Published: (2024)
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
by: Zhang, Jiajie, et al.
Published: (2024)
by: Zhang, Jiajie, et al.
Published: (2024)
XMoE: Sparse Models with Fine-grained and Adaptive Expert Selection
by: Yang, Yuanhang, et al.
Published: (2024)
by: Yang, Yuanhang, et al.
Published: (2024)
Bridging the Editing Gap in LLMs: FineEdit for Precise and Targeted Text Modifications
by: Zeng, Yiming, et al.
Published: (2025)
by: Zeng, Yiming, et al.
Published: (2025)
R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models
by: Tu, Shangqing, et al.
Published: (2024)
by: Tu, Shangqing, et al.
Published: (2024)
Edit Less, Achieve More: Dynamic Sparse Neuron Masking for Lifelong Knowledge Editing in LLMs
by: Liu, Jinzhe, et al.
Published: (2025)
by: Liu, Jinzhe, et al.
Published: (2025)
TacoERE: Cluster-aware Compression for Event Relation Extraction
by: Guan, Yong, et al.
Published: (2024)
by: Guan, Yong, et al.
Published: (2024)
ADELIE: Aligning Large Language Models on Information Extraction
by: Qi, Yunjia, et al.
Published: (2024)
by: Qi, Yunjia, et al.
Published: (2024)
Multilingual Knowledge Editing with Language-Agnostic Factual Neurons
by: Zhang, Xue, et al.
Published: (2024)
by: Zhang, Xue, et al.
Published: (2024)
ChatLog: Carefully Evaluating the Evolution of ChatGPT Across Time
by: Tu, Shangqing, et al.
Published: (2023)
by: Tu, Shangqing, et al.
Published: (2023)
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification
by: Sun, Kai, et al.
Published: (2024)
by: Sun, Kai, et al.
Published: (2024)
WildReward: Learning Reward Models from In-the-Wild Human Interactions
by: Peng, Hao, et al.
Published: (2026)
by: Peng, Hao, et al.
Published: (2026)
Constraint Back-translation Improves Complex Instruction Following of Large Language Models
by: Qi, Yunjia, et al.
Published: (2024)
by: Qi, Yunjia, et al.
Published: (2024)
VerIF: Verification Engineering for Reinforcement Learning in Instruction Following
by: Peng, Hao, et al.
Published: (2025)
by: Peng, Hao, et al.
Published: (2025)
On the Paradoxical Interference between Instruction-Following and Task Solving
by: Qi, Yunjia, et al.
Published: (2026)
by: Qi, Yunjia, et al.
Published: (2026)
MAVEN-Fact: A Large-scale Event Factuality Detection Dataset
by: Li, Chunyang, et al.
Published: (2024)
by: Li, Chunyang, et al.
Published: (2024)
Do LLMs Signal When They're Right? Evidence from Neuron Agreement
by: Chen, Kang, et al.
Published: (2025)
by: Chen, Kang, et al.
Published: (2025)
Are LLMs Really Not Knowledgeable? Mining the Submerged Knowledge in LLMs' Memory
by: Tao, Xingjian, et al.
Published: (2024)
by: Tao, Xingjian, et al.
Published: (2024)
LinguaLens: Towards Interpreting Linguistic Mechanisms of Large Language Models via Sparse Auto-Encoder
by: Jing, Yi, et al.
Published: (2025)
by: Jing, Yi, et al.
Published: (2025)
Fine-grained Hallucination Detection and Editing for Language Models
by: Mishra, Abhika, et al.
Published: (2024)
by: Mishra, Abhika, et al.
Published: (2024)
Knowledge Editing for Large Language Model with Knowledge Neuronal Ensemble
by: Li, Yongchang, et al.
Published: (2024)
by: Li, Yongchang, et al.
Published: (2024)
MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing
by: Li, Jiaqi, et al.
Published: (2024)
by: Li, Jiaqi, et al.
Published: (2024)
FABLE: Fine-grained Fact Anchoring for Unstructured Model Editing
by: Wang, Peng, et al.
Published: (2026)
by: Wang, Peng, et al.
Published: (2026)
MEMLA: Enhancing Multilingual Knowledge Editing with Neuron-Masked Low-Rank Adaptation
by: Xie, Jiakuan, et al.
Published: (2024)
by: Xie, Jiakuan, et al.
Published: (2024)
StoryWriter: A Multi-Agent Framework for Long Story Generation
by: Xia, Haotian, et al.
Published: (2025)
by: Xia, Haotian, et al.
Published: (2025)
StoryAlign: Evaluating and Training Reward Models for Story Generation
by: Xia, Haotian, et al.
Published: (2026)
by: Xia, Haotian, et al.
Published: (2026)
Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems
by: Peng, Hao, et al.
Published: (2025)
by: Peng, Hao, et al.
Published: (2025)
RPC-Bench: A Fine-grained Benchmark for Research Paper Comprehension
by: Chen, Yelin, et al.
Published: (2026)
by: Chen, Yelin, et al.
Published: (2026)
Navigating the Nuances: A Fine-grained Evaluation of Vision-Language Navigation
by: Wang, Zehao, et al.
Published: (2024)
by: Wang, Zehao, et al.
Published: (2024)
Learning to Edit: Aligning LLMs with Knowledge Editing
by: Jiang, Yuxin, et al.
Published: (2024)
by: Jiang, Yuxin, et al.
Published: (2024)
PairJudge RM: Perform Best-of-N Sampling with Knockout Tournament
by: Liu, Yantao, et al.
Published: (2025)
by: Liu, Yantao, et al.
Published: (2025)
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style
by: Liu, Yantao, et al.
Published: (2024)
by: Liu, Yantao, et al.
Published: (2024)
PsyMem: Fine-grained psychological alignment and Explicit Memory Control for Advanced Role-Playing LLMs
by: Cheng, Xilong, et al.
Published: (2025)
by: Cheng, Xilong, et al.
Published: (2025)
NeuronTune: Fine-Grained Neuron Modulation for Balanced Safety-Utility Alignment in LLMs
by: Pan, Birong, et al.
Published: (2025)
by: Pan, Birong, et al.
Published: (2025)
Similar Items
-
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers
by: Pan, Haowen, et al.
Published: (2023) -
Event-level Knowledge Editing
by: Peng, Hao, et al.
Published: (2024) -
Auxiliary Metrics Help Decoding Skill Neurons in the Wild
by: Zhao, Yixiu, et al.
Published: (2025) -
Towards Localized and Disentangled Knowledge Editing for Multimodal Large Language Models
by: Gu, Leijiang, et al.
Published: (2026) -
Towards Understanding Safety Alignment: A Mechanistic Perspective from Safety Neurons
by: Chen, Jianhui, et al.
Published: (2024)