Saved in:
| Main Authors: | Yao, Zihan, He, Yu, Qi, Tianyu, Li, Ming |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.02699 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners
by: He, Yu, et al.
Published: (2025)
by: He, Yu, et al.
Published: (2025)
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis
by: Tu, Zeao, et al.
Published: (2024)
by: Tu, Zeao, et al.
Published: (2024)
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
by: Wang, Zihan, et al.
Published: (2025)
by: Wang, Zihan, et al.
Published: (2025)
MEMoE: Enhancing Model Editing with Mixture of Experts Adaptors
by: Wang, Renzhi, et al.
Published: (2024)
by: Wang, Renzhi, et al.
Published: (2024)
Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts
by: Chen, Qizhou, et al.
Published: (2024)
by: Chen, Qizhou, et al.
Published: (2024)
Scalable Training of Mixture-of-Experts Models with Megatron Core
by: Yan, Zijie, et al.
Published: (2026)
by: Yan, Zijie, et al.
Published: (2026)
Pruning General Large Language Models into Customized Expert Models
by: Zhao, Yirao, et al.
Published: (2025)
by: Zhao, Yirao, et al.
Published: (2025)
SciCustom: A Framework for Custom Evaluation of Scientific Capabilities in Large Language Models
by: Gu, Yiyang, et al.
Published: (2026)
by: Gu, Yiyang, et al.
Published: (2026)
Robust and Scalable Model Editing for Large Language Models
by: Chen, Yingfa, et al.
Published: (2024)
by: Chen, Yingfa, et al.
Published: (2024)
LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models
by: Wang, Renzhi, et al.
Published: (2024)
by: Wang, Renzhi, et al.
Published: (2024)
Neuron-Level Sequential Editing for Large Language Models
by: Jiang, Houcheng, et al.
Published: (2024)
by: Jiang, Houcheng, et al.
Published: (2024)
On the Robustness of Knowledge Editing for Detoxification
by: Dong, Ming, et al.
Published: (2026)
by: Dong, Ming, et al.
Published: (2026)
One-for-All Pruning: A Universal Model for Customized Compression of Large Language Models
by: Ye, Rongguang, et al.
Published: (2025)
by: Ye, Rongguang, et al.
Published: (2025)
Reinforced Strategy Optimization for Conversational Recommender Systems via Network-of-Experts
by: Zhao, Xiaoyan, et al.
Published: (2025)
by: Zhao, Xiaoyan, et al.
Published: (2025)
ChatPattern: Layout Pattern Customization via Natural Language
by: Wang, Zixiao, et al.
Published: (2024)
by: Wang, Zixiao, et al.
Published: (2024)
Cluster-Driven Expert Pruning for Mixture-of-Experts Large Language Models
by: Guo, Hongcheng, et al.
Published: (2025)
by: Guo, Hongcheng, et al.
Published: (2025)
Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models
by: Wang, Zihan, et al.
Published: (2024)
by: Wang, Zihan, et al.
Published: (2024)
Scalable Multi-Stage Influence Function for Large Language Models via Eigenvalue-Corrected Kronecker-Factored Parameterization
by: Bao, Yuntai, et al.
Published: (2025)
by: Bao, Yuntai, et al.
Published: (2025)
A Closer Look into Mixture-of-Experts in Large Language Models
by: Lo, Ka Man, et al.
Published: (2024)
by: Lo, Ka Man, et al.
Published: (2024)
Unlocking Efficient, Scalable, and Continual Knowledge Editing with Basis-Level Representation Fine-Tuning
by: Liu, Tianci, et al.
Published: (2025)
by: Liu, Tianci, et al.
Published: (2025)
Hybrid and Unitary PEFT for Resource-Efficient Large Language Models
by: Qi, Haomin, et al.
Published: (2025)
by: Qi, Haomin, et al.
Published: (2025)
Reverse-Engineering Model Editing on Language Models
by: Sun, Zhiyu, et al.
Published: (2026)
by: Sun, Zhiyu, et al.
Published: (2026)
Energy-Regularized Sequential Model Editing on Hyperspheres
by: Liu, Qingyuan, et al.
Published: (2025)
by: Liu, Qingyuan, et al.
Published: (2025)
Unveiling Super Experts in Mixture-of-Experts Large Language Models
by: Su, Zunhai, et al.
Published: (2025)
by: Su, Zunhai, et al.
Published: (2025)
Benchmarking and Learning Real-World Customer Service Dialogue
by: Gao, Tianhong, et al.
Published: (2025)
by: Gao, Tianhong, et al.
Published: (2025)
Scalable Multi-Domain Adaptation of Language Models using Modular Experts
by: Schafhalter, Peter, et al.
Published: (2024)
by: Schafhalter, Peter, et al.
Published: (2024)
GeoEdit: Geometric Knowledge Editing for Large Language Models
by: Feng, Yujie, et al.
Published: (2025)
by: Feng, Yujie, et al.
Published: (2025)
FAME: Towards Factual Multi-Task Model Editing
by: Zeng, Li, et al.
Published: (2024)
by: Zeng, Li, et al.
Published: (2024)
Layerwise Recurrent Router for Mixture-of-Experts
by: Qiu, Zihan, et al.
Published: (2024)
by: Qiu, Zihan, et al.
Published: (2024)
Are Expert-Level Language Models Expert-Level Annotators?
by: Tseng, Yu-Min, et al.
Published: (2024)
by: Tseng, Yu-Min, et al.
Published: (2024)
CollabEdit: Towards Non-destructive Collaborative Knowledge Editing
by: Zheng, Jiamu, et al.
Published: (2024)
by: Zheng, Jiamu, et al.
Published: (2024)
Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances
by: Singh, Rishu Kumar, et al.
Published: (2025)
by: Singh, Rishu Kumar, et al.
Published: (2025)
SciDFM: A Large Language Model with Mixture-of-Experts for Science
by: Sun, Liangtai, et al.
Published: (2024)
by: Sun, Liangtai, et al.
Published: (2024)
Editing the Mind of Giants: An In-Depth Exploration of Pitfalls of Knowledge Editing in Large Language Models
by: Hsueh, Cheng-Hsun, et al.
Published: (2024)
by: Hsueh, Cheng-Hsun, et al.
Published: (2024)
Diagnosing Model Editing via Knowledge Spectrum
by: Pan, Tsung-Hsuan, et al.
Published: (2025)
by: Pan, Tsung-Hsuan, et al.
Published: (2025)
Rethinking Residual Distribution in Locate-then-Edit Model Editing
by: Li, Xiaopeng, et al.
Published: (2025)
by: Li, Xiaopeng, et al.
Published: (2025)
MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts
by: Yu, Haofei, et al.
Published: (2023)
by: Yu, Haofei, et al.
Published: (2023)
EAMET: Robust Massive Model Editing via Embedding Alignment Optimization
by: Dai, Yanbo, et al.
Published: (2025)
by: Dai, Yanbo, et al.
Published: (2025)
ReXMoE: Reusing Experts with Minimal Overhead in Mixture-of-Experts
by: Tan, Zheyue, et al.
Published: (2025)
by: Tan, Zheyue, et al.
Published: (2025)
The Mirage of Model Editing: Revisiting Evaluation in the Wild
by: Yang, Wanli, et al.
Published: (2025)
by: Yang, Wanli, et al.
Published: (2025)
Similar Items
-
LMCD: Language Models are Zeroshot Cognitive Diagnosis Learners
by: He, Yu, et al.
Published: (2025) -
ResoFilter: Fine-grained Synthetic Data Filtering for Large Language Models through Data-Parameter Resonance Analysis
by: Tu, Zeao, et al.
Published: (2024) -
Chain-of-Experts: Unlocking the Communication Power of Mixture-of-Experts Models
by: Wang, Zihan, et al.
Published: (2025) -
MEMoE: Enhancing Model Editing with Mixture of Experts Adaptors
by: Wang, Renzhi, et al.
Published: (2024) -
Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts
by: Chen, Qizhou, et al.
Published: (2024)