Saved in:
| Main Authors: | Kong, Fanshuang, Zhang, Richong, Wang, Ziqiao |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.09485 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning
by: Kong, Fanshuang, et al.
Published: (2024)
by: Kong, Fanshuang, et al.
Published: (2024)
MOMA: Masked Orthogonal Matrix Alignment for Zero-Additional-Parameter Model Merging
by: Kong, Fanshuang, et al.
Published: (2024)
by: Kong, Fanshuang, et al.
Published: (2024)
inversedMixup: Data Augmentation via Inverting Mixed Embeddings
by: Kong, Fanshuang, et al.
Published: (2026)
by: Kong, Fanshuang, et al.
Published: (2026)
Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation
by: Zhang, Junhao, et al.
Published: (2025)
by: Zhang, Junhao, et al.
Published: (2025)
On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
by: Liu, Dezhi, et al.
Published: (2020)
by: Liu, Dezhi, et al.
Published: (2020)
Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging
by: Li, Mingxin, et al.
Published: (2024)
by: Li, Mingxin, et al.
Published: (2024)
Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs
by: Yu, Zeping, et al.
Published: (2025)
by: Yu, Zeping, et al.
Published: (2025)
LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint
by: Ma, Qianli, et al.
Published: (2025)
by: Ma, Qianli, et al.
Published: (2025)
General Table Question Answering via Answer-Formula Joint Generation
by: Wang, Zhongyuan, et al.
Published: (2025)
by: Wang, Zhongyuan, et al.
Published: (2025)
Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for Gradient
by: Li, Mingxin, et al.
Published: (2024)
by: Li, Mingxin, et al.
Published: (2024)
Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios
by: Wang, Zhongyuan, et al.
Published: (2024)
by: Wang, Zhongyuan, et al.
Published: (2024)
Dynamic Task Vector Grouping for Efficient Multi-Task Prompt Tuning
by: Zhang, Pieyi, et al.
Published: (2025)
by: Zhang, Pieyi, et al.
Published: (2025)
Multimodal Abstractive Summarization of Instructional Videos with Vision-Language Models
by: Nazir, Maham, et al.
Published: (2026)
by: Nazir, Maham, et al.
Published: (2026)
A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens
by: Nie, Zhijie, et al.
Published: (2024)
by: Nie, Zhijie, et al.
Published: (2024)
CoRect: Context-Aware Logit Contrast for Hidden State Rectification to Resolve Knowledge Conflicts
by: Ma, Xuhua, et al.
Published: (2026)
by: Ma, Xuhua, et al.
Published: (2026)
Code-Style In-Context Learning for Knowledge-Based Question Answering
by: Nie, Zhijie, et al.
Published: (2023)
by: Nie, Zhijie, et al.
Published: (2023)
Parameter Competition Balancing for Model Merging
by: Du, Guodong, et al.
Published: (2024)
by: Du, Guodong, et al.
Published: (2024)
Activation-Guided Consensus Merging for Large Language Models
by: Yao, Yuxuan, et al.
Published: (2025)
by: Yao, Yuxuan, et al.
Published: (2025)
Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models
by: Liu, Shuqi, et al.
Published: (2025)
by: Liu, Shuqi, et al.
Published: (2025)
Activation-Informed Merging of Large Language Models
by: Nobari, Amin Heyrani, et al.
Published: (2025)
by: Nobari, Amin Heyrani, et al.
Published: (2025)
Merging by Matching Models in Task Parameter Subspaces
by: Tam, Derek, et al.
Published: (2023)
by: Tam, Derek, et al.
Published: (2023)
Progressively Modality Freezing for Multi-Modal Entity Alignment
by: Huang, Yani, et al.
Published: (2024)
by: Huang, Yani, et al.
Published: (2024)
A Graph-based Verification Framework for Fact-Checking
by: Huang, Yani, et al.
Published: (2025)
by: Huang, Yani, et al.
Published: (2025)
CausalDetox: Causal Head Selection and Intervention for Language Model Detoxification
by: Wang, Yian, et al.
Published: (2026)
by: Wang, Yian, et al.
Published: (2026)
Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
by: Hui, Tingfeng, et al.
Published: (2024)
by: Hui, Tingfeng, et al.
Published: (2024)
Multi-Modality Expansion and Retention for LLMs through Parameter Merging and Decoupling
by: Li, Junlin, et al.
Published: (2025)
by: Li, Junlin, et al.
Published: (2025)
Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors
by: Wang, Weixuan, et al.
Published: (2024)
by: Wang, Weixuan, et al.
Published: (2024)
Improving Zero-Shot Cross-Lingual Transfer via Progressive Code-Switching
by: Li, Zhuoran, et al.
Published: (2024)
by: Li, Zhuoran, et al.
Published: (2024)
Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
by: Ma, Ziqiao, et al.
Published: (2024)
by: Ma, Ziqiao, et al.
Published: (2024)
Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations
by: Yao, Yuxuan, et al.
Published: (2026)
by: Yao, Yuxuan, et al.
Published: (2026)
AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging
by: Zhao, Yiran, et al.
Published: (2024)
by: Zhao, Yiran, et al.
Published: (2024)
Debiasing Reward Models via Causally Motivated Inference-Time Intervention
by: Shinoda, Kazutoshi, et al.
Published: (2026)
by: Shinoda, Kazutoshi, et al.
Published: (2026)
Exploring Activation Patterns of Parameters in Language Models
by: Wang, Yudong, et al.
Published: (2024)
by: Wang, Yudong, et al.
Published: (2024)
Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
by: Ju, Yiming, et al.
Published: (2024)
by: Ju, Yiming, et al.
Published: (2024)
Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy
by: Lin, Ruixi, et al.
Published: (2025)
by: Lin, Ruixi, et al.
Published: (2025)
Dynamic Fisher-weighted Model Merging via Bayesian Optimization
by: Lee, Sanwoo, et al.
Published: (2025)
by: Lee, Sanwoo, et al.
Published: (2025)
Prompt-Activation Duality: Improving Activation Steering via Attention-Level Interventions
by: Kang, Diancheng, et al.
Published: (2026)
by: Kang, Diancheng, et al.
Published: (2026)
Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing
by: Li, Zhuoran, et al.
Published: (2025)
by: Li, Zhuoran, et al.
Published: (2025)
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
by: Zheng, Yaowei, et al.
Published: (2024)
by: Zheng, Yaowei, et al.
Published: (2024)
Merge to Mix: Mixing Datasets via Model Merging
by: Tao, Zhixu Silvia, et al.
Published: (2025)
by: Tao, Zhixu Silvia, et al.
Published: (2025)
Similar Items
-
LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning
by: Kong, Fanshuang, et al.
Published: (2024) -
MOMA: Masked Orthogonal Matrix Alignment for Zero-Additional-Parameter Model Merging
by: Kong, Fanshuang, et al.
Published: (2024) -
inversedMixup: Data Augmentation via Inverting Mixed Embeddings
by: Kong, Fanshuang, et al.
Published: (2026) -
Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation
by: Zhang, Junhao, et al.
Published: (2025) -
On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
by: Liu, Dezhi, et al.
Published: (2020)