:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kong, Fanshuang, Zhang, Richong, Wang, Ziqiao
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2408.09485
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LH-Mix: Local Hierarchy Correlation Guided Mixup over Hierarchical Prompt Tuning
by: Kong, Fanshuang, et al.
Published: (2024)

MOMA: Masked Orthogonal Matrix Alignment for Zero-Additional-Parameter Model Merging
by: Kong, Fanshuang, et al.
Published: (2024)

inversedMixup: Data Augmentation via Inverting Mixed Embeddings
by: Kong, Fanshuang, et al.
Published: (2026)

Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation
by: Zhang, Junhao, et al.
Published: (2025)

On SkipGram Word Embedding Models with Negative Sampling: Unified Framework and Impact of Noise Distributions
by: Liu, Dezhi, et al.
Published: (2020)

Improving General Text Embedding Model: Tackling Task Conflict and Data Imbalance through Model Merging
by: Li, Mingxin, et al.
Published: (2024)

Locate-then-Merge: Neuron-Level Parameter Fusion for Mitigating Catastrophic Forgetting in Multimodal LLMs
by: Yu, Zeping, et al.
Published: (2025)

LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint
by: Ma, Qianli, et al.
Published: (2025)

General Table Question Answering via Answer-Formula Joint Generation
by: Wang, Zhongyuan, et al.
Published: (2025)

Towards Better Understanding of Contrastive Sentence Representation Learning: A Unified Paradigm for Gradient
by: Li, Mingxin, et al.
Published: (2024)

Tool-Assisted Agent on SQL Inspection and Refinement in Real-World Scenarios
by: Wang, Zhongyuan, et al.
Published: (2024)

Dynamic Task Vector Grouping for Efficient Multi-Task Prompt Tuning
by: Zhang, Pieyi, et al.
Published: (2025)

Multimodal Abstractive Summarization of Instructional Videos with Vision-Language Models
by: Nazir, Maham, et al.
Published: (2026)

A Text is Worth Several Tokens: Text Embedding from LLMs Secretly Aligns Well with The Key Tokens
by: Nie, Zhijie, et al.
Published: (2024)

CoRect: Context-Aware Logit Contrast for Hidden State Rectification to Resolve Knowledge Conflicts
by: Ma, Xuhua, et al.
Published: (2026)

Code-Style In-Context Learning for Knowledge-Based Question Answering
by: Nie, Zhijie, et al.
Published: (2023)

Parameter Competition Balancing for Model Merging
by: Du, Guodong, et al.
Published: (2024)

Activation-Guided Consensus Merging for Large Language Models
by: Yao, Yuxuan, et al.
Published: (2025)

Sens-Merging: Sensitivity-Guided Parameter Balancing for Merging Large Language Models
by: Liu, Shuqi, et al.
Published: (2025)

Activation-Informed Merging of Large Language Models
by: Nobari, Amin Heyrani, et al.
Published: (2025)

Merging by Matching Models in Task Parameter Subspaces
by: Tam, Derek, et al.
Published: (2023)

Progressively Modality Freezing for Multi-Modal Entity Alignment
by: Huang, Yani, et al.
Published: (2024)

A Graph-based Verification Framework for Fact-Checking
by: Huang, Yani, et al.
Published: (2025)

CausalDetox: Causal Head Selection and Intervention for Language Model Detoxification
by: Wang, Yian, et al.
Published: (2026)

Upcycling Instruction Tuning from Dense to Mixture-of-Experts via Parameter Merging
by: Hui, Tingfeng, et al.
Published: (2024)

Multi-Modality Expansion and Retention for LLMs through Parameter Merging and Decoupling
by: Li, Junlin, et al.
Published: (2025)

Semantics-Adaptive Activation Intervention for LLMs via Dynamic Steering Vectors
by: Wang, Weixuan, et al.
Published: (2024)

Improving Zero-Shot Cross-Lingual Transfer via Progressive Code-Switching
by: Li, Zhuoran, et al.
Published: (2024)

Babysit A Language Model From Scratch: Interactive Language Learning by Trials and Demonstrations
by: Ma, Ziqiao, et al.
Published: (2024)

Merging Beyond: Streaming LLM Updates via Activation-Guided Rotations
by: Yao, Yuxuan, et al.
Published: (2026)

AdaMergeX: Cross-Lingual Transfer with Large Language Models via Adaptive Adapter Merging
by: Zhao, Yiran, et al.
Published: (2024)

Debiasing Reward Models via Causally Motivated Inference-Time Intervention
by: Shinoda, Kazutoshi, et al.
Published: (2026)

Exploring Activation Patterns of Parameters in Language Models
by: Wang, Yudong, et al.
Published: (2024)

Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging
by: Ju, Yiming, et al.
Published: (2024)

Ensemble Debiasing Across Class and Sample Levels for Fairer Prompting Accuracy
by: Lin, Ruixi, et al.
Published: (2025)

Dynamic Fisher-weighted Model Merging via Bayesian Optimization
by: Lee, Sanwoo, et al.
Published: (2025)

Prompt-Activation Duality: Improving Activation Steering via Attention-Level Interventions
by: Kang, Diancheng, et al.
Published: (2026)

Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing
by: Li, Zhuoran, et al.
Published: (2025)

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
by: Zheng, Yaowei, et al.
Published: (2024)

Merge to Mix: Mixing Datasets via Model Merging
by: Tao, Zhixu Silvia, et al.
Published: (2025)