| Main Authors: | Du, Peimian, Liu, Jiabin, Jin, Xiaowei, Zuo, Wangmeng, Li, Hui |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2505.11578 |
Similar Items
Outlier-weighed Layerwise Sampling for LLM Fine-tuning
by: Li, Pengxiang, et al.
Published: (2024)
InfoMamba: An Attention-Free Hybrid Mamba-Transformer Model
by: Wang, Youjin, et al.
Published: (2026)
Semi-Supervised Online Learning on the Edge by Transforming Knowledge from Teacher Models
by: Xue, Jiabin
Published: (2025)
Faster Convergence for Transformer Fine-tuning with Line Search Methods
by: Kenneweg, Philip, et al.
Published: (2024)
LaFFi: Leveraging Hybrid Natural Language Feedback for Fine-tuning Language Models
by: Li, Qianxi, et al.
Published: (2023)
Fine-tuning Flow Matching Generative Models with Intermediate Feedback
by: Fan, Jiajun, et al.
Published: (2025)
Fine-tuning Language Models with Generative Adversarial Reward Modelling
by: Yu, Zhang Ze, et al.
Published: (2023)
Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning
by: Tong, Anh, et al.
Published: (2025)
Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models
by: Fan, Jiajun, et al.
Published: (2025)
Unsupervised Generative Feature Transformation via Graph Contrastive Pre-training and Multi-objective Fine-tuning
by: Ying, Wangyang, et al.
Published: (2024)
HybriDNA: A Hybrid Transformer-Mamba2 Long-Range DNA Language Model
by: Ma, Mingqian, et al.
Published: (2025)
PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs
by: Yu, Tongzhou, et al.
Published: (2025)
Diffusion Transformers as Open-World Spatiotemporal Foundation Models
by: Yuan, Yuan, et al.
Published: (2024)
Membership Inference Attacks Against Fine-tuned Diffusion Language Models
by: Chen, Yuetian, et al.
Published: (2026)
Can Differentially Private Fine-tuning LLMs Protect Against Privacy Attacks?
by: Du, Hao, et al.
Published: (2025)
Hybrid Quantum-Classical Spatiotemporal Forecasting for 3D Cloud Fields
by: Wang, Fu, et al.
Published: (2026)
Physics-informed Attention-enhanced Fourier Neural Operator for Solar Magnetic Field Extrapolations
by: Cao, Jinghao, et al.
Published: (2025)
Hybrid Mamba-Transformer Decoder for Error-Correcting Codes
by: Cohen, Shy-el, et al.
Published: (2025)
Speculative Coreset Selection for Task-Specific Fine-tuning
by: Zhang, Xiaoyu, et al.
Published: (2024)
I can't see it but I can Fine-tune it: On Encrypted Fine-tuning of Transformers using Fully Homomorphic Encryption
by: Panzade, Prajwal, et al.
Published: (2024)
Rethinking Safety in LLM Fine-tuning: An Optimization Perspective
by: Kim, Minseon, et al.
Published: (2025)
SST: Multi-Scale Hybrid Mamba-Transformer Experts for Time Series Forecasting
by: Xu, Xiongxiao, et al.
Published: (2024)
Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning
by: Zou, Jiaru, et al.
Published: (2025)
Conservation-informed Graph Learning for Spatiotemporal Dynamics Prediction
by: Mi, Yuan, et al.
Published: (2024)
Sparse is Enough in Fine-tuning Pre-trained Large Language Models
by: Song, Weixi, et al.
Published: (2023)
A Survey of Mamba
by: Qu, Haohao, et al.
Published: (2024)
HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization
by: Zhao, Huaqin, et al.
Published: (2024)
Physics-Guided Tiny-Mamba Transformer for Reliability-Aware Early Fault Warning
by: Li, Changyu, et al.
Published: (2026)
Kron-LoRA: Hybrid Kronecker-LoRA Adapters for Scalable, Sustainable Fine-tuning
by: Shen, Yixin
Published: (2025)
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
by: NVIDIA, et al.
Published: (2025)
Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving
by: Zhang, Yuchen, et al.
Published: (2025)
Fine-tuning Behavioral Cloning Policies with Preference-Based Reinforcement Learning
by: Macuglia, Maël, et al.
Published: (2025)
Mamba Modulation: On the Length Generalization of Mamba
by: Lu, Peng, et al.
Published: (2025)
A HyperGraphMamba-Based Multichannel Adaptive Model for ncRNA Classification
by: An, Xin, et al.
Published: (2025)
Fine-tuning Large Language Model for Automated Algorithm Design
by: Liu, Fei, et al.
Published: (2025)
Exploring Memorization in Fine-tuned Language Models
by: Zeng, Shenglai, et al.
Published: (2023)
The Mamba in the Llama: Distilling and Accelerating Hybrid Models
by: Wang, Junxiong, et al.
Published: (2024)
TuneComp: Joint Fine-tuning and Compression for Large Foundation Models
by: Chen, Xiangyu, et al.
Published: (2025)
Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning
by: Luo, Yu, et al.
Published: (2026)
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
by: NVIDIA, et al.
Published: (2025)