Saved in:
| Main Authors: | Chen, Jack, Liu, Fazhong, Liu, Naruto, Luo, Yuhan, Qin, Erqu, Zheng, Harry, Dong, Tian, Zhu, Haojin, Meng, Yan, Wang, Xiao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.13026 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance
by: Liu, Fazhong, et al.
Published: (2026)
by: Liu, Fazhong, et al.
Published: (2026)
Speculative Coreset Selection for Task-Specific Fine-tuning
by: Zhang, Xiaoyu, et al.
Published: (2024)
by: Zhang, Xiaoyu, et al.
Published: (2024)
Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory
by: Shu, Yunmeng, et al.
Published: (2025)
by: Shu, Yunmeng, et al.
Published: (2025)
GRASS: Gradient-based Adaptive Layer-wise Importance Sampling for Memory-efficient Large Language Model Fine-tuning
by: Tian, Kaiyuan, et al.
Published: (2026)
by: Tian, Kaiyuan, et al.
Published: (2026)
Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction
by: Liu, Zhexiong, et al.
Published: (2025)
by: Liu, Zhexiong, et al.
Published: (2025)
Are LLMs Effective Backbones for Fine-tuning? An Experimental Investigation of Supervised LLMs on Chinese Short Text Matching
by: Liu, Shulin, et al.
Published: (2024)
by: Liu, Shulin, et al.
Published: (2024)
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
by: Deng, Yihe, et al.
Published: (2025)
by: Deng, Yihe, et al.
Published: (2025)
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs
by: Lai, Xin, et al.
Published: (2024)
by: Lai, Xin, et al.
Published: (2024)
RobustFT: Robust Supervised Fine-tuning for Large Language Models under Noisy Response
by: Luo, Junyu, et al.
Published: (2024)
by: Luo, Junyu, et al.
Published: (2024)
LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views
by: Roh, Yuji, et al.
Published: (2024)
by: Roh, Yuji, et al.
Published: (2024)
Stage-wise Fine-tuning for Graph-to-Text Generation
by: Wang, Qingyun, et al.
Published: (2021)
by: Wang, Qingyun, et al.
Published: (2021)
Depth Gives a False Sense of Privacy: LLM Internal States Inversion
by: Dong, Tian, et al.
Published: (2025)
by: Dong, Tian, et al.
Published: (2025)
The Harder The Better: Maintaining Supervised Fine-tuning Generalization with Less but Harder Data
by: Shang, Zhaoyang, et al.
Published: (2025)
by: Shang, Zhaoyang, et al.
Published: (2025)
AIDE: Attribute-Guided MultI-Hop Data Expansion for Data Scarcity in Task-Specific Fine-tuning
by: Li, Jiayu, et al.
Published: (2024)
by: Li, Jiayu, et al.
Published: (2024)
On the Non-decoupling of Supervised Fine-tuning and Reinforcement Learning in Post-training
by: Niu, Xueyan, et al.
Published: (2026)
by: Niu, Xueyan, et al.
Published: (2026)
Improving Low-Resource Knowledge Tracing Tasks by Supervised Pre-training and Importance Mechanism Fine-tuning
by: Zhang, Hengyuan, et al.
Published: (2024)
by: Zhang, Hengyuan, et al.
Published: (2024)
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
by: Mitra, Purbesh, et al.
Published: (2025)
by: Mitra, Purbesh, et al.
Published: (2025)
Supervised Fine-Tuning Achieve Rapid Task Adaption Via Alternating Attention Head Activation Patterns
by: Zhao, Yang, et al.
Published: (2024)
by: Zhao, Yang, et al.
Published: (2024)
MMRAG-RFT: Two-stage Reinforcement Fine-tuning for Explainable Multi-modal Retrieval-augmented Generation
by: Zhao, Shengwei, et al.
Published: (2025)
by: Zhao, Shengwei, et al.
Published: (2025)
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
by: Li, Ziniu, et al.
Published: (2024)
by: Li, Ziniu, et al.
Published: (2024)
Debiasing LLMs by Fine-tuning
by: Gao, Zhenyu, et al.
Published: (2026)
by: Gao, Zhenyu, et al.
Published: (2026)
Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
by: Zhou, Huichi, et al.
Published: (2025)
by: Zhou, Huichi, et al.
Published: (2025)
A Layer-wise Analysis of Supervised Fine-Tuning
by: Zhao, Qinghua, et al.
Published: (2026)
by: Zhao, Qinghua, et al.
Published: (2026)
Semi-supervised Fine-tuning for Large Language Models
by: Luo, Junyu, et al.
Published: (2024)
by: Luo, Junyu, et al.
Published: (2024)
SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
by: Zhang, Jiacheng, et al.
Published: (2026)
by: Zhang, Jiacheng, et al.
Published: (2026)
HELENE: Hessian Layer-wise Clipping and Gradient Annealing for Accelerating Fine-tuning LLM with Zeroth-order Optimization
by: Zhao, Huaqin, et al.
Published: (2024)
by: Zhao, Huaqin, et al.
Published: (2024)
Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning
by: Tan, Zhaoxuan, et al.
Published: (2024)
by: Tan, Zhaoxuan, et al.
Published: (2024)
Towards Minimizing Feature Drift in Model Merging: Layer-wise Task Vector Fusion for Adaptive Knowledge Integration
by: Sun, Wenju, et al.
Published: (2025)
by: Sun, Wenju, et al.
Published: (2025)
Full-Step-DPO: Self-Supervised Preference Optimization with Step-wise Rewards for Mathematical Reasoning
by: Xu, Huimin, et al.
Published: (2025)
by: Xu, Huimin, et al.
Published: (2025)
Is One-Shot In-Context Learning Helpful for Data Selection in Task-Specific Fine-Tuning of Multimodal LLMs?
by: An, Xiao, et al.
Published: (2026)
by: An, Xiao, et al.
Published: (2026)
Numerical analysis and optimization on flow and heat transfer performance of HPD ‐type perforated fins
by: Hao Dai, et al.
Published: (2024)
by: Hao Dai, et al.
Published: (2024)
ITERTL: An Iterative Framework for Fine-tuning LLMs for RTL Code Generation
by: Wu, Peiyang, et al.
Published: (2024)
by: Wu, Peiyang, et al.
Published: (2024)
ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy
by: Chen, Yuhui, et al.
Published: (2025)
by: Chen, Yuhui, et al.
Published: (2025)
Are there type-III multiferroics?
by: Wang, Haojin, et al.
Published: (2025)
by: Wang, Haojin, et al.
Published: (2025)
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
by: Dong, Guanting, et al.
Published: (2023)
by: Dong, Guanting, et al.
Published: (2023)
UFT: Unifying Supervised and Reinforcement Fine-Tuning
by: Liu, Mingyang, et al.
Published: (2025)
by: Liu, Mingyang, et al.
Published: (2025)
Adaptive and Fine-grained Module-wise Expert Pruning for Efficient LoRA-MoE Fine-Tuning
by: Li, Weihang, et al.
Published: (2026)
by: Li, Weihang, et al.
Published: (2026)
ViTree: Single-path Neural Tree for Step-wise Interpretable Fine-grained Visual Categorization
by: Lao, Danning, et al.
Published: (2024)
by: Lao, Danning, et al.
Published: (2024)
When MOE Meets LLMs: Parameter Efficient Fine-tuning for Multi-task Medical Applications
by: Liu, Qidong, et al.
Published: (2023)
by: Liu, Qidong, et al.
Published: (2023)
Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models
by: Agarwal, Aradhye, et al.
Published: (2024)
by: Agarwal, Aradhye, et al.
Published: (2024)
Similar Items
-
Trojan's Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance
by: Liu, Fazhong, et al.
Published: (2026) -
Speculative Coreset Selection for Task-Specific Fine-tuning
by: Zhang, Xiaoyu, et al.
Published: (2024) -
Model Inversion in Split Learning for Personalized LLMs: New Insights from Information Bottleneck Theory
by: Shu, Yunmeng, et al.
Published: (2025) -
GRASS: Gradient-based Adaptive Layer-wise Importance Sampling for Memory-efficient Large Language Model Fine-tuning
by: Tian, Kaiyuan, et al.
Published: (2026) -
Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction
by: Liu, Zhexiong, et al.
Published: (2025)