Saved in:
| Main Authors: | Young, Rory, Pugeault, Nicolas |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.23312 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
by: Young, Rory, et al.
Published: (2024)
by: Young, Rory, et al.
Published: (2024)
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs
by: Zhelnin, Maxim, et al.
Published: (2024)
by: Zhelnin, Maxim, et al.
Published: (2024)
Thompson Sampling via Fine-Tuning of LLMs
by: Menet, Nicolas, et al.
Published: (2025)
by: Menet, Nicolas, et al.
Published: (2025)
DONOD: Efficient and Generalizable Instruction Fine-Tuning for LLMs via Model-Intrinsic Dataset Pruning
by: Hu, Jucheng, et al.
Published: (2025)
by: Hu, Jucheng, et al.
Published: (2025)
GIFT: Gradient-aware Immunization of diffusion models against malicious Fine-Tuning with safe concepts retention
by: Abdalla, Amro, et al.
Published: (2025)
by: Abdalla, Amro, et al.
Published: (2025)
GPart: End-to-End Isometric Fine-Tuning via Global Parameter Partitioning
by: Mandica, Paolo, et al.
Published: (2026)
by: Mandica, Paolo, et al.
Published: (2026)
GIFT: Reconciling Post-Training Objectives via Finite-Temperature Gibbs Initialization
by: Zhao, Zhengyang, et al.
Published: (2026)
by: Zhao, Zhengyang, et al.
Published: (2026)
Order-Independence Without Fine Tuning
by: McIlroy-Young, Reid, et al.
Published: (2024)
by: McIlroy-Young, Reid, et al.
Published: (2024)
Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods
by: Zekri, Oussama, et al.
Published: (2025)
by: Zekri, Oussama, et al.
Published: (2025)
GIFT: Bootstrapping Image-to-CAD Program Synthesis via Geometric Feedback
by: Giannone, Giorgio, et al.
Published: (2026)
by: Giannone, Giorgio, et al.
Published: (2026)
Activated LoRA: Fine-tuned LLMs for Intrinsics
by: Greenewald, Kristjan, et al.
Published: (2025)
by: Greenewald, Kristjan, et al.
Published: (2025)
Fine-Tuning Diffusion Models via Intermediate Distribution Shaping
by: Anil, Gautham Govind, et al.
Published: (2025)
by: Anil, Gautham Govind, et al.
Published: (2025)
RIFT: Repurposing Negative Samples via Reward-Informed Fine-Tuning
by: Liu, Zehua, et al.
Published: (2026)
by: Liu, Zehua, et al.
Published: (2026)
Memory-Efficient Fine-Tuning via Low-Rank Activation Compression
by: Shi, Jiang-Xin, et al.
Published: (2025)
by: Shi, Jiang-Xin, et al.
Published: (2025)
Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques
by: Sharma, Asankhaya
Published: (2025)
by: Sharma, Asankhaya
Published: (2025)
Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned Models
by: Arteaga, Gabriel Y., et al.
Published: (2024)
by: Arteaga, Gabriel Y., et al.
Published: (2024)
Alignment Dynamics in LLM Fine-Tuning
by: Huang, Yuhan, et al.
Published: (2026)
by: Huang, Yuhan, et al.
Published: (2026)
Rotation-Preserving Supervised Fine-Tuning
by: Jin, Hangzhan, et al.
Published: (2026)
by: Jin, Hangzhan, et al.
Published: (2026)
Fine-Tuning without Performance Degradation
by: Wang, Han, et al.
Published: (2025)
by: Wang, Han, et al.
Published: (2025)
Active Few-Shot Fine-Tuning
by: Hübotter, Jonas, et al.
Published: (2024)
by: Hübotter, Jonas, et al.
Published: (2024)
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals
by: Zhang, Nan, et al.
Published: (2026)
by: Zhang, Nan, et al.
Published: (2026)
$α$-LoRA: Effective Fine-Tuning via Base Model Rescaling
by: Firdoussi, Aymane El, et al.
Published: (2025)
by: Firdoussi, Aymane El, et al.
Published: (2025)
Bayesian Natural Gradient Fine-Tuning of CLIP Models via Kalman Filtering
by: Abdi, Hossein, et al.
Published: (2025)
by: Abdi, Hossein, et al.
Published: (2025)
Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
by: Kang, Hyeongyu, et al.
Published: (2025)
by: Kang, Hyeongyu, et al.
Published: (2025)
Understanding and Preserving Safety in Fine-Tuned LLMs
by: Zhang, Jiawen, et al.
Published: (2026)
by: Zhang, Jiawen, et al.
Published: (2026)
Fine-Tuned In-Context Learners for Efficient Adaptation
by: Bornschein, Jorg, et al.
Published: (2025)
by: Bornschein, Jorg, et al.
Published: (2025)
Spectral Adapter: Fine-Tuning in Spectral Space
by: Zhang, Fangzhao, et al.
Published: (2024)
by: Zhang, Fangzhao, et al.
Published: (2024)
MetaTT: A Global Tensor-Train Adapter for Parameter-Efficient Fine-Tuning
by: Lopez-Piqueres, Javier, et al.
Published: (2025)
by: Lopez-Piqueres, Javier, et al.
Published: (2025)
Fine-Tuning Diffusion Models for Molecular Generation via Reinforcement Learning and Fast Sampling
by: Lin, Guang, et al.
Published: (2026)
by: Lin, Guang, et al.
Published: (2026)
H2Tune: Federated Foundation Model Fine-Tuning with Hybrid Heterogeneity
by: Guo, Wei, et al.
Published: (2025)
by: Guo, Wei, et al.
Published: (2025)
Explaining Fine Tuned LLMs via Counterfactuals A Knowledge Graph Driven Framework
by: Wang, Yucheng, et al.
Published: (2025)
by: Wang, Yucheng, et al.
Published: (2025)
RPO: Fine-Tuning Visual Generative Models via Rich Vision-Language Preferences
by: Zhao, Hanyang, et al.
Published: (2025)
by: Zhao, Hanyang, et al.
Published: (2025)
Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models
by: Son, Hyegang, et al.
Published: (2024)
by: Son, Hyegang, et al.
Published: (2024)
FedGTST: Boosting Global Transferability of Federated Models via Statistics Tuning
by: Ma, Evelyn, et al.
Published: (2024)
by: Ma, Evelyn, et al.
Published: (2024)
TabTune: A Unified Library for Inference and Fine-Tuning Tabular Foundation Models
by: Tanna, Aditya, et al.
Published: (2025)
by: Tanna, Aditya, et al.
Published: (2025)
Portable Reward Tuning: Towards Reusable Fine-Tuning across Different Pretrained Models
by: Chijiwa, Daiki, et al.
Published: (2025)
by: Chijiwa, Daiki, et al.
Published: (2025)
SafeTuneBed: A Toolkit for Benchmarking LLM Safety Alignment in Fine-Tuning
by: Hossain, Saad, et al.
Published: (2025)
by: Hossain, Saad, et al.
Published: (2025)
Wireless Federated Multi-Task LLM Fine-Tuning via Sparse-and-Orthogonal LoRA
by: Yang, Nuocheng, et al.
Published: (2026)
by: Yang, Nuocheng, et al.
Published: (2026)
FedProxy: Federated Fine-Tuning of LLMs via Proxy SLMs and Heterogeneity-Aware Fusion
by: Fan, Tao, et al.
Published: (2026)
by: Fan, Tao, et al.
Published: (2026)
Robust and Efficient Zeroth-Order LLM Fine-Tuning via Adaptive Bayesian Subspace Optimizer
by: Feng, Jian, et al.
Published: (2026)
by: Feng, Jian, et al.
Published: (2026)
Similar Items
-
Enhancing Robustness in Deep Reinforcement Learning: A Lyapunov Exponent Approach
by: Young, Rory, et al.
Published: (2024) -
GIFT-SW: Gaussian noise Injected Fine-Tuning of Salient Weights for LLMs
by: Zhelnin, Maxim, et al.
Published: (2024) -
Thompson Sampling via Fine-Tuning of LLMs
by: Menet, Nicolas, et al.
Published: (2025) -
DONOD: Efficient and Generalizable Instruction Fine-Tuning for LLMs via Model-Intrinsic Dataset Pruning
by: Hu, Jucheng, et al.
Published: (2025) -
GIFT: Gradient-aware Immunization of diffusion models against malicious Fine-Tuning with safe concepts retention
by: Abdalla, Amro, et al.
Published: (2025)