| Main Authors: | Vaina, Sofia Maria Lo Cicero, Chumachenko, Artem, Ryabinin, Max |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.10156 |
Similar Items
FFT-based Dynamic Subspace Selection for Low-Rank Adaptive Optimization of Large Language Models
by: Modoranu, Ionut-Vlad, et al.
Published: (2025)
Diffusion Language Models Generation Can Be Halted Early
by: Vaina, Sofia Maria Lo Cicero, et al.
Published: (2023)
Improving Multimodal Learning Balance and Sufficiency through Data Remixing
by: Ma, Xiaoyu, et al.
Published: (2025)
Linear Transformers with Learnable Kernel Functions are Better In-Context Models
by: Aksenov, Yaroslav, et al.
Published: (2024)
Representation Finetuning for Continual Learning
by: Luo, Haihua, et al.
Published: (2026)
Learning Dynamics of VLM Finetuning
by: Zhang, Jusheng, et al.
Published: (2025)
Learning Dynamics of LLM Finetuning
by: Ren, Yi, et al.
Published: (2024)
Traveling Waves Encode the Recent Past and Enhance Sequence Learning
by: Keller, T. Anderson, et al.
Published: (2023)
OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning
by: Yue, Sheng, et al.
Published: (2024)
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
by: Yan, Kai, et al.
Published: (2024)
Towards Fast Safe Online Reinforcement Learning via Policy Finetuning
by: Chen, Keru, et al.
Published: (2024)
Holdout-Loss-Based Data Selection for LLM Finetuning via In-Context Learning
by: Zhang, Ling, et al.
Published: (2025)
Active Learning for Continual Learning: Keeping the Past Alive in the Present
by: Park, Jaehyun, et al.
Published: (2025)
Finetune-Informed Pretraining Boosts Downstream Performance
by: Faysal, Atik, et al.
Published: (2026)
Order Independence With Finetuning
by: Brown, Katrina, et al.
Published: (2025)
Record-Remix-Replay: Hierarchical GPU Kernel Optimization using Evolutionary Search
by: Nichols, Daniel, et al.
Published: (2026)
SPADE: Faster Drug Discovery by Learning from Sparse Data
by: Nandakumar, Rahul, et al.
Published: (2026)
Understanding and Improving Noisy Embedding Techniques in Instruction Finetuning
by: Yadav, Abhay
Published: (2026)
Online Finetuning Decision Transformers with Pure RL Gradients
by: Luo, Junkai, et al.
Published: (2026)
Multimodal Web Navigation with Instruction-Finetuned Foundation Models
by: Furuta, Hiroki, et al.
Published: (2023)
On the Emergence of Cross-Task Linearity in the Pretraining-Finetuning Paradigm
by: Zhou, Zhanpeng, et al.
Published: (2024)
Bias-Restrained Prefix Representation Finetuning for Mathematical Reasoning
by: Liang, Sirui, et al.
Published: (2025)
Fuzzy-Pattern Tsetlin Machine
by: Hnilov, Artem
Published: (2025)
Transfer Learning of Tabular Data by Finetuning Large Language Models
by: Rabbani, Shourav B., et al.
Published: (2025)
Neural Organ Transplantation (NOT): Checkpoint-Based Modular Adaptation for Transformer Models
by: Al-Zuraiqi, Ahmad
Published: (2026)
Finetune Once: Decoupling General & Domain Learning with Dynamic Boosted Annealing
by: Tang, Yang, et al.
Published: (2025)
ANO : Faster is Better in Noisy Landscape
by: Kegreisz, Adrien
Published: (2025)
Predicting the Future by Retrieving the Past
by: Du, Dazhao, et al.
Published: (2025)
Orthogonal Finetuning for Direct Preference Optimization
by: Yang, Chenxu, et al.
Published: (2024)
Large Language Models to Diffusion Finetuning
by: Cetin, Edoardo, et al.
Published: (2025)
Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models
by: Mao, Yixiu, et al.
Published: (2026)
Beyond Parameter Finetuning: Test-Time Representation Refinement for Node Classification
by: Zhang, Jiaxin, et al.
Published: (2026)
Scaling Agentic Capabilities, Not Context: Efficient Reinforcement Finetuning for Large Toolspaces
by: Gupta, Karan, et al.
Published: (2026)
LLM-Inspired Pretrain-Then-Finetune for Small-Data, Large-Scale Optimization
by: Zhang, Zishi, et al.
Published: (2026)
Robust Federated Finetuning of LLMs via Alternating Optimization of LoRA
by: Chen, Shuangyi, et al.
Published: (2025)
GeoAda: Efficiently Finetune Geometric Diffusion Models with Equivariant Adapters
by: Zhao, Wanjia, et al.
Published: (2025)
Learning Rate Scaling across LoRA Ranks and Transfer to Full Finetuning
by: Chen, Nan, et al.
Published: (2026)
HashAttention: Semantic Sparsity for Faster Inference
by: Desai, Aditya, et al.
Published: (2024)
Calibrated Dataset Condensation for Faster Hyperparameter Search
by: Ding, Mucong, et al.
Published: (2024)
On Faster Marginalization with Squared Circuits via Orthonormalization
by: Loconte, Lorenzo, et al.
Published: (2024)