| Main Authors: | Li, Shijun, Dong, Kaiwen, Gao, Xiang, Ghosh, Joydeep |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.16345 |
Similar Items
Goal-Conditioned Supervised Learning for Multi-Objective Recommendation
by: Li, Shijun, et al.
Published: (2024)
OMAC: A Holistic Optimization Framework for LLM-Based Multi-Agent Collaboration
by: Li, Shijun, et al.
Published: (2025)
RRCM: Ranking-Driven Retrieval over Collaborative and Meta Memories for LLM Recommendation
by: Li, Shijun, et al.
Published: (2026)
Alignment Dynamics in LLM Fine-Tuning
by: Huang, Yuhan, et al.
Published: (2026)
Rotation-Preserving Supervised Fine-Tuning
by: Jin, Hangzhan, et al.
Published: (2026)
SVFT: Parameter-Efficient Fine-Tuning with Singular Vectors
by: Lingam, Vijay, et al.
Published: (2024)
Autonomous Learning From Success and Failure: Goal-Conditioned Supervised Learning with Negative Feedback
by: Zhang, Zeqiang, et al.
Published: (2025)
Supervised Fine-Tuning as Inverse Reinforcement Learning
by: Sun, Hao
Published: (2024)
A Layer-wise Analysis of Supervised Fine-Tuning
by: Zhao, Qinghua, et al.
Published: (2026)
Implicit Federated In-context Learning For Task-Specific LLM Fine-Tuning
by: Li, Dongcheng, et al.
Published: (2025)
Proximal Supervised Fine-Tuning
by: Zhu, Wenhong, et al.
Published: (2025)
Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)
by: Qin, Chongli, et al.
Published: (2025)
Stabilizing LLM Supervised Fine-Tuning via Explicit Distributional Control
by: Wang, Xinyu, et al.
Published: (2026)
Preserving Diversity in Supervised Fine-Tuning of Large Language Models
by: Li, Ziniu, et al.
Published: (2024)
Backward Learning for Goal-Conditioned Policies
by: Höftmann, Marc, et al.
Published: (2023)
HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models
by: Chen, Jie, et al.
Published: (2024)
Selection of LLM Fine-Tuning Data based on Orthogonal Rules
by: Li, Xiaomin, et al.
Published: (2024)
DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy
by: Xu, Kaixuan, et al.
Published: (2025)
Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
by: Wu, Lisheng, et al.
Published: (2024)
Proposing Hierarchical Goal-Conditioned Policy Planning in Multi-Goal Reinforcement Learning
by: Rens, Gavin B.
Published: (2025)
ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood
by: Wang, Ruoyu, et al.
Published: (2024)
Abstraction for Offline Goal-Conditioned Reinforcement Learning
by: Wibault, Clarisse, et al.
Published: (2026)
SVL: Goal-Conditioned Reinforcement Learning as Survival Learning
by: Tiofack, Franki Nguimatsia, et al.
Published: (2026)
RoRA: Efficient Fine-Tuning of LLM with Reliability Optimization for Rank Adaptation
by: Liu, Jun, et al.
Published: (2025)
Data Difficulty and the Generalization--Extrapolation Tradeoff in LLM Fine-Tuning
by: Liu, Siyuan, et al.
Published: (2026)
Understanding Forgetting in LLM Supervised Fine-Tuning and Preference Learning -- A Convex Optimization Perspective
by: Fernando, Heshan, et al.
Published: (2024)
Goal-Conditioned Agents that Learn Everything All at Once
by: Matthews, Michael, et al.
Published: (2026)
Internalizing Curriculum Judgment for LLM Reinforcement Fine-Tuning
by: Zheng, Han, et al.
Published: (2026)
Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies
by: Schweighofer, Kajetan, et al.
Published: (2026)
LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection
by: Zeng, Xinyue, et al.
Published: (2025)
Large Language Models for Sequential Decision-Making: Improving In-Context Learning via Supervised Fine-Tuning
by: Zhang, Minmin, et al.
Published: (2026)
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024)
On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting
by: Zhang, Wenhao, et al.
Published: (2025)
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
by: Wang, Liangzhou, et al.
Published: (2024)
Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling
by: Huang, Zeyu, et al.
Published: (2025)
FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents
by: Li, Qizheng, et al.
Published: (2026)
SFT-GO: Supervised Fine-Tuning with Group Optimization for Large Language Models
by: Kim, Gyuhak, et al.
Published: (2025)
Null Counterfactual Factor Interactions for Goal-Conditioned Reinforcement Learning
by: Chuck, Caleb, et al.
Published: (2025)
Goal-Conditioned Reinforcement Learning for Data-Driven Maritime Navigation
by: Vaidheeswaran, Vaishnav, et al.
Published: (2025)