| Main Authors: | Yixuan, Deng, Xiaoqiang, Ji |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.06023 |
Similar Items
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment
by: Tang, Yixuan, et al.
Published: (2025)
Safety-Aware Fine-Tuning of Large Language Models
by: Choi, Hyeong Kyu, et al.
Published: (2024)
Dynamics of Instruction Fine-Tuning for Chinese Large Language Models
by: Song, Chiyu, et al.
Published: (2023)
Discriminative Finetuning of Generative Large Language Models without Reward Models and Human Preference Data
by: Guo, Siqi, et al.
Published: (2025)
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models
by: Zhou, Xiongtao, et al.
Published: (2024)
Fine-Tuning Language Models with Reward Learning on Policy
by: Lang, Hao, et al.
Published: (2024)
Structure-Learnable Adapter Fine-Tuning for Parameter-Efficient Large Language Models
by: Gong, Ming, et al.
Published: (2025)
You Only Fine-tune Once: Many-Shot In-Context Fine-Tuning for Large Language Models
by: He, Wenchong, et al.
Published: (2025)
MeTA-LoRA: Data-Efficient Multi-Task Fine-Tuning for Large Language Models
by: Cheng, Bo, et al.
Published: (2025)
Improving Generalization in Intent Detection: GRPO with Reward-Based Curriculum Sampling
by: Feng, Zihao, et al.
Published: (2025)
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
by: Chen, Zixiang, et al.
Published: (2024)
Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning
by: Liu, Jinlong, et al.
Published: (2025)
Efficient Fine-Tuning of Large Language Models for Automated Medical Documentation
by: Leong, Hui Yi, et al.
Published: (2024)
GIFT: Group-Relative Implicit Fine-Tuning Integrates GRPO with DPO and UNA
by: Wang, Zhichao
Published: (2025)
Fine-Tuning Large Language Models for Scientific Text Classification: A Comparative Study
by: Rostam, Zhyar Rzgar K, et al.
Published: (2024)
A Study of Large Language Models for Patient Information Extraction: Model Architecture, Fine-Tuning Strategy, and Multi-task Instruction Tuning
by: Peng, Cheng, et al.
Published: (2025)
LiFT: Does Instruction Fine-Tuning Improve In-Context Learning for Longitudinal Modelling by Large Language Models?
by: Ali, Iqra, et al.
Published: (2026)
MMICT: Boosting Multi-Modal Fine-Tuning with In-Context Examples
by: Chen, Tao, et al.
Published: (2023)
HFT: Half Fine-Tuning for Large Language Models
by: Hui, Tingfeng, et al.
Published: (2024)
Memorization in Fine-Tuned Large Language Models
by: Savine, Danil
Published: (2025)
Exploring Fine-Tuning for In-Context Retrieval and Efficient KV-Caching in Long-Context Language Models
by: Molfese, Francesco Maria, et al.
Published: (2026)
Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models
by: Wu, Minghao, et al.
Published: (2024)
Efficiently Seeking Flat Minima for Better Generalization in Fine-Tuning Large Language Models and Beyond
by: Deng, Jiaxin, et al.
Published: (2025)
Towards Better Chinese-centric Neural Machine Translation for Low-resource Languages
by: Li, Bin, et al.
Published: (2022)
Assessing and Mitigating Data Memorization Risks in Fine-Tuned Large Language Models
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task
by: Garcia, Gabriel Lino, et al.
Published: (2024)
Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?
by: Zhu, Dawei, et al.
Published: (2024)
Fine-Tuning A Large Language Model for Systematic Review Screening
by: Yamoah, Kweku, et al.
Published: (2026)
Shaping Explanations: Semantic Reward Modeling with Encoder-Only Transformers for GRPO
by: Pappone, Francesco, et al.
Published: (2025)
Unveiling the Generalization Power of Fine-Tuned Large Language Models
by: Yang, Haoran, et al.
Published: (2024)
PICLe: Eliciting Diverse Behaviors from Large Language Models with Persona In-Context Learning
by: Choi, Hyeong Kyu, et al.
Published: (2024)
Take the essence and discard the dross: A Rethinking on Data Selection for Fine-Tuning Large Language Models
by: Liu, Ziche, et al.
Published: (2024)
FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models
by: Gao, Yan, et al.
Published: (2025)
Dynamic Adaptive Optimization for Effective Sentiment Analysis Fine-Tuning on Large Language Models
by: Ding, Hongcheng, et al.
Published: (2024)
Hyperbolic Fine-Tuning for Large Language Models
by: Yang, Menglin, et al.
Published: (2024)
Phased Instruction Fine-Tuning for Large Language Models
by: Pang, Wei, et al.
Published: (2024)
Dissecting Fine-Tuning Unlearning in Large Language Models
by: Hong, Yihuai, et al.
Published: (2024)
Automated Data Curation for Robust Language Model Fine-Tuning
by: Chen, Jiuhai, et al.
Published: (2024)
LaF-GRPO: In-Situ Navigation Instruction Generation for the Visually Impaired via GRPO with LLM-as-Follower Reward
by: Zhao, Yi, et al.
Published: (2025)
Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning
by: Prottasha, Nusrat Jahan, et al.
Published: (2024)