Saved in:
| Main Authors: | Wang, Ziming, Shi, Zeyu, Zhou, Haoyi, Gao, Shiqi, Sun, Qingyun, Li, Jianxin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.20903 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty
by: Shi, Zeyu, et al.
Published: (2025)
by: Shi, Zeyu, et al.
Published: (2025)
MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery
by: Lu, Feihong, et al.
Published: (2024)
by: Lu, Feihong, et al.
Published: (2024)
Stage-wise Fine-tuning for Graph-to-Text Generation
by: Wang, Qingyun, et al.
Published: (2021)
by: Wang, Qingyun, et al.
Published: (2021)
Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs
by: Shen, Zichao, et al.
Published: (2024)
by: Shen, Zichao, et al.
Published: (2024)
Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
by: Zhou, Huichi, et al.
Published: (2025)
by: Zhou, Huichi, et al.
Published: (2025)
Explain Less, Understand More: Jargon Detection via Personalized Parameter-Efficient Fine-tuning
by: Wu, Bohao, et al.
Published: (2025)
by: Wu, Bohao, et al.
Published: (2025)
Double-Calibration: Towards Reliable LLMs via Calibrating Knowledge and Reasoning Confidence
by: Lu, Yuyin, et al.
Published: (2026)
by: Lu, Yuyin, et al.
Published: (2026)
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
by: Christophe, Clément, et al.
Published: (2024)
by: Christophe, Clément, et al.
Published: (2024)
Be Careful When Fine-tuning On Open-Source LLMs: Your Fine-tuning Data Could Be Secretly Stolen!
by: Zhang, Zhexin, et al.
Published: (2025)
by: Zhang, Zhexin, et al.
Published: (2025)
Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models
by: Yu, Chuanyue, et al.
Published: (2026)
by: Yu, Chuanyue, et al.
Published: (2026)
GRAVER: Generative Graph Vocabularies for Robust Graph Foundation Models Fine-tuning
by: Yuan, Haonan, et al.
Published: (2025)
by: Yuan, Haonan, et al.
Published: (2025)
From Text to Trajectory: Exploring Complex Constraint Representation and Decomposition in Safe Reinforcement Learning
by: Dong, Pusen, et al.
Published: (2024)
by: Dong, Pusen, et al.
Published: (2024)
Disentangling Reasoning Tokens and Boilerplate Tokens For Language Model Fine-tuning
by: Ye, Ziang, et al.
Published: (2024)
by: Ye, Ziang, et al.
Published: (2024)
Towards Context-Robust LLMs: A Gated Representation Fine-tuning Approach
by: Zeng, Shenglai, et al.
Published: (2025)
by: Zeng, Shenglai, et al.
Published: (2025)
KnowTuning: Knowledge-aware Fine-tuning for Large Language Models
by: Lyu, Yougang, et al.
Published: (2024)
by: Lyu, Yougang, et al.
Published: (2024)
Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration
by: Fu, Wenjie, et al.
Published: (2023)
by: Fu, Wenjie, et al.
Published: (2023)
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
by: Zhou, Sifan, et al.
Published: (2025)
by: Zhou, Sifan, et al.
Published: (2025)
Intent Mismatch Causes LLMs to Get Lost in Multi-Turn Conversation
by: Liu, Geng, et al.
Published: (2026)
by: Liu, Geng, et al.
Published: (2026)
AgenticGEO: A Self-Evolving Agentic System for Generative Engine Optimization
by: Yuan, Jiaqi, et al.
Published: (2026)
by: Yuan, Jiaqi, et al.
Published: (2026)
UltraLink: An Open-Source Knowledge-Enhanced Multilingual Supervised Fine-tuning Dataset
by: Wang, Haoyu, et al.
Published: (2024)
by: Wang, Haoyu, et al.
Published: (2024)
KaFT: Knowledge-aware Fine-tuning for Boosting LLMs' Domain-specific Question-Answering Performance
by: Zhong, Qihuang, et al.
Published: (2025)
by: Zhong, Qihuang, et al.
Published: (2025)
KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs
by: Xu, Yongqin, et al.
Published: (2024)
by: Xu, Yongqin, et al.
Published: (2024)
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition
by: Dong, Guanting, et al.
Published: (2023)
by: Dong, Guanting, et al.
Published: (2023)
Instructing the Architecture Search for Spatial-temporal Sequence Forecasting with LLM
by: Xue, Xin, et al.
Published: (2025)
by: Xue, Xin, et al.
Published: (2025)
medIKAL: Integrating Knowledge Graphs as Assistants of LLMs for Enhanced Clinical Diagnosis on EMRs
by: Jia, Mingyi, et al.
Published: (2024)
by: Jia, Mingyi, et al.
Published: (2024)
Identity Lock: Locking API Fine-tuned LLMs With Identity-based Wake Words
by: Su, Hongyu, et al.
Published: (2025)
by: Su, Hongyu, et al.
Published: (2025)
Topic Modeling with Fine-tuning LLMs and Bag of Sentences
by: Schneider, Johannes
Published: (2024)
by: Schneider, Johannes
Published: (2024)
Fine-tuning and Utilization Methods of Domain-specific LLMs
by: Jeong, Cheonsu
Published: (2024)
by: Jeong, Cheonsu
Published: (2024)
Precise Localization of Memories: A Fine-grained Neuron-level Knowledge Editing Technique for LLMs
by: Pan, Haowen, et al.
Published: (2025)
by: Pan, Haowen, et al.
Published: (2025)
Structure-aware Fine-tuning for Code Pre-trained Models
by: Wu, Jiayi, et al.
Published: (2024)
by: Wu, Jiayi, et al.
Published: (2024)
ITERTL: An Iterative Framework for Fine-tuning LLMs for RTL Code Generation
by: Wu, Peiyang, et al.
Published: (2024)
by: Wu, Peiyang, et al.
Published: (2024)
RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models
by: Zhang, Fujun, et al.
Published: (2025)
by: Zhang, Fujun, et al.
Published: (2025)
UniARM: Towards a Unified Autoregressive Reward Model for Multi-Objective Test-Time Alignment
by: Xie, Hongyan, et al.
Published: (2026)
by: Xie, Hongyan, et al.
Published: (2026)
oMeBench: Towards Robust Benchmarking of LLMs in Organic Mechanism Elucidation and Reasoning
by: Xu, Ruiling, et al.
Published: (2025)
by: Xu, Ruiling, et al.
Published: (2025)
RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning
by: Wang, Haoyu, et al.
Published: (2024)
by: Wang, Haoyu, et al.
Published: (2024)
FineTuneBench: How well do commercial fine-tuning APIs infuse knowledge into LLMs?
by: Wu, Eric, et al.
Published: (2024)
by: Wu, Eric, et al.
Published: (2024)
Aloe: A Family of Fine-tuned Open Healthcare LLMs
by: Gururajan, Ashwin Kumar, et al.
Published: (2024)
by: Gururajan, Ashwin Kumar, et al.
Published: (2024)
Agent Fine-tuning through Distillation for Domain-specific LLMs in Microdomains
by: Xue, Yawen, et al.
Published: (2025)
by: Xue, Yawen, et al.
Published: (2025)
Relational Knowledge Distillation Using Fine-tuned Function Vectors
by: Kang, Andrea, et al.
Published: (2026)
by: Kang, Andrea, et al.
Published: (2026)
Fine-tuning Done Right in Model Editing
by: Yang, Wanli, et al.
Published: (2025)
by: Yang, Wanli, et al.
Published: (2025)
Similar Items
-
Fine-Tuned LLMs Know They Don't Know: A Parameter-Efficient Approach to Recovering Honesty
by: Shi, Zeyu, et al.
Published: (2025) -
MIKO: Multimodal Intention Knowledge Distillation from Large Language Models for Social-Media Commonsense Discovery
by: Lu, Feihong, et al.
Published: (2024) -
Stage-wise Fine-tuning for Graph-to-Text Generation
by: Wang, Qingyun, et al.
Published: (2021) -
Beyond Human Preferences: Exploring Reinforcement Learning Trajectory Evaluation and Improvement through LLMs
by: Shen, Zichao, et al.
Published: (2024) -
Memento: Fine-tuning LLM Agents without Fine-tuning LLMs
by: Zhou, Huichi, et al.
Published: (2025)