Saved in:
| Main Authors: | Hu, Xiao, Lian, Yuansheng, Zhang, Ke, Li, Yunxuan, Su, Yuelong, Li, Meng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.22333 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective
by: Hu, Xiao, et al.
Published: (2026)
by: Hu, Xiao, et al.
Published: (2026)
Game-Theoretic Modeling of Vehicle Unprotected Left Turns Considering Drivers' Bounded Rationality
by: Lian, Yuansheng, et al.
Published: (2025)
by: Lian, Yuansheng, et al.
Published: (2025)
Learning from models beyond fine-tuning
by: Zheng, Hongling, et al.
Published: (2023)
by: Zheng, Hongling, et al.
Published: (2023)
Predicting trucking accidents with truck drivers 'safety climate perception across companies: A transfer learning approach
by: Sun, Kailai, et al.
Published: (2024)
by: Sun, Kailai, et al.
Published: (2024)
Using construction waste hauling trucks' GPS data to classify earthwork-related locations: A Chengdu case study
by: Yu, Lei, et al.
Published: (2024)
by: Yu, Lei, et al.
Published: (2024)
Uncertainty quantification in fine-tuned LLMs using LoRA ensembles
by: Balabanov, Oleksandr, et al.
Published: (2024)
by: Balabanov, Oleksandr, et al.
Published: (2024)
Physics-informed fine-tuning of foundation models for partial differential equations
by: Medvedev, Vlad, et al.
Published: (2026)
by: Medvedev, Vlad, et al.
Published: (2026)
Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective
by: Xie, Hong, et al.
Published: (2026)
by: Xie, Hong, et al.
Published: (2026)
Empirical influence functions to understand the logic of fine-tuning
by: Matelsky, Jordan K., et al.
Published: (2024)
by: Matelsky, Jordan K., et al.
Published: (2024)
FLoRA: Fused forward-backward adapters for parameter efficient fine-tuning and reducing inference-time latencies of LLMs
by: Gowda, Dhananjaya, et al.
Published: (2025)
by: Gowda, Dhananjaya, et al.
Published: (2025)
A Tidal Current Speed Forecasting Model based on Multi-Periodicity Learning
by: Cheng, Tengfei, et al.
Published: (2024)
by: Cheng, Tengfei, et al.
Published: (2024)
Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores
by: Lu, Jun, et al.
Published: (2024)
by: Lu, Jun, et al.
Published: (2024)
KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs
by: Xu, Yongqin, et al.
Published: (2024)
by: Xu, Yongqin, et al.
Published: (2024)
Short-term prediction of construction waste transport activities using AI-Truck
by: Xu, Meng, et al.
Published: (2023)
by: Xu, Meng, et al.
Published: (2023)
LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning
by: Mao, Yansheng, et al.
Published: (2024)
by: Mao, Yansheng, et al.
Published: (2024)
ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation
by: Meng, Chuiyang, et al.
Published: (2026)
by: Meng, Chuiyang, et al.
Published: (2026)
Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning for Task-Specific LLMs
by: Chen, Jack, et al.
Published: (2025)
by: Chen, Jack, et al.
Published: (2025)
Overcoming systematic softening in universal machine learning interatomic potentials by fine-tuning
by: Deng, Bowen, et al.
Published: (2024)
by: Deng, Bowen, et al.
Published: (2024)
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
by: Zhao, Hanyang, et al.
Published: (2024)
by: Zhao, Hanyang, et al.
Published: (2024)
Train on Validation (ToV): Fast data selection with applications to fine-tuning
by: Jain, Ayush, et al.
Published: (2025)
by: Jain, Ayush, et al.
Published: (2025)
Uncertainty modeling for fine-tuned implicit functions
by: Susmelj, Anna, et al.
Published: (2024)
by: Susmelj, Anna, et al.
Published: (2024)
Rethinking harmless refusals when fine-tuning foundation models
by: Pop, Florin, et al.
Published: (2024)
by: Pop, Florin, et al.
Published: (2024)
Jal Anveshak: Prediction of fishing zones using fine-tuned LlaMa 2
by: Mejari, Arnav, et al.
Published: (2024)
by: Mejari, Arnav, et al.
Published: (2024)
Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
by: Vanlioglu, Abdullah
Published: (2025)
by: Vanlioglu, Abdullah
Published: (2025)
Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective
by: Xu, Chengyin, et al.
Published: (2025)
by: Xu, Chengyin, et al.
Published: (2025)
GeoLoRA: Geometric integration for parameter efficient fine-tuning
by: Schotthöfer, Steffen, et al.
Published: (2024)
by: Schotthöfer, Steffen, et al.
Published: (2024)
LncRNA-disease association prediction method based on heterogeneous information completion and convolutional neural network
by: Xi, Wen-Yu, et al.
Published: (2024)
by: Xi, Wen-Yu, et al.
Published: (2024)
One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments
by: Yi, Ke, et al.
Published: (2024)
by: Yi, Ke, et al.
Published: (2024)
Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models
by: Ma, Olivia, et al.
Published: (2024)
by: Ma, Olivia, et al.
Published: (2024)
Open-weight genome language model safeguards: Assessing robustness via adversarial fine-tuning
by: Black, James R. M., et al.
Published: (2025)
by: Black, James R. M., et al.
Published: (2025)
LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning
by: Liu, Zihang, et al.
Published: (2025)
by: Liu, Zihang, et al.
Published: (2025)
MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs
by: Zhao, Guojiang, et al.
Published: (2025)
by: Zhao, Guojiang, et al.
Published: (2025)
A multi-locus predictiveness curve and its summary assessment for genetic risk prediction
by: Wei, Changshuai, et al.
Published: (2025)
by: Wei, Changshuai, et al.
Published: (2025)
Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks
by: Torres, Dorothy, et al.
Published: (2026)
by: Torres, Dorothy, et al.
Published: (2026)
Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models
by: Liu, Junhao, et al.
Published: (2025)
by: Liu, Junhao, et al.
Published: (2025)
Activated LoRA: Fine-tuned LLMs for Intrinsics
by: Greenewald, Kristjan, et al.
Published: (2025)
by: Greenewald, Kristjan, et al.
Published: (2025)
Evolutionary fine tuning of quantized convolution-based deep learning models
by: Pietroń, Marcin
Published: (2026)
by: Pietroń, Marcin
Published: (2026)
Spatiotemporal Field Generation Based on Hybrid Mamba-Transformer with Physics-informed Fine-tuning
by: Du, Peimian, et al.
Published: (2025)
by: Du, Peimian, et al.
Published: (2025)
PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs
by: Yu, Tongzhou, et al.
Published: (2025)
by: Yu, Tongzhou, et al.
Published: (2025)
Static and multivariate-temporal attentive fusion transformer for readmission risk prediction
by: Sun, Zhe, et al.
Published: (2024)
by: Sun, Zhe, et al.
Published: (2024)
Similar Items
-
Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective
by: Hu, Xiao, et al.
Published: (2026) -
Game-Theoretic Modeling of Vehicle Unprotected Left Turns Considering Drivers' Bounded Rationality
by: Lian, Yuansheng, et al.
Published: (2025) -
Learning from models beyond fine-tuning
by: Zheng, Hongling, et al.
Published: (2023) -
Predicting trucking accidents with truck drivers 'safety climate perception across companies: A transfer learning approach
by: Sun, Kailai, et al.
Published: (2024) -
Using construction waste hauling trucks' GPS data to classify earthwork-related locations: A Chengdu case study
by: Yu, Lei, et al.
Published: (2024)