:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Hu, Xiao, Lian, Yuansheng, Zhang, Ke, Li, Yunxuan, Su, Yuelong, Li, Meng
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2510.22333
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective
by: Hu, Xiao, et al.
Published: (2026)

Game-Theoretic Modeling of Vehicle Unprotected Left Turns Considering Drivers' Bounded Rationality
by: Lian, Yuansheng, et al.
Published: (2025)

Learning from models beyond fine-tuning
by: Zheng, Hongling, et al.
Published: (2023)

Predicting trucking accidents with truck drivers 'safety climate perception across companies: A transfer learning approach
by: Sun, Kailai, et al.
Published: (2024)

Using construction waste hauling trucks' GPS data to classify earthwork-related locations: A Chengdu case study
by: Yu, Lei, et al.
Published: (2024)

Uncertainty quantification in fine-tuned LLMs using LoRA ensembles
by: Balabanov, Oleksandr, et al.
Published: (2024)

Physics-informed fine-tuning of foundation models for partial differential equations
by: Medvedev, Vlad, et al.
Published: (2026)

Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective
by: Xie, Hong, et al.
Published: (2026)

Empirical influence functions to understand the logic of fine-tuning
by: Matelsky, Jordan K., et al.
Published: (2024)

FLoRA: Fused forward-backward adapters for parameter efficient fine-tuning and reducing inference-time latencies of LLMs
by: Gowda, Dhananjaya, et al.
Published: (2025)

A Tidal Current Speed Forecasting Model based on Multi-Periodicity Learning
by: Cheng, Tengfei, et al.
Published: (2024)

Improving embedding with contrastive fine-tuning on small datasets with expert-augmented scores
by: Lu, Jun, et al.
Published: (2024)

KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs
by: Xu, Yongqin, et al.
Published: (2024)

Short-term prediction of construction waste transport activities using AI-Truck
by: Xu, Meng, et al.
Published: (2023)

LIFT: Improving Long Context Understanding Through Long Input Fine-Tuning
by: Mao, Yansheng, et al.
Published: (2024)

ZorBA: Zeroth-order Federated Fine-tuning of LLMs with Heterogeneous Block Activation
by: Meng, Chuiyang, et al.
Published: (2026)

Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning for Task-Specific LLMs
by: Chen, Jack, et al.
Published: (2025)

Overcoming systematic softening in universal machine learning interatomic potentials by fine-tuning
by: Deng, Bowen, et al.
Published: (2024)

Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
by: Zhao, Hanyang, et al.
Published: (2024)

Train on Validation (ToV): Fast data selection with applications to fine-tuning
by: Jain, Ayush, et al.
Published: (2025)

Uncertainty modeling for fine-tuned implicit functions
by: Susmelj, Anna, et al.
Published: (2024)

Rethinking harmless refusals when fine-tuning foundation models
by: Pop, Florin, et al.
Published: (2024)

Jal Anveshak: Prediction of fishing zones using fine-tuned LlaMa 2
by: Mejari, Arnav, et al.
Published: (2024)

Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning
by: Vanlioglu, Abdullah
Published: (2025)

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective
by: Xu, Chengyin, et al.
Published: (2025)

GeoLoRA: Geometric integration for parameter efficient fine-tuning
by: Schotthöfer, Steffen, et al.
Published: (2024)

LncRNA-disease association prediction method based on heterogeneous information completion and convolutional neural network
by: Xi, Wen-Yu, et al.
Published: (2024)

One QuantLLM for ALL: Fine-tuning Quantized LLMs Once for Efficient Deployments
by: Yi, Ke, et al.
Published: (2024)

Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models
by: Ma, Olivia, et al.
Published: (2024)

Open-weight genome language model safeguards: Assessing robustness via adversarial fine-tuning
by: Black, James R. M., et al.
Published: (2025)

LIFT the Veil for the Truth: Principal Weights Emerge after Rank Reduction for Reasoning-Focused Supervised Fine-Tuning
by: Liu, Zihang, et al.
Published: (2025)

MolReasoner: Toward Effective and Interpretable Reasoning for Molecular LLMs
by: Zhao, Guojiang, et al.
Published: (2025)

A multi-locus predictiveness curve and its summary assessment for genetic risk prediction
by: Wei, Changshuai, et al.
Published: (2025)

Explainable AML Triage with LLMs: Evidence Retrieval and Counterfactual Checks
by: Torres, Dorothy, et al.
Published: (2026)

Revitalizing Black-Box Interpretability: Actionable Interpretability for LLMs via Proxy Models
by: Liu, Junhao, et al.
Published: (2025)

Activated LoRA: Fine-tuned LLMs for Intrinsics
by: Greenewald, Kristjan, et al.
Published: (2025)

Evolutionary fine tuning of quantized convolution-based deep learning models
by: Pietroń, Marcin
Published: (2026)

Spatiotemporal Field Generation Based on Hybrid Mamba-Transformer with Physics-informed Fine-tuning
by: Du, Peimian, et al.
Published: (2025)

PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs
by: Yu, Tongzhou, et al.
Published: (2025)

Static and multivariate-temporal attentive fusion transformer for readmission risk prediction
by: Sun, Zhe, et al.
Published: (2024)