| Main Authors: | Li, Yueyan, Gao, Wenhao, Yuan, Caixia, Wang, Xiaojie |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.06106 |
Similar Items
Subgraph-level Universal Prompt Tuning
by: Lee, Junhyun, et al.
Published: (2024)
Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer
by: Du, Guodong, et al.
Published: (2025)
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
by: Gao, Ziqi, et al.
Published: (2024)
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning
by: Zhou, Han, et al.
Published: (2023)
Fine-Tuning Language Models with Reward Learning on Policy
by: Lang, Hao, et al.
Published: (2024)
Linear Chain Transformation: Expanding Optimization Dynamics for Fine-Tuning Large Language Models
by: Wang, Yulong, et al.
Published: (2024)
Selection of LLM Fine-Tuning Data based on Orthogonal Rules
by: Li, Xiaomin, et al.
Published: (2024)
BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models
by: Chang, Aofei, et al.
Published: (2024)
Proximal Supervised Fine-Tuning
by: Zhu, Wenhong, et al.
Published: (2025)
MMICT: Boosting Multi-Modal Fine-Tuning with In-Context Examples
by: Chen, Tao, et al.
Published: (2023)
Supervised Fine-Tuning as Inverse Reinforcement Learning
by: Sun, Hao
Published: (2024)
FactLens: Benchmarking Fine-Grained Fact Verification
by: Mitra, Kushan, et al.
Published: (2024)
Teaching LLMs How to Learn with Contextual Fine-Tuning
by: Choi, Younwoo, et al.
Published: (2025)
iTool: Reinforced Fine-Tuning with Dynamic Deficiency Calibration for Advanced Tool Use
by: Zeng, Yirong, et al.
Published: (2025)
Parameter-Efficient Fine-Tuning for Foundation Models
by: Zhang, Dan, et al.
Published: (2025)
Parameter Efficient Quasi-Orthogonal Fine-Tuning via Givens Rotation
by: Ma, Xinyu, et al.
Published: (2024)
Boosting Large Language Models with Mask Fine-Tuning
by: Zhang, Mingyuan, et al.
Published: (2025)
Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization
by: Thulke, David, et al.
Published: (2024)
Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing
by: Zhang, Yi-Kai, et al.
Published: (2025)
Efficient Differentially Private Fine-Tuning of LLMs via Reinforcement Learning
by: Khadangi, Afshin, et al.
Published: (2025)
GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning
by: Zhou, Sifan, et al.
Published: (2025)
Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs
by: Wang, Ruoyu, et al.
Published: (2024)
Crafting Efficient Fine-Tuning Strategies for Large Language Models
by: Oliver, Michael, et al.
Published: (2024)
Order-Independence Without Fine Tuning
by: McIlroy-Young, Reid, et al.
Published: (2024)
Model Editing by Standard Fine-Tuning
by: Gangadhar, Govind, et al.
Published: (2024)
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
by: Chen, Zixiang, et al.
Published: (2024)
Blending Supervised and Reinforcement Fine-Tuning with Prefix Sampling
by: Huang, Zeyu, et al.
Published: (2025)
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
by: Wang, Luping, et al.
Published: (2024)
Stabilizing LLM Supervised Fine-Tuning via Explicit Distributional Control
by: Wang, Xinyu, et al.
Published: (2026)
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024)
Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis
by: Wang, Xu, et al.
Published: (2025)
Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning
by: Lyu, Mengyao, et al.
Published: (2025)
FineSteer: A Unified Framework for Fine-Grained Inference-Time Steering in Large Language Models
by: Weng, Zixuan, et al.
Published: (2026)
SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning
by: Fu, Yuqian, et al.
Published: (2025)
EMORL: Ensemble Multi-Objective Reinforcement Learning for Efficient and Flexible LLM Fine-Tuning
by: Kong, Lingxiao, et al.
Published: (2025)
Supervised Fine-Tuning Needs to Unlock the Potential of Token Priority
by: Shen, Zhanming, et al.
Published: (2026)
Amplifying, Not Learning: Fine-Tuned AI Text Detectors Amplify a Pretrained Direction
by: Smirnov, Alexander
Published: (2026)
MiCA Learns More Knowledge Than LoRA and Full Fine-Tuning
by: Rüdiger, Sten, et al.
Published: (2026)
DARE the Extreme: Revisiting Delta-Parameter Pruning For Fine-Tuned Models
by: Deng, Wenlong, et al.
Published: (2024)
Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity
by: Wang, Tuowei, et al.
Published: (2025)