| Main Authors: | Yao, Xinhao, Qian, Hongjin, Hu, Xiaolin, Xu, Gengze, Liu, Wei, Luan, Jian, Wang, Bin, Liu, Yong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.02247 |
Similar Items
Enhancing In-Context Learning Performance with just SVD-Based Weight Pruning: A Theoretical Perspective
by: Yao, Xinhao, et al.
Published: (2024)
On the Emergence of Weak-to-Strong Generalization: A Bias-Variance Perspective
by: Xu, Gengze, et al.
Published: (2025)
PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning
by: Wang, Qibin, et al.
Published: (2024)
On the Blessing of Pre-training in Weak-to-Strong Generalization
by: Yao, Wei, et al.
Published: (2026)
The Capabilities and Limitations of Weak-to-Strong Generalization: Generalization and Calibration
by: Yao, Wei, et al.
Published: (2025)
DoTA: Weight-Decomposed Tensor Adaptation for Large Language Models
by: Hu, Xiaolin, et al.
Published: (2024)
On Weak-to-Strong Generalization and f-Divergence
by: Yao, Wei, et al.
Published: (2025)
Compositional Generalization from Learned Skills via CoT Training: A Theoretical and Structural Analysis for Reasoning
by: Yao, Xinhao, et al.
Published: (2025)
The Debate on RLVR Reasoning Capability Boundary: Shrinkage, Expansion, or Both? A Two-Stage Dynamic View
by: Yao, Xinhao, et al.
Published: (2025)
Look Within or Look Beyond? A Theoretical Comparison Between Parameter-Efficient and Full Fine-Tuning
by: Liu, Yongkang, et al.
Published: (2025)
Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner
by: Ma, Hao, et al.
Published: (2026)
Multi-branch of Attention Yields Accurate Results for Tabular Data
by: Li, Xuechen, et al.
Published: (2025)
Control Theoretic Approach to Fine-Tuning and Transfer Learning
by: Bayram, Erkan, et al.
Published: (2024)
SASA: Semantic-Aware Contrastive Learning Framework with Separated Attention for Triple Classification
by: Xiaodan, Xu, et al.
Published: (2026)
LoSiA: Efficient High-Rank Fine-Tuning via Subnet Localization and Optimization
by: Wang, Xujia, et al.
Published: (2025)
PSEO: Optimizing Post-hoc Stacking Ensemble Through Hyperparameter Tuning
by: Xu, Beicheng, et al.
Published: (2025)
Mixture of Diverse Size Experts
by: Sun, Manxi, et al.
Published: (2024)
Self-Generative Adversarial Fine-Tuning for Large Language Models
by: Wu, Shiguang, et al.
Published: (2026)
Towards Auto-Regressive Next-Token Prediction: In-Context Learning Emerges from Generalization
by: Gong, Zixuan, et al.
Published: (2025)
Beyond the Black Box: A Survey on the Theory and Mechanism of Large Language Models
by: Gan, Zeyu, et al.
Published: (2026)
Attention Mechanism, Max-Affine Partition, and Universal Approximation
by: Liu, Hude, et al.
Published: (2025)
Generative Representational Instruction Tuning
by: Muennighoff, Niklas, et al.
Published: (2024)
Fed-pilot: Optimizing LoRA Allocation for Efficient Federated Fine-Tuning with Heterogeneous Clients
by: Zhang, Zikai, et al.
Published: (2024)
Beyond Progress Measures: Theoretical Insights into the Mechanism of Grokking
by: Gu, Zihan, et al.
Published: (2025)
Learning to Think: Information-Theoretic Reinforcement Fine-Tuning for LLMs
by: Wang, Jingyao, et al.
Published: (2025)
Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis
by: Wang, Xu, et al.
Published: (2025)
Bi-LoRA: Efficient Sharpness-Aware Minimization for Fine-Tuning Large-Scale Models
by: Liu, Yuhang, et al.
Published: (2025)
ChunkFT: Byte-Streamed Optimization for Memory-Efficient Full Fine-Tuning
by: Liu, Yongkang, et al.
Published: (2026)
Fine-Tuning Without Forgetting In-Context Learning: A Theoretical Analysis of Linear Attention Models
by: Lee, Chungpa, et al.
Published: (2026)
Understanding and Preserving Safety in Fine-Tuned LLMs
by: Zhang, Jiawen, et al.
Published: (2026)
HoPE: A Novel Positional Encoding Without Long-Term Decay for Enhanced Context Awareness and Extrapolation
by: Chen, Yuhan, et al.
Published: (2024)
Preserving Domain Generalization in Fine-Tuning via Joint Parameter Selection
by: Pan, Bin, et al.
Published: (2025)
Information-Theoretic Generalization Bounds for Transductive Learning and its Applications
by: Tang, Huayi, et al.
Published: (2023)
Graph Attention is Not Always Beneficial: A Theoretical Analysis of Graph Attention Mechanisms via Contextual Stochastic Block Models
by: Ma, Zhongtian, et al.
Published: (2024)
Towards a Theoretical Understanding to the Generalization of RLHF
by: Li, Zhaochun, et al.
Published: (2026)
Rethinking Training Dynamics in Scale-wise Autoregressive Generation
by: Zhou, Gengze, et al.
Published: (2025)
Invariance Makes LLM Unlearning Resilient Even to Unanticipated Downstream Fine-Tuning
by: Wang, Changsheng, et al.
Published: (2025)
RPO:Reinforcement Fine-Tuning with Partial Reasoning Optimization
by: Yi, Hongzhu, et al.
Published: (2026)
STEP: Success-Rate-Aware Trajectory-Efficient Policy Optimization
by: Chen, Yuhan, et al.
Published: (2025)
An Optimization Framework for Differentially Private Sparse Fine-Tuning
by: Makni, Mehdi, et al.
Published: (2025)