Saved in:
| Main Author: | Yan, Yao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.07824 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SlimGPT: Layer-wise Structured Pruning for Large Language Models
by: Ling, Gui, et al.
Published: (2024)
by: Ling, Gui, et al.
Published: (2024)
Mechanistic Steering of LLMs Reveals Layer-wise Feature Vulnerabilities in Adversarial Settings
by: Das, Nilanjana, et al.
Published: (2026)
by: Das, Nilanjana, et al.
Published: (2026)
OmniDrop: Layer-wise Token Pruning for Omni-modal LLMs via Query-Guidance
by: Park, Yeo Jeong, et al.
Published: (2026)
by: Park, Yeo Jeong, et al.
Published: (2026)
Information-Theoretic Greedy Layer-wise Training for Traffic Sign Recognition
by: Lyu, Shuyan, et al.
Published: (2025)
by: Lyu, Shuyan, et al.
Published: (2025)
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
by: Liu, Yuxi, et al.
Published: (2025)
by: Liu, Yuxi, et al.
Published: (2025)
How Vision Becomes Language: A Layer-wise Information-Theoretic Analysis of Multimodal Reasoning
by: Wu, Hongxuan, et al.
Published: (2026)
by: Wu, Hongxuan, et al.
Published: (2026)
Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models
by: Xiao, He, et al.
Published: (2025)
by: Xiao, He, et al.
Published: (2025)
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
by: Deng, Yihe, et al.
Published: (2025)
by: Deng, Yihe, et al.
Published: (2025)
LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
by: Yan, Zihe, et al.
Published: (2025)
by: Yan, Zihe, et al.
Published: (2025)
A Layer-wise Analysis of Supervised Fine-Tuning
by: Zhao, Qinghua, et al.
Published: (2026)
by: Zhao, Qinghua, et al.
Published: (2026)
Layer-wise Regularized Dropout for Neural Language Models
by: Ni, Shiwen, et al.
Published: (2024)
by: Ni, Shiwen, et al.
Published: (2024)
Composing Diffusion Policies for Few-shot Learning of Movement Trajectories
by: Patil, Omkar, et al.
Published: (2024)
by: Patil, Omkar, et al.
Published: (2024)
Stochastic Layer-wise Learning: Scalable and Efficient Alternative to Backpropagation
by: Yin, Bojian, et al.
Published: (2025)
by: Yin, Bojian, et al.
Published: (2025)
Layer-wise Positional Bias in Short-Context Language Modeling
by: Rahimi, Maryam, et al.
Published: (2026)
by: Rahimi, Maryam, et al.
Published: (2026)
Unsupervised Layer-wise Score Aggregation for Textual OOD Detection
by: Darrin, Maxime, et al.
Published: (2023)
by: Darrin, Maxime, et al.
Published: (2023)
Resource-efficient Layer-wise Federated Self-supervised Learning
by: Tun, Ye Lin, et al.
Published: (2024)
by: Tun, Ye Lin, et al.
Published: (2024)
Don't Look at the Numbers: Visual Anchoring Bias and Layer-wise Representation in VLMs
by: Shalankin, M.
Published: (2026)
by: Shalankin, M.
Published: (2026)
Adaptive Layer-skipping in Pre-trained LLMs
by: Luo, Xuan, et al.
Published: (2025)
by: Luo, Xuan, et al.
Published: (2025)
Off-Trajectory Reasoning: Can LLMs Collaborate on Reasoning Trajectory?
by: Li, Aochong Oliver, et al.
Published: (2025)
by: Li, Aochong Oliver, et al.
Published: (2025)
LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
by: Shen, Yiqun, et al.
Published: (2025)
by: Shen, Yiqun, et al.
Published: (2025)
Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction
by: Liu, Zhexiong, et al.
Published: (2025)
by: Liu, Zhexiong, et al.
Published: (2025)
Layer-wise QUBO-Based Training of CNN Classifiers for Quantum Annealing
by: Atallah, Mostafa, et al.
Published: (2026)
by: Atallah, Mostafa, et al.
Published: (2026)
Two-Stage Grid Optimization for Group-wise Quantization of LLMs
by: Kim, Junhan, et al.
Published: (2026)
by: Kim, Junhan, et al.
Published: (2026)
MID-L: Matrix-Interpolated Dropout Layer with Layer-wise Neuron Selection
by: Shaeri, Pouya, et al.
Published: (2025)
by: Shaeri, Pouya, et al.
Published: (2025)
LayerCache: Exploiting Layer-wise Velocity Heterogeneity for Efficient Flow Matching Inference
by: Li, Guandong
Published: (2026)
by: Li, Guandong
Published: (2026)
Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning for Task-Specific LLMs
by: Chen, Jack, et al.
Published: (2025)
by: Chen, Jack, et al.
Published: (2025)
LP-DETR: Layer-wise Progressive Relations for Object Detection
by: Kang, Zhengjian, et al.
Published: (2025)
by: Kang, Zhengjian, et al.
Published: (2025)
Zero-Shot Cellular Trajectory Map Matching
by: Shi, Weijie, et al.
Published: (2025)
by: Shi, Weijie, et al.
Published: (2025)
DP-LLM: Runtime Model Adaptation with Dynamic Layer-wise Precision Assignment
by: Kwon, Sangwoo, et al.
Published: (2025)
by: Kwon, Sangwoo, et al.
Published: (2025)
LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views
by: Roh, Yuji, et al.
Published: (2024)
by: Roh, Yuji, et al.
Published: (2024)
Optimal FALQON for Quantum Approximate Optimization via Layer-wise Parameter Tuning
by: Mancini, Michael, et al.
Published: (2026)
by: Mancini, Michael, et al.
Published: (2026)
Training Long-Context LLMs Efficiently via Chunk-wise Optimization
by: Li, Wenhao, et al.
Published: (2025)
by: Li, Wenhao, et al.
Published: (2025)
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
by: Yue, Murong, et al.
Published: (2024)
by: Yue, Murong, et al.
Published: (2024)
LEAP: Layer-wise Exit-Aware Pretraining for Efficient Transformer Inference
by: Kapadia, Shashank, et al.
Published: (2026)
by: Kapadia, Shashank, et al.
Published: (2026)
DeepIcon: A Hierarchical Network for Layer-wise Icon Vectorization
by: Bing, Qi, et al.
Published: (2024)
by: Bing, Qi, et al.
Published: (2024)
Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis
by: Fartale, Harshwardhan, et al.
Published: (2025)
by: Fartale, Harshwardhan, et al.
Published: (2025)
Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models
by: Wu, Jialiang, et al.
Published: (2025)
by: Wu, Jialiang, et al.
Published: (2025)
Continual Learning with Embedding Layer Surgery and Task-wise Beam Search using Whisper
by: Kwok, Chin Yuen, et al.
Published: (2025)
by: Kwok, Chin Yuen, et al.
Published: (2025)
AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing
by: Thillainathan, Sarubi, et al.
Published: (2026)
by: Thillainathan, Sarubi, et al.
Published: (2026)
TGRPO :Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization
by: Chen, Zengjue, et al.
Published: (2025)
by: Chen, Zengjue, et al.
Published: (2025)
Similar Items
-
SlimGPT: Layer-wise Structured Pruning for Large Language Models
by: Ling, Gui, et al.
Published: (2024) -
Mechanistic Steering of LLMs Reveals Layer-wise Feature Vulnerabilities in Adversarial Settings
by: Das, Nilanjana, et al.
Published: (2026) -
OmniDrop: Layer-wise Token Pruning for Omni-modal LLMs via Query-Guidance
by: Park, Yeo Jeong, et al.
Published: (2026) -
Information-Theoretic Greedy Layer-wise Training for Traffic Sign Recognition
by: Lyu, Shuyan, et al.
Published: (2025) -
MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
by: Liu, Yuxi, et al.
Published: (2025)