:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Yan, Yao
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2506.07824
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SlimGPT: Layer-wise Structured Pruning for Large Language Models
by: Ling, Gui, et al.
Published: (2024)

Mechanistic Steering of LLMs Reveals Layer-wise Feature Vulnerabilities in Adversarial Settings
by: Das, Nilanjana, et al.
Published: (2026)

OmniDrop: Layer-wise Token Pruning for Omni-modal LLMs via Query-Guidance
by: Park, Yeo Jeong, et al.
Published: (2026)

Information-Theoretic Greedy Layer-wise Training for Traffic Sign Recognition
by: Lyu, Shuyan, et al.
Published: (2025)

MISA: Memory-Efficient LLMs Optimization with Module-wise Importance Sampling
by: Liu, Yuxi, et al.
Published: (2025)

How Vision Becomes Language: A Layer-wise Information-Theoretic Analysis of Multimodal Reasoning
by: Wu, Hongxuan, et al.
Published: (2026)

Exploring Layer-wise Information Effectiveness for Post-Training Quantization in Small Language Models
by: Xiao, He, et al.
Published: (2025)

Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning
by: Deng, Yihe, et al.
Published: (2025)

LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
by: Yan, Zihe, et al.
Published: (2025)

A Layer-wise Analysis of Supervised Fine-Tuning
by: Zhao, Qinghua, et al.
Published: (2026)

Layer-wise Regularized Dropout for Neural Language Models
by: Ni, Shiwen, et al.
Published: (2024)

Composing Diffusion Policies for Few-shot Learning of Movement Trajectories
by: Patil, Omkar, et al.
Published: (2024)

Stochastic Layer-wise Learning: Scalable and Efficient Alternative to Backpropagation
by: Yin, Bojian, et al.
Published: (2025)

Layer-wise Positional Bias in Short-Context Language Modeling
by: Rahimi, Maryam, et al.
Published: (2026)

Unsupervised Layer-wise Score Aggregation for Textual OOD Detection
by: Darrin, Maxime, et al.
Published: (2023)

Resource-efficient Layer-wise Federated Self-supervised Learning
by: Tun, Ye Lin, et al.
Published: (2024)

Don't Look at the Numbers: Visual Anchoring Bias and Layer-wise Representation in VLMs
by: Shalankin, M.
Published: (2026)

Adaptive Layer-skipping in Pre-trained LLMs
by: Luo, Xuan, et al.
Published: (2025)

Off-Trajectory Reasoning: Can LLMs Collaborate on Reasoning Trajectory?
by: Li, Aochong Oliver, et al.
Published: (2025)

LAVa: Layer-wise KV Cache Eviction with Dynamic Budget Allocation
by: Shen, Yiqun, et al.
Published: (2025)

Efficient Layer-wise LLM Fine-tuning for Revision Intention Prediction
by: Liu, Zhexiong, et al.
Published: (2025)

Layer-wise QUBO-Based Training of CNN Classifiers for Quantum Annealing
by: Atallah, Mostafa, et al.
Published: (2026)

Two-Stage Grid Optimization for Group-wise Quantization of LLMs
by: Kim, Junhan, et al.
Published: (2026)

MID-L: Matrix-Interpolated Dropout Layer with Layer-wise Neuron Selection
by: Shaeri, Pouya, et al.
Published: (2025)

LayerCache: Exploiting Layer-wise Velocity Heterogeneity for Efficient Flow Matching Inference
by: Li, Guandong
Published: (2026)

Step-wise Adaptive Integration of Supervised Fine-tuning and Reinforcement Learning for Task-Specific LLMs
by: Chen, Jack, et al.
Published: (2025)

LP-DETR: Layer-wise Progressive Relations for Object Detection
by: Kang, Zhengjian, et al.
Published: (2025)

Zero-Shot Cellular Trajectory Map Matching
by: Shi, Weijie, et al.
Published: (2025)

DP-LLM: Runtime Model Adaptation with Dynamic Layer-wise Precision Assignment
by: Kwon, Sangwoo, et al.
Published: (2025)

LEVI: Generalizable Fine-tuning via Layer-wise Ensemble of Different Views
by: Roh, Yuji, et al.
Published: (2024)

Optimal FALQON for Quantum Approximate Optimization via Layer-wise Parameter Tuning
by: Mancini, Michael, et al.
Published: (2026)

Training Long-Context LLMs Efficiently via Chunk-wise Optimization
by: Li, Wenhao, et al.
Published: (2025)

DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
by: Yue, Murong, et al.
Published: (2024)

LEAP: Layer-wise Exit-Aware Pretraining for Efficient Transformer Inference
by: Kapadia, Shashank, et al.
Published: (2026)

DeepIcon: A Hierarchical Network for Layer-wise Icon Vectorization
by: Bing, Qi, et al.
Published: (2024)

Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis
by: Fartale, Harshwardhan, et al.
Published: (2025)

Improve Decoding Factuality by Token-wise Cross Layer Entropy of Large Language Models
by: Wu, Jialiang, et al.
Published: (2025)

Continual Learning with Embedding Layer Surgery and Task-wise Beam Search using Whisper
by: Kwok, Chin Yuen, et al.
Published: (2025)

AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing
by: Thillainathan, Sarubi, et al.
Published: (2026)

TGRPO :Fine-tuning Vision-Language-Action Model via Trajectory-wise Group Relative Policy Optimization
by: Chen, Zengjue, et al.
Published: (2025)