:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Zhiming; Wei, Yujie; Feng, Lei; Su, Xiu; Xia, Xiaobo; Guan, Weili; Xie, Zeke; Yang, Shuo
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.01167

Similar Items

Learning to Accelerate Vision-Language-Action Models through Adaptive Visual Token Caching
by: Wei, Yujie, et al.
Published: (2026)

UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation
by: Guo, Jiyu, et al.
Published: (2025)

Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning
by: Zhou, Jianyi, et al.
Published: (2026)

LayerMatch: Do Pseudo-labels Benefit All Layers?
by: Liang, Chaoqi, et al.
Published: (2024)

Model Specific Task Similarity for Vision Language Model Selection via Layer Conductance
by: Yang, Wei, et al.
Published: (2026)

Frozen-Tag-Based Physical-Layer Authentication Against User Interference
by: Yao, Lei, et al.
Published: (2026)

PUMA: Layer-Pruned Language Model for Efficient Unified Multimodal Retrieval with Modality-Adaptive Learning
by: Lyu, Yibo, et al.
Published: (2025)

Thickness‐Dependent Creep in Lithium Layers of All‐Solid‐State Batteries under Stack Pressures
by: Chuangchuang Duan, et al.
Published: (2025)

ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation
by: Dai, Weisheng, et al.
Published: (2026)

Calibrated Multimodal Representation Learning with Missing Modalities
by: Liu, Xiaohao, et al.
Published: (2025)

APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation
by: Zhang, Daoxuan, et al.
Published: (2026)

Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024)

Not All Noises Are Created Equally: Diffusion Noise Selection and Optimization
by: Qi, Zipeng, et al.
Published: (2024)

Tag-based Physical-Layer Authentication Against Message Interference
by: Yao, Lei, et al.
Published: (2026)

Language-Specific Layer Matters: Efficient Multilingual Enhancement for Large Vision-Language Models
by: Fan, Yuchun, et al.
Published: (2025)

MangaFlow: An End-to-End Agentic Framework for Controllable Story to Manga Generation
by: Wang, Muyao, et al.
Published: (2026)

MultiPriv: Benchmarking Individual-Level Privacy Reasoning in Vision-Language Models
by: Sun, Xiongtao, et al.
Published: (2025)

L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models
by: Liu, Xiaohao, et al.
Published: (2025)

Last Layer Empirical Bayes
by: Villecroze, Valentin, et al.
Published: (2025)

Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
by: Li, Zhenyang, et al.
Published: (2024)

First Glimpse on Physical Layer Security in Internet of Vehicles: Transformed from Communication Interference to Sensing Interference
by: Li, Kaixuan, et al.
Published: (2025)

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
by: Chuang, Yung-Sung, et al.
Published: (2023)

Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
by: Yu, Songsong, et al.
Published: (2025)

Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models
by: Liu, Ziwei, et al.
Published: (2025)

Mano: Restriking Manifold Optimization for LLM Training
by: Gu, Yufei, et al.
Published: (2026)

Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs
by: Yang, Zhipeng, et al.
Published: (2025)

Controllable Layer Decomposition for Reversible Multi-Layer Image Generation
by: Liu, Zihao, et al.
Published: (2025)

Study on Improving the Vertical Thermal Conductivity of the Film by Single‐Layer Alumina Arrangement
by: Jin‐Biao Wang, et al.
Published: (2025)

FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning
by: Wen, Haokun, et al.
Published: (2026)

Double‐side Interfacial Engineering of Hole Transport Layer Enables Efficient and Operationally Stable Colloidal Quantum Dot Solar Cells
by: Xin Wen, et al.
Published: (2025)

Benchmarking Large Vision-Language Models on Fine-Grained Image Tasks: A Comprehensive Evaluation
by: Yu, Hong-Tao, et al.
Published: (2025)

Empirical Study of Large Language Models as Automated Essay Scoring Tools in English Composition: Taking TOEFL Independent Writing Task for Example
by: Xia, Wei, et al.
Published: (2024)

Understanding Layer Significance in LLM Alignment
by: Shi, Guangyuan, et al.
Published: (2024)

Not All Layers Need Tuning: Selective Layer Restoration Recovers Diversity
by: Zhang, Bowen, et al.
Published: (2026)

CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think
by: Sun, Zening, et al.
Published: (2026)

A Single-Layer Model Can Do Language Modeling
by: Wang, Zanmin
Published: (2026)

PhysLayer: Language-Guided Layered Animation with Depth-Aware Physics
by: Xie, Tianyidan, et al.
Published: (2026)

SonoVision: A Computer Vision Approach for Helping Visually Challenged Individuals Locate Objects with the Help of Sound Cues
by: Zishan, Md Abu Obaida, et al.
Published: (2025)

Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models
by: Zhao, Zheng, et al.
Published: (2024)

Logic Unseen: Revealing the Logical Blindspots of Vision-Language Models
by: Zhou, Yuchen, et al.
Published: (2025)