Saved in:
| Main Authors: | Liu, Zhiming, Wei, Yujie, Feng, Lei, Su, Xiu, Xia, Xiaobo, Guan, Weili, Xie, Zeke, Yang, Shuo |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01167 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning to Accelerate Vision-Language-Action Models through Adaptive Visual Token Caching
by: Wei, Yujie, et al.
Published: (2026)
by: Wei, Yujie, et al.
Published: (2026)
UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation
by: Guo, Jiyu, et al.
Published: (2025)
by: Guo, Jiyu, et al.
Published: (2025)
Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning
by: Zhou, Jianyi, et al.
Published: (2026)
by: Zhou, Jianyi, et al.
Published: (2026)
LayerMatch: Do Pseudo-labels Benefit All Layers?
by: Liang, Chaoqi, et al.
Published: (2024)
by: Liang, Chaoqi, et al.
Published: (2024)
Model Specific Task Similarity for Vision Language Model Selection via Layer Conductance
by: Yang, Wei, et al.
Published: (2026)
by: Yang, Wei, et al.
Published: (2026)
Frozen-Tag-Based Physical-Layer Authentication Against User Interference
by: Yao, Lei, et al.
Published: (2026)
by: Yao, Lei, et al.
Published: (2026)
PUMA: Layer-Pruned Language Model for Efficient Unified Multimodal Retrieval with Modality-Adaptive Learning
by: Lyu, Yibo, et al.
Published: (2025)
by: Lyu, Yibo, et al.
Published: (2025)
Thickness‐Dependent Creep in Lithium Layers of All‐Solid‐State Batteries under Stack Pressures
by: Chuangchuang Duan, et al.
Published: (2025)
by: Chuangchuang Duan, et al.
Published: (2025)
ConLA: Contrastive Latent Action Learning from Human Videos for Robotic Manipulation
by: Dai, Weisheng, et al.
Published: (2026)
by: Dai, Weisheng, et al.
Published: (2026)
Calibrated Multimodal Representation Learning with Missing Modalities
by: Liu, Xiaohao, et al.
Published: (2025)
by: Liu, Xiaohao, et al.
Published: (2025)
APEX: A Decoupled Memory-based Explorer for Asynchronous Aerial Object Goal Navigation
by: Zhang, Daoxuan, et al.
Published: (2026)
by: Zhang, Daoxuan, et al.
Published: (2026)
Not All Layers of LLMs Are Necessary During Inference
by: Fan, Siqi, et al.
Published: (2024)
by: Fan, Siqi, et al.
Published: (2024)
Not All Noises Are Created Equally:Diffusion Noise Selection and Optimization
by: Qi, Zipeng, et al.
Published: (2024)
by: Qi, Zipeng, et al.
Published: (2024)
Tag-based Physical-Layer Authentication Against Message Interference
by: Yao, Lei, et al.
Published: (2026)
by: Yao, Lei, et al.
Published: (2026)
Language-Specific Layer Matters: Efficient Multilingual Enhancement for Large Vision-Language Models
by: Fan, Yuchun, et al.
Published: (2025)
by: Fan, Yuchun, et al.
Published: (2025)
MangaFlow: An End-to-End Agentic Framework for Controllable Story to Manga Generation
by: Wang, Muyao, et al.
Published: (2026)
by: Wang, Muyao, et al.
Published: (2026)
MultiPriv: Benchmarking Individual-Level Privacy Reasoning in Vision-Language Models
by: Sun, Xiongtao, et al.
Published: (2025)
by: Sun, Xiongtao, et al.
Published: (2025)
L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models
by: Liu, Xiaohao, et al.
Published: (2025)
by: Liu, Xiaohao, et al.
Published: (2025)
Last Layer Empirical Bayes
by: Villecroze, Valentin, et al.
Published: (2025)
by: Villecroze, Valentin, et al.
Published: (2025)
Do Vision-Language Transformers Exhibit Visual Commonsense? An Empirical Study of VCR
by: Li, Zhenyang, et al.
Published: (2024)
by: Li, Zhenyang, et al.
Published: (2024)
First Glimpse on Physical Layer Security in Internet of Vehicles: Transformed from Communication Interference to Sensing Interference
by: Li, Kaixuan, et al.
Published: (2025)
by: Li, Kaixuan, et al.
Published: (2025)
DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
by: Chuang, Yung-Sung, et al.
Published: (2023)
by: Chuang, Yung-Sung, et al.
Published: (2023)
Mono2Stereo: A Benchmark and Empirical Study for Stereo Conversion
by: Yu, Songsong, et al.
Published: (2025)
by: Yu, Songsong, et al.
Published: (2025)
Branch, or Layer? Zeroth-Order Optimization for Continual Learning of Vision-Language Models
by: Liu, Ziwei, et al.
Published: (2025)
by: Liu, Ziwei, et al.
Published: (2025)
Mano: Restriking Manifold Optimization for LLM Training
by: Gu, Yufei, et al.
Published: (2026)
by: Gu, Yufei, et al.
Published: (2026)
Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs
by: Yang, Zhipeng, et al.
Published: (2025)
by: Yang, Zhipeng, et al.
Published: (2025)
Controllable Layer Decomposition for Reversible Multi-Layer Image Generation
by: Liu, Zihao, et al.
Published: (2025)
by: Liu, Zihao, et al.
Published: (2025)
Study on Improving the Vertical Thermal Conductivity of the Film by Single‐Layer Alumina Arrangement
by: Jin‐Biao Wang, et al.
Published: (2025)
by: Jin‐Biao Wang, et al.
Published: (2025)
FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning
by: Wen, Haokun, et al.
Published: (2026)
by: Wen, Haokun, et al.
Published: (2026)
Double‐side Interfacial Engineering of Hole Transport Layer Enables Efficient and Operationally Stable Colloidal Quantum Dot Solar Cells
by: Xin Wen, et al.
Published: (2025)
by: Xin Wen, et al.
Published: (2025)
Benchmarking Large Vision-Language Models on Fine-Grained Image Tasks: A Comprehensive Evaluation
by: Yu, Hong-Tao, et al.
Published: (2025)
by: Yu, Hong-Tao, et al.
Published: (2025)
Empirical Study of Large Language Models as Automated Essay Scoring Tools in English Composition__Taking TOEFL Independent Writing Task for Example
by: Xia, Wei, et al.
Published: (2024)
by: Xia, Wei, et al.
Published: (2024)
Understanding Layer Significance in LLM Alignment
by: Shi, Guangyuan, et al.
Published: (2024)
by: Shi, Guangyuan, et al.
Published: (2024)
Not All Layers Need Tuning: Selective Layer Restoration Recovers Diversity
by: Zhang, Bowen, et al.
Published: (2026)
by: Zhang, Bowen, et al.
Published: (2026)
CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think
by: Sun, Zening, et al.
Published: (2026)
by: Sun, Zening, et al.
Published: (2026)
A Single-Layer Model Can Do Language Modeling
by: Wang, Zanmin
Published: (2026)
by: Wang, Zanmin
Published: (2026)
PhysLayer: Language-Guided Layered Animation with Depth-Aware Physics
by: Xie, Tianyidan, et al.
Published: (2026)
by: Xie, Tianyidan, et al.
Published: (2026)
SonoVision: A Computer Vision Approach for Helping Visually Challenged Individuals Locate Objects with the Help of Sound Cues
by: Zishan, Md Abu Obaida, et al.
Published: (2025)
by: Zishan, Md Abu Obaida, et al.
Published: (2025)
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models
by: Zhao, Zheng, et al.
Published: (2024)
by: Zhao, Zheng, et al.
Published: (2024)
Logic Unseen: Revealing the Logical Blindspots of Vision-Language Models
by: Zhou, Yuchen, et al.
Published: (2025)
by: Zhou, Yuchen, et al.
Published: (2025)
Similar Items
-
Learning to Accelerate Vision-Language-Action Models through Adaptive Visual Token Caching
by: Wei, Yujie, et al.
Published: (2026) -
UtilGen: Utility-Centric Generative Data Augmentation with Dual-Level Task Adaptation
by: Guo, Jiyu, et al.
Published: (2025) -
Inject Once Survive Later: Backdooring Vision-Language-Action Models to Persist Through Downstream Fine-tuning
by: Zhou, Jianyi, et al.
Published: (2026) -
LayerMatch: Do Pseudo-labels Benefit All Layers?
by: Liang, Chaoqi, et al.
Published: (2024) -
Model Specific Task Similarity for Vision Language Model Selection via Layer Conductance
by: Yang, Wei, et al.
Published: (2026)