Library Catalog

Bibliographic Details
Main Authors:	Zhang, Yu; Liu, Jingyi; Liu, Feng; Miao, Duoqian; Zhang, Qi; Fu, Kexue; Wang, Changwei; Cao, Longbing
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.01345

Similar Items

Markovian Scale Prediction: A New Era of Visual Autoregressive Generation
by: Zhang, Yu, et al.
Published: (2025)

Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning
by: Zhang, Yu, et al.
Published: (2025)

Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization
by: Gao, Yanting, et al.
Published: (2025)

SkipVAR: Accelerating Visual Autoregressive Modeling via Adaptive Frequency-Aware Skipping
by: Li, Jiajun, et al.
Published: (2025)

MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction
by: Gong, Zixuan, et al.
Published: (2024)

MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization
by: Zhang, Yu, et al.
Published: (2024)

FedMinds: Privacy-Preserving Personalized Brain Visual Decoding
by: Bao, Guangyin, et al.
Published: (2024)

Transformer-Based Person Search with High-Frequency Augmentation and Multi-Wave Mixing
by: Shu, Qilin, et al.
Published: (2025)

Perception Activator: An intuitive and portable framework for brain cognitive exploration
by: Xu, Le, et al.
Published: (2025)

Region Matters: Efficient and Reliable Region-Aware Visual Place Recognition
by: Chen, Shunpeng, et al.
Published: (2026)

Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation
by: Zou, Zhen, et al.
Published: (2025)

Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition
by: Wang, Changwei, et al.
Published: (2025)

Lite-Mind: Towards Efficient and Robust Brain Representation Network
by: Gong, Zixuan, et al.
Published: (2023)

Depth Adaptive Efficient Visual Autoregressive Modeling
by: Li, Chunliang, et al.
Published: (2026)

Wills Aligner: Multi-Subject Collaborative Brain Visual Decoding
by: Bao, Guangyin, et al.
Published: (2024)

NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
by: Gong, Zixuan, et al.
Published: (2024)

SAGE: Spatial-visual Adaptive Graph Exploration for Efficient Visual Place Recognition
by: Chen, Shunpeng, et al.
Published: (2025)

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation
by: Xiong, Tianwei, et al.
Published: (2026)

Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model
by: Xu, Haoran, et al.
Published: (2026)

Thinking in Scales: Accelerating Gigapixel Pathology Image Analysis via Adaptive Continuous Reasoning
by: Ge, Jiusong, et al.
Published: (2026)

DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
by: He, Kai, et al.
Published: (2024)

Passive Dementia Screening via Facial Temporal Micro-Dynamics Analysis of In-the-Wild Talking-Head Video
by: Cenacchi, Filippo, et al.
Published: (2025)

A Dual-Modulation Framework for RGB-T Crowd Counting via Spatially Modulated Attention and Adaptive Fusion
by: Feng, Yuhong, et al.
Published: (2025)

Parallelized Autoregressive Visual Generation
by: Wang, Yuqing, et al.
Published: (2024)

SAGE: Accelerating Vision-Language Models via Entropy-Guided Adaptive Speculative Decoding
by: Tong, Yujia, et al.
Published: (2026)

SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
by: Zhang, Jiacheng, et al.
Published: (2026)

VCE: Safe Autoregressive Image Generation via Visual Contrast Exploitation
by: Han, Feng, et al.
Published: (2025)

CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion
by: Lin, Jinzhou, et al.
Published: (2025)

Rethinking Structure Preservation in Text-Guided Image Editing with Visual Autoregressive Models
by: Xia, Tao, et al.
Published: (2026)

DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing
by: Yang, Jingyi, et al.
Published: (2025)

Visual Autoregressive Modeling for Instruction-Guided Image Editing
by: Mao, Qingyang, et al.
Published: (2025)

ToProVAR: Efficient Visual Autoregressive Modeling via Tri-Dimensional Entropy-Aware Semantic Analysis and Sparsity Optimization
by: Chen, Jiayu, et al.
Published: (2026)

Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation
by: Teng, Yao, et al.
Published: (2025)

POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs
by: Wang, Haicheng, et al.
Published: (2026)

SCoT: Unifying Consistency Models and Rectified Flows via Straight-Consistent Trajectories
by: Wu, Zhangkai, et al.
Published: (2025)

FasterVAR: Plug-and-Play Acceleration for Visual Autoregressive Models
by: Li, Senmao, et al.
Published: (2025)

Rethinking the Zigzag Flattening for Image Reading
by: Zhao, Qingsong, et al.
Published: (2022)

Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
by: Liu, Wenze, et al.
Published: (2024)

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
by: Wang, Yuqing, et al.
Published: (2025)

CAR: Controllable Autoregressive Modeling for Visual Generation
by: Yao, Ziyu, et al.
Published: (2024)