Saved in:
| Main Authors: | Zhang, Yu, Liu, Jingyi, Liu, Feng, Miao, Duoqian, Zhang, Qi, Fu, Kexue, Wang, Changwei, Cao, Longbing |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01345 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Markovian Scale Prediction: A New Era of Visual Autoregressive Generation
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization
by: Gao, Yanting, et al.
Published: (2025)
by: Gao, Yanting, et al.
Published: (2025)
SkipVAR: Accelerating Visual Autoregressive Modeling via Adaptive Frequency-Aware Skipping
by: Li, Jiajun, et al.
Published: (2025)
by: Li, Jiajun, et al.
Published: (2025)
MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction
by: Gong, Zixuan, et al.
Published: (2024)
by: Gong, Zixuan, et al.
Published: (2024)
MLIP: Efficient Multi-Perspective Language-Image Pretraining with Exhaustive Data Utilization
by: Zhang, Yu, et al.
Published: (2024)
by: Zhang, Yu, et al.
Published: (2024)
FedMinds: Privacy-Preserving Personalized Brain Visual Decoding
by: Bao, Guangyin, et al.
Published: (2024)
by: Bao, Guangyin, et al.
Published: (2024)
Transformer-Based Person Search with High-Frequency Augmentation and Multi-Wave Mixing
by: Shu, Qilin, et al.
Published: (2025)
by: Shu, Qilin, et al.
Published: (2025)
Perception Activator: An intuitive and portable framework for brain cognitive exploration
by: Xu, Le, et al.
Published: (2025)
by: Xu, Le, et al.
Published: (2025)
Region Matters: Efficient and Reliable Region-Aware Visual Place Recognition
by: Chen, Shunpeng, et al.
Published: (2026)
by: Chen, Shunpeng, et al.
Published: (2026)
Fast-ARDiff: An Entropy-informed Acceleration Framework for Continuous Space Autoregressive Generation
by: Zou, Zhen, et al.
Published: (2025)
by: Zou, Zhen, et al.
Published: (2025)
Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition
by: Wang, Changwei, et al.
Published: (2025)
by: Wang, Changwei, et al.
Published: (2025)
Lite-Mind: Towards Efficient and Robust Brain Representation Network
by: Gong, Zixuan, et al.
Published: (2023)
by: Gong, Zixuan, et al.
Published: (2023)
Depth Adaptive Efficient Visual Autoregressive Modeling
by: Li, Chunliang, et al.
Published: (2026)
by: Li, Chunliang, et al.
Published: (2026)
Wills Aligner: Multi-Subject Collaborative Brain Visual Decoding
by: Bao, Guangyin, et al.
Published: (2024)
by: Bao, Guangyin, et al.
Published: (2024)
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction
by: Gong, Zixuan, et al.
Published: (2024)
by: Gong, Zixuan, et al.
Published: (2024)
SAGE: Spatial-visual Adaptive Graph Exploration for Efficient Visual Place Recognition
by: Chen, Shunpeng, et al.
Published: (2025)
by: Chen, Shunpeng, et al.
Published: (2025)
EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation
by: Xiong, Tianwei, et al.
Published: (2026)
by: Xiong, Tianwei, et al.
Published: (2026)
Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model
by: Xu, Haoran, et al.
Published: (2026)
by: Xu, Haoran, et al.
Published: (2026)
Thinking in Scales: Accelerating Gigapixel Pathology Image Analysis via Adaptive Continuous Reasoning
by: Ge, Jiusong, et al.
Published: (2026)
by: Ge, Jiusong, et al.
Published: (2026)
DressCode: Autoregressively Sewing and Generating Garments from Text Guidance
by: He, Kai, et al.
Published: (2024)
by: He, Kai, et al.
Published: (2024)
Passive Dementia Screening via Facial Temporal Micro-Dynamics Analysis of In-the-Wild Talking-Head Video
by: Cenacchi, Filippo, et al.
Published: (2025)
by: Cenacchi, Filippo, et al.
Published: (2025)
A Dual-Modulation Framework for RGB-T Crowd Counting via Spatially Modulated Attention and Adaptive Fusion
by: Feng, Yuhong, et al.
Published: (2025)
by: Feng, Yuhong, et al.
Published: (2025)
Parallelized Autoregressive Visual Generation
by: Wang, Yuqing, et al.
Published: (2024)
by: Wang, Yuqing, et al.
Published: (2024)
SAGE: Accelerating Vision-Language Models via Entropy-Guided Adaptive Speculative Decoding
by: Tong, Yujia, et al.
Published: (2026)
by: Tong, Yujia, et al.
Published: (2026)
SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
by: Zhang, Jiacheng, et al.
Published: (2026)
by: Zhang, Jiacheng, et al.
Published: (2026)
VCE: Safe Autoregressive Image Generation via Visual Contrast Exploitation
by: Han, Feng, et al.
Published: (2025)
by: Han, Feng, et al.
Published: (2025)
CurriFlow: Curriculum-Guided Depth Fusion with Optical Flow-Based Temporal Alignment for 3D Semantic Scene Completion
by: Lin, Jinzhou, et al.
Published: (2025)
by: Lin, Jinzhou, et al.
Published: (2025)
Rethinking Structure Preservation in Text-Guided Image Editing with Visual Autoregressive Models
by: Xia, Tao, et al.
Published: (2026)
by: Xia, Tao, et al.
Published: (2026)
DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing
by: Yang, Jingyi, et al.
Published: (2025)
by: Yang, Jingyi, et al.
Published: (2025)
Visual Autoregressive Modeling for Instruction-Guided Image Editing
by: Mao, Qingyang, et al.
Published: (2025)
by: Mao, Qingyang, et al.
Published: (2025)
ToProVAR: Efficient Visual Autoregressive Modeling via Tri-Dimensional Entropy-Aware Semantic Analysis and Sparsity Optimization
by: Chen, Jiayu, et al.
Published: (2026)
by: Chen, Jiayu, et al.
Published: (2026)
Speculative Jacobi-Denoising Decoding for Accelerating Autoregressive Text-to-image Generation
by: Teng, Yao, et al.
Published: (2025)
by: Teng, Yao, et al.
Published: (2025)
POINTS-Long: Adaptive Dual-Mode Visual Reasoning in MLLMs
by: Wang, Haicheng, et al.
Published: (2026)
by: Wang, Haicheng, et al.
Published: (2026)
SCoT: Unifying Consistency Models and Rectified Flows via Straight-Consistent Trajectories
by: Wu, Zhangkai, et al.
Published: (2025)
by: Wu, Zhangkai, et al.
Published: (2025)
FasterVAR: Plug-and-Play Acceleration for Visual Autoregressive Models
by: Li, Senmao, et al.
Published: (2025)
by: Li, Senmao, et al.
Published: (2025)
Rethinking the Zigzag Flattening for Image Reading
by: Zhao, Qingsong, et al.
Published: (2022)
by: Zhao, Qingsong, et al.
Published: (2022)
Customize Your Visual Autoregressive Recipe with Set Autoregressive Modeling
by: Liu, Wenze, et al.
Published: (2024)
by: Liu, Wenze, et al.
Published: (2024)
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation
by: Wang, Yuqing, et al.
Published: (2025)
by: Wang, Yuqing, et al.
Published: (2025)
CAR: Controllable Autoregressive Modeling for Visual Generation
by: Yao, Ziyu, et al.
Published: (2024)
by: Yao, Ziyu, et al.
Published: (2024)
Similar Items
-
Markovian Scale Prediction: A New Era of Visual Autoregressive Generation
by: Zhang, Yu, et al.
Published: (2025) -
Enhancing Text-to-Image Diffusion Transformer via Split-Text Conditioning
by: Zhang, Yu, et al.
Published: (2025) -
Boosting Adversarial Transferability via Commonality-Oriented Gradient Optimization
by: Gao, Yanting, et al.
Published: (2025) -
SkipVAR: Accelerating Visual Autoregressive Modeling via Adaptive Frequency-Aware Skipping
by: Li, Jiajun, et al.
Published: (2025) -
MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction
by: Gong, Zixuan, et al.
Published: (2024)