Saved in:
| Main Authors: | Kumbong, Hermann, Liu, Xian, Lin, Tsung-Yi, Liu, Ming-Yu, Liu, Xihui, Liu, Ziwei, Fu, Daniel Y., Ré, Christopher, Romero, David W. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.04421 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Personalized Text-to-Image Generation with Auto-Regressive Models
by: Sun, Kaiyue, et al.
Published: (2025)
by: Sun, Kaiyue, et al.
Published: (2025)
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
by: Zhang, Michael, et al.
Published: (2024)
by: Zhang, Michael, et al.
Published: (2024)
HMAR: Hierarchical Masked Attention for Multi-Behaviour Recommendation
by: Elsayed, Shereen, et al.
Published: (2024)
by: Elsayed, Shereen, et al.
Published: (2024)
Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
by: Xin, Yi, et al.
Published: (2025)
by: Xin, Yi, et al.
Published: (2025)
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits
by: Zhou, Zikai, et al.
Published: (2025)
by: Zhou, Zikai, et al.
Published: (2025)
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
by: Teng, Yao, et al.
Published: (2024)
by: Teng, Yao, et al.
Published: (2024)
HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
by: Liu, Xian, et al.
Published: (2023)
by: Liu, Xian, et al.
Published: (2023)
Meshtron: High-Fidelity, Artist-Like 3D Mesh Generation at Scale
by: Hao, Zekun, et al.
Published: (2024)
by: Hao, Zekun, et al.
Published: (2024)
HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture
by: Yuan, Aojie
Published: (2026)
by: Yuan, Aojie
Published: (2026)
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
by: Sun, Kaiyue, et al.
Published: (2025)
by: Sun, Kaiyue, et al.
Published: (2025)
SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation
by: Teng, Yao, et al.
Published: (2025)
by: Teng, Yao, et al.
Published: (2025)
TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation
by: Chen, Zhekai, et al.
Published: (2025)
by: Chen, Zhekai, et al.
Published: (2025)
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
by: Liu, Xian, et al.
Published: (2023)
by: Liu, Xian, et al.
Published: (2023)
EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation
by: Tang, Jiaxiang, et al.
Published: (2024)
by: Tang, Jiaxiang, et al.
Published: (2024)
Auto-Regressively Generating Multi-View Consistent Images
by: Hu, JiaKui, et al.
Published: (2025)
by: Hu, JiaKui, et al.
Published: (2025)
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
by: Han, Jian, et al.
Published: (2024)
by: Han, Jian, et al.
Published: (2024)
Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
by: Wang, Xiaopeng, et al.
Published: (2024)
by: Wang, Xiaopeng, et al.
Published: (2024)
Learning Interpretable Representations Leads to Semantically Faithful EEG-to-Text Generation
by: Liu, Xiaozhao, et al.
Published: (2025)
by: Liu, Xiaozhao, et al.
Published: (2025)
Auto-Regressive Masked Diffusion Models
by: Karami, Mahdi, et al.
Published: (2026)
by: Karami, Mahdi, et al.
Published: (2026)
Adaptive 1D Video Diffusion Autoencoder
by: Teng, Yao, et al.
Published: (2026)
by: Teng, Yao, et al.
Published: (2026)
S ee 4D: Pose‐Free 4D Generation via Auto‐Regressive Video Inpainting
by: Dongyue Lu, et al.
Published: (2026)
by: Dongyue Lu, et al.
Published: (2026)
See4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting
by: Lu, Dongyue, et al.
Published: (2025)
by: Lu, Dongyue, et al.
Published: (2025)
Exploiting Hierarchical Interactions for Protein Surface Learning
by: Lin, Yiqun, et al.
Published: (2024)
by: Lin, Yiqun, et al.
Published: (2024)
GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
by: Wang, Zhenyu, et al.
Published: (2024)
by: Wang, Zhenyu, et al.
Published: (2024)
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
by: He, Wanggui, et al.
Published: (2024)
by: He, Wanggui, et al.
Published: (2024)
MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data
by: Chen, Zhekai, et al.
Published: (2026)
by: Chen, Zhekai, et al.
Published: (2026)
Reprojection Errors as Prompts for Efficient Scene Coordinate Regression
by: Liu, Ting-Ru, et al.
Published: (2024)
by: Liu, Ting-Ru, et al.
Published: (2024)
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
by: Sun, Kaiyue, et al.
Published: (2024)
by: Sun, Kaiyue, et al.
Published: (2024)
Hierarchical Information Flow for Generalized Efficient Image Restoration
by: Li, Yawei, et al.
Published: (2024)
by: Li, Yawei, et al.
Published: (2024)
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
by: Zhang, Mengchen, et al.
Published: (2025)
by: Zhang, Mengchen, et al.
Published: (2025)
AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion
by: Sun, Mingzhen, et al.
Published: (2025)
by: Sun, Mingzhen, et al.
Published: (2025)
AutoVP: An Automated Visual Prompting Framework and Benchmark
by: Tsao, Hsi-Ai, et al.
Published: (2023)
by: Tsao, Hsi-Ai, et al.
Published: (2023)
TC4D: Trajectory-Conditioned Text-to-4D Generation
by: Bahmani, Sherwin, et al.
Published: (2024)
by: Bahmani, Sherwin, et al.
Published: (2024)
EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation
by: Xiong, Tianwei, et al.
Published: (2026)
by: Xiong, Tianwei, et al.
Published: (2026)
InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation
by: Liu, Jinlai, et al.
Published: (2025)
by: Liu, Jinlai, et al.
Published: (2025)
Auto-Regressive Diffusion for Generating 3D Human-Object Interactions
by: Geng, Zichen, et al.
Published: (2025)
by: Geng, Zichen, et al.
Published: (2025)
Masked Generative Transformer Is What You Need for Image Editing
by: Chow, Wei, et al.
Published: (2026)
by: Chow, Wei, et al.
Published: (2026)
Deep Lossless Image Compression via Masked Sampling and Coarse-to-Fine Auto-Regression
by: Li, Tiantian, et al.
Published: (2025)
by: Li, Tiantian, et al.
Published: (2025)
Sample- and Parameter-Efficient Auto-Regressive Image Models
by: Amrani, Elad, et al.
Published: (2024)
by: Amrani, Elad, et al.
Published: (2024)
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
by: Xin, Yi, et al.
Published: (2025)
by: Xin, Yi, et al.
Published: (2025)
Similar Items
-
Personalized Text-to-Image Generation with Auto-Regressive Models
by: Sun, Kaiyue, et al.
Published: (2025) -
The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
by: Zhang, Michael, et al.
Published: (2024) -
HMAR: Hierarchical Masked Attention for Multi-Behaviour Recommendation
by: Elsayed, Shereen, et al.
Published: (2024) -
Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
by: Xin, Yi, et al.
Published: (2025) -
LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits
by: Zhou, Zikai, et al.
Published: (2025)