:: Library Catalog

Bibliographic Details
Main Authors:	Kumbong, Hermann; Liu, Xian; Lin, Tsung-Yi; Liu, Ming-Yu; Liu, Xihui; Liu, Ziwei; Fu, Daniel Y.; Ré, Christopher; Romero, David W.
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2506.04421

Similar Items

Personalized Text-to-Image Generation with Auto-Regressive Models
by: Sun, Kaiyue, et al.
Published: (2025)

The Hedgehog & the Porcupine: Expressive Linear Attentions with Softmax Mimicry
by: Zhang, Michael, et al.
Published: (2024)

HMAR: Hierarchical Masked Attention for Multi-Behaviour Recommendation
by: Elsayed, Shereen, et al.
Published: (2024)

Resurrect Mask AutoRegressive Modeling for Efficient and Scalable Image Generation
by: Xin, Yi, et al.
Published: (2025)

LowRA: Accurate and Efficient LoRA Fine-Tuning of LLMs under 2 Bits
by: Zhou, Zikai, et al.
Published: (2025)

Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding
by: Teng, Yao, et al.
Published: (2024)

HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting
by: Liu, Xian, et al.
Published: (2023)

Meshtron: High-Fidelity, Artist-Like 3D Mesh Generation at Scale
by: Hao, Zekun, et al.
Published: (2024)

HMAR: Hierarchical Modality-Aware Expert and Dynamic Routing Medical Image Retrieval Architecture
by: Yuan, Aojie
Published: (2026)

T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
by: Sun, Kaiyue, et al.
Published: (2025)

SJD++: Improved Speculative Jacobi Decoding for Training-free Acceleration of Discrete Auto-regressive Text-to-Image Generation
by: Teng, Yao, et al.
Published: (2025)

TTS-VAR: A Test-Time Scaling Framework for Visual Auto-Regressive Generation
by: Chen, Zhekai, et al.
Published: (2025)

HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
by: Liu, Xian, et al.
Published: (2023)

EdgeRunner: Auto-regressive Auto-encoder for Artistic Mesh Generation
by: Tang, Jiaxiang, et al.
Published: (2024)

Auto-Regressively Generating Multi-View Consistent Images
by: Hu, JiaKui, et al.
Published: (2025)

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
by: Han, Jian, et al.
Published: (2024)

Genuine-Focused Learning using Mask AutoEncoder for Generalized Fake Audio Detection
by: Wang, Xiaopeng, et al.
Published: (2024)

Learning Interpretable Representations Leads to Semantically Faithful EEG-to-Text Generation
by: Liu, Xiaozhao, et al.
Published: (2025)

Auto-Regressive Masked Diffusion Models
by: Karami, Mahdi, et al.
Published: (2026)

Adaptive 1D Video Diffusion Autoencoder
by: Teng, Yao, et al.
Published: (2026)

See4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting
by: Lu, Dongyue, et al.
Published: (2026)

See4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting
by: Lu, Dongyue, et al.
Published: (2025)

Exploiting Hierarchical Interactions for Protein Surface Learning
by: Lin, Yiqun, et al.
Published: (2024)

GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
by: Wang, Zhenyu, et al.
Published: (2024)

MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
by: He, Wanggui, et al.
Published: (2024)

MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data
by: Chen, Zhekai, et al.
Published: (2026)

Reprojection Errors as Prompts for Efficient Scene Coordinate Regression
by: Liu, Ting-Ru, et al.
Published: (2024)

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
by: Sun, Kaiyue, et al.
Published: (2024)

Hierarchical Information Flow for Generalized Efficient Image Restoration
by: Li, Yawei, et al.
Published: (2024)

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography
by: Zhang, Mengchen, et al.
Published: (2025)

AR-Diffusion: Asynchronous Video Generation with Auto-Regressive Diffusion
by: Sun, Mingzhen, et al.
Published: (2025)

AutoVP: An Automated Visual Prompting Framework and Benchmark
by: Tsao, Hsi-Ai, et al.
Published: (2023)

TC4D: Trajectory-Conditioned Text-to-4D Generation
by: Bahmani, Sherwin, et al.
Published: (2024)

EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation
by: Xiong, Tianwei, et al.
Published: (2026)

InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation
by: Liu, Jinlai, et al.
Published: (2025)

Auto-Regressive Diffusion for Generating 3D Human-Object Interactions
by: Geng, Zichen, et al.
Published: (2025)

Masked Generative Transformer Is What You Need for Image Editing
by: Chow, Wei, et al.
Published: (2026)

Deep Lossless Image Compression via Masked Sampling and Coarse-to-Fine Auto-Regression
by: Li, Tiantian, et al.
Published: (2025)

Sample- and Parameter-Efficient Auto-Regressive Image Models
by: Amrani, Elad, et al.
Published: (2024)

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
by: Xin, Yi, et al.
Published: (2025)