:: Library Catalog

Bibliographic details
Main authors:	Zhang, Hong; Duan, Zhongjie; Wang, Xingjun; Zhao, Yuze; Lu, Weiyi; Di, Zhipeng; Xu, Yixuan; Chen, Yingda; Zhang, Yu
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online access:	https://arxiv.org/abs/2504.21356
Similar documents

EliGen: Entity-Level Controlled Image Generation with Regional Attention
by: Zhang, Hong, et al.
Published: (2025)

Diffusion Templates: A Unified Plugin Framework for Controllable Diffusion
by: Duan, Zhongjie, et al.
Published: (2026)

ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction
by: Duan, Zhongjie, et al.
Published: (2024)

Minimum Tuning to Unlock Long Output from LLMs with High Quality Data as the Key
by: Chen, Yingda, et al.
Published: (2024)

InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling
by: Jiang, Xiaoxiao, et al.
Published: (2025)

Spectral Evolution Search: Efficient Inference-Time Scaling for Reward-Aligned Image Generation
by: Ye, Jinyan, et al.
Published: (2026)

AutoLoRA: Automatic LoRA Retrieval and Fine-Grained Gated Fusion for Text-to-Image Generation
by: Li, Zhiwen, et al.
Published: (2025)

FuncGenFoil: Airfoil Generation and Editing Model in Function Space
by: Zhang, Jinouwen, et al.
Published: (2025)

EndoGen: Conditional Autoregressive Endoscopic Video Generation
by: Liu, Xinyu, et al.
Published: (2025)

PrefillShare: A Shared Prefill Module for KV Reuse in Multi-LLM Disaggregated Serving
by: Woo, Sunghyeon, et al.
Published: (2026)

GenArtist: Multimodal LLM as an Agent for Unified Image Generation and Editing
by: Wang, Zhenyu, et al.
Published: (2024)

Nexus:Proactive Intra-GPU Disaggregation of Prefill and Decode in LLM Serving
by: Shi, Xiaoxiang, et al.
Published: (2025)

Unified Personalized Understanding, Generating and Editing
by: Zhong, Yu, et al.
Published: (2026)

EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and Understanding
by: Zou, Kai, et al.
Published: (2026)

Breaking the Modality Barrier: Universal Embedding Learning with Multimodal LLMs
by: Gu, Tiancheng, et al.
Published: (2025)

GenSpace: Benchmarking Spatially-Aware Image Generation
by: Wang, Zehan, et al.
Published: (2025)

SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning
by: Zhao, Yuze, et al.
Published: (2024)

UniX: Unifying Autoregression and Diffusion for Chest X-Ray Understanding and Generation
by: Zhang, Ruiheng, et al.
Published: (2026)

Unified Cross-Scale 3D Generation and Understanding via Autoregressive Modeling
by: Lu, Shuqi, et al.
Published: (2025)

NEP: Autoregressive Image Editing via Next Editing Token Prediction
by: Wu, Huimin, et al.
Published: (2025)

UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying
by: Bai, Chengyu, et al.
Published: (2025)

Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective
by: Zhu, Yongxin, et al.
Published: (2024)

AttriCtrl: Fine-Grained Control of Aesthetic Attribute Intensity in Diffusion Models
by: Chen, Die, et al.
Published: (2025)

VIRAL: Visual In-Context Reasoning via Analogy in Diffusion Transformers
by: Li, Zhiwen, et al.
Published: (2026)

Unified Medical Image Tokenizer for Autoregressive Synthesis and Understanding
by: Ma, Chenglong, et al.
Published: (2025)

DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing
by: Wang, Dianyi, et al.
Published: (2026)

Analyzing Diffusion and Autoregressive Vision Language Models in Multimodal Embedding Space
by: Wang, Zihang, et al.
Published: (2026)

Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model
by: Yang, Liu, et al.
Published: (2025)

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation
by: Ma, Yiyang, et al.
Published: (2024)

Unified Autoregressive Visual Generation and Understanding with Continuous Tokens
by: Fan, Lijie, et al.
Published: (2025)

DreamOmni: Unified Image Generation and Editing
by: Xia, Bin, et al.
Published: (2024)

OmniGen: Unified Image Generation
by: Xiao, Shitao, et al.
Published: (2024)

UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images
by: Zhao, Yiming, et al.
Published: (2025)

Copy-as-Decode: Grammar-Constrained Parallel Prefill for LLM Editing
by: Liu, Ziyang
Published: (2026)

Study on tensile properties of ultra‐thin‐ply carbon fiber‐reinforced composite laminates under static load
by: Mingfa Ren, et al.
Published: (2024)

WeGen: A Unified Model for Interactive Multimodal Generation as We Chat
by: Huang, Zhipeng, et al.
Published: (2025)

LaGen: Towards Autoregressive LiDAR Scene Generation
by: Zhou, Sizhuo, et al.
Published: (2025)

OmniMoGen: Unifying Human Motion Generation via Learning from Interleaved Text-Motion Instructions
by: Bu, Wendong, et al.
Published: (2025)

GenShield: Unified Detection and Artifact Correction for AI-Generated Images
by: Xu, Zhipei, et al.
Published: (2026)

GUSLO: General and Unified Structured Light Optimization
by: Wan, Tinglei, et al.
Published: (2025)