| Main Authors: | Cai, Qi, Chen, Jingwen, Gao, Chengmin, Gong, Zijian, Li, Yehao, Pan, Yingwei, Peng, Yi, Qiu, Zhaofan, Yu, Kai, Zhang, Yiheng, Ai, Hao, Bai, Siying, Chen, Yang, Chen, Zhihui, Gao, Fengbin, Guo, Ying, Li, Dong, Shen, Zhen, Shi, Leilei, Wang, Jing, Wang, Siyu, Wang, Yimeng, Zheng, Rui, Yao, Ting, Mei, Tao |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2605.11061 |
Similar Items
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion Transformer
by: Cai, Qi, et al.
Published: (2025)
DreamVAR: Taming Reinforced Visual Autoregressive Model for High-Fidelity Subject-Driven Image Generation
by: Jiang, Xin, et al.
Published: (2026)
Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
by: Luo, Jianjie, et al.
Published: (2024)
DreamOmni: Unified Image Generation and Editing
by: Xia, Bin, et al.
Published: (2024)
Improving Text-guided Object Inpainting with Semantic Pre-inpainting
by: Chen, Yifu, et al.
Published: (2024)
DreamLite: A Lightweight On-Device Unified Model for Image Generation and Editing
by: Feng, Kailai, et al.
Published: (2026)
Microscopic 3D Surface Imaging With Annular Spectrum Sampling Parallel Single‐Pixel Imaging: Resistant to Global Illumination
by: Chengmin Liu, et al.
Published: (2026)
Improving Virtual Try-On with Garment-focused Diffusion Models
by: Wan, Siqi, et al.
Published: (2024)
DreamO: A Unified Framework for Image Customization
by: Mou, Chong, et al.
Published: (2025)
DreamVE: Unified Instruction-based Image and Video Editing
by: Xia, Bin, et al.
Published: (2025)
Modeling of Parallel Single-Pixel Imaging for 3D Reconstruction: New Insights and Opportunities
by: Chen, Feifei, et al.
Published: (2025)
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation
by: Qu, Liao, et al.
Published: (2024)
DreamPoster: A Unified Framework for Image-Conditioned Generative Poster Design
by: Hu, Xiwei, et al.
Published: (2025)
SemHiTok: A Unified Image Tokenizer via Semantic-Guided Hierarchical Codebook for Multimodal Understanding and Generation
by: Chen, Zisheng, et al.
Published: (2025)
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
by: Zhang, Zhongwei, et al.
Published: (2024)
HIRI-ViT: Scaling Vision Transformer with High Resolution Inputs
by: Yao, Ting, et al.
Published: (2024)
DPImageBench: A Unified Benchmark for Differentially Private Image Synthesis
by: Gong, Chen, et al.
Published: (2025)
DreamStyle: A Unified Framework for Video Stylization
by: Li, Mengtian, et al.
Published: (2026)
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
by: Yang, Haibo, et al.
Published: (2024)
FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process
by: Luo, Yang, et al.
Published: (2024)
Visual Autoregressive Modeling for Instruction-Guided Image Editing
by: Mao, Qingyang, et al.
Published: (2025)
MotionPro: A Precise Motion Controller for Image-to-Video Generation
by: Zhang, Zhongwei, et al.
Published: (2025)
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models
by: Liu, Zhiheng, et al.
Published: (2025)
Unified Data Selection for LLM Reasoning
by: Li, Xiaoyuan, et al.
Published: (2026)
HiDiff: Hybrid Diffusion Framework for Medical Image Segmentation
by: Chen, Tao, et al.
Published: (2024)
QCS: Feature Refining from Quadruplet Cross Similarity for Facial Expression Recognition
by: Wang, Chengpeng, et al.
Published: (2024)
DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation
by: Kim, Jeongsol, et al.
Published: (2024)
UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation
by: Yue, Zhengrong, et al.
Published: (2025)
Viewport-Unaware Blind Omnidirectional Image Quality Assessment: A Unified and Generalized Approach
by: Yan, Jiebin, et al.
Published: (2026)
SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation
by: Huang, Shuangping, et al.
Published: (2024)
DreamLight: Towards Harmonious and Consistent Image Relighting
by: Liu, Yong, et al.
Published: (2025)
On Fairness of Unified Multimodal Large Language Model for Image Generation
by: Liu, Ming, et al.
Published: (2025)
Flash-Unified: A Training-Free and Task-Aware Acceleration Framework for Native Unified Models
by: Ke, Junlong, et al.
Published: (2026)
QMamba: On First Exploration of Vision Mamba for Image Quality Assessment
by: Guan, Fengbin, et al.
Published: (2024)
Reduce the Artifacts Bias for More Generalizable AI-Generated Image Detection
by: Li, Yiheng, et al.
Published: (2026)
Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning
by: Chen, Zhisheng, et al.
Published: (2025)
UniAlignment: Semantic Alignment for Unified Image Generation, Understanding, Manipulation and Perception
by: Song, Xinyang, et al.
Published: (2025)
DreamJourney: Perpetual View Generation with Video Diffusion Models
by: Pan, Bo, et al.
Published: (2025)
TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills
by: Sun, Qiushi, et al.
Published: (2023)
DB3D-L: Depth-aware BEV Feature Transformation for Accurate 3D Lane Detection
by: Liu, Yehao, et al.
Published: (2025)