| Main Authors: | Bai, Xiyue, Yu, Ronghao, Xiu, Jia, Zhou, Pengfei, Xia, Jie, Ji, Peng |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.18635 |
Similar Items
ImgEdit: A Unified Image Editing Dataset and Benchmark
by: Ye, Yang, et al.
Published: (2025)
Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning
by: Chen, Zhisheng, et al.
Published: (2025)
UniVideo: Unified Understanding, Generation, and Editing for Videos
by: Wei, Cong, et al.
Published: (2025)
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying
by: Bai, Chengyu, et al.
Published: (2025)
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing
by: Fu, Tsu-Jui, et al.
Published: (2025)
UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation
by: Guo, Qin, et al.
Published: (2025)
Img2CADSeq: Image-to-CAD Generation via Sequence-Based Diffusion
by: Tan, Shiyu, et al.
Published: (2026)
UTDesign: A Unified Framework for Stylized Text Editing and Generation in Graphic Design Images
by: Zhao, Yiming, et al.
Published: (2025)
Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control
by: Fujiwara, Haruo, et al.
Published: (2025)
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
by: Wang, Haoxuan, et al.
Published: (2025)
DreamOmni: Unified Image Generation and Editing
by: Xia, Bin, et al.
Published: (2024)
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation
by: Yan, Canxiang, et al.
Published: (2025)
Neural-Driven Image Editing
by: Zhou, Pengfei, et al.
Published: (2025)
Multimodal Contrastive Learning via Uni-Modal Coding and Cross-Modal Prediction for Multimodal Sentiment Analysis
by: Lin, Ronghao, et al.
Published: (2022)
UniLayDiff: A Unified Diffusion Transformer for Content-Aware Layout Generation
by: Liu, Zeyang, et al.
Published: (2025)
ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning
by: Luo, Pengfei, et al.
Published: (2025)
NeurOp-Diff: Continuous Remote Sensing Image Super-Resolution via Neural Operator Diffusion
by: Xu, Zihao, et al.
Published: (2025)
NeurCADRecon: Neural Representation for Reconstructing CAD Surfaces by Enforcing Zero Gaussian Curvature
by: Dong, Qiujie, et al.
Published: (2024)
UniCustom: Unified Visual Conditioning for Multi-Reference Image Generation
by: Xu, Yiyan, et al.
Published: (2026)
StyleAdapter: A Unified Stylized Image Generation Model
by: Wang, Zhouxia, et al.
Published: (2023)
UniHuman: A Unified Model for Editing Human Images in the Wild
by: Li, Nannan, et al.
Published: (2023)
Highly Accelerated MRI via Implicit Neural Representation Guided Posterior Sampling of Diffusion Models
by: Chu, Jiayue, et al.
Published: (2024)
UniFL: Improve Latent Diffusion Model via Unified Feedback Learning
by: Zhang, Jiacheng, et al.
Published: (2024)
UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution
by: Du, Shian, et al.
Published: (2025)
UniGeo: Unifying Geometric Guidance for Camera-Controllable Image Editing via Video Models
by: Jiang, Hong, et al.
Published: (2026)
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics
by: Chen, Xi, et al.
Published: (2024)
UniREditBench: A Unified Reasoning-based Image Editing Benchmark
by: Han, Feng, et al.
Published: (2025)
TBAC-UniImage: Unified Understanding and Generation by Ladder-Side Diffusion Tuning
by: Xu, Junzhe, et al.
Published: (2025)
NeurIPT: Foundation Model for Neural Interfaces
by: Fang, Zitao, et al.
Published: (2025)
UniForm: A Unified Multi-Task Diffusion Transformer for Audio-Video Generation
by: Zhao, Lei, et al.
Published: (2025)
DreamVE: Unified Instruction-based Image and Video Editing
by: Xia, Bin, et al.
Published: (2025)
UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
by: Zhang, Yanran, et al.
Published: (2026)
UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing
by: Bai, Jianhong, et al.
Published: (2024)
UniQA: Unified Vision-Language Pre-training for Image Quality and Aesthetic Assessment
by: Zhou, Hantao, et al.
Published: (2024)
UniSymNet: A Unified Symbolic Network Guided by Transformer
by: Li, Xinxin, et al.
Published: (2025)
More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models
by: Lin, Hongkai, et al.
Published: (2025)
ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model
by: Qi, Jinwei, et al.
Published: (2025)
UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing
by: Tang, Hao, et al.
Published: (2025)
Uni-Edit: Intelligent Editing Is A General Task For Unified Model Tuning
by: Zheng, Dian, et al.
Published: (2026)
Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator
by: Qin, Luozheng, et al.
Published: (2026)