| Main Authors: | Zhao, Yiming, Gao, Yuanpeng, Luo, Yuxuan, Duan, Jiwei, Lin, Shisong, Xiong, Longfei, Lian, Zhouhui |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.20479 |
Similar Items
Beyond Patches: Global-aware Autoregressive Model for Multimodal Few-Shot Font Generation
by: Cai, Haonan, et al.
Published: (2026)
Pano2Room: Novel View Synthesis from a Single Indoor Panorama
by: Pu, Guo, et al.
Published: (2024)
RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts
by: Liu, Xu, et al.
Published: (2024)
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning
by: Luo, Yuxuan, et al.
Published: (2025)
VecFontSDF: Learning to Reconstruct and Synthesize High-quality Vector Fonts via Signed Distance Functions
by: Xia, Zeqing, et al.
Published: (2023)
CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model
by: Luo, Yuxuan, et al.
Published: (2025)
CalliRewrite: Recovering Handwriting Behaviors from Calligraphy Images without Supervision
by: Luo, Yuxuan, et al.
Published: (2024)
HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution
by: Li, Hua, et al.
Published: (2024)
StyleAdapter: A Unified Stylized Image Generation Model
by: Wang, Zhouxia, et al.
Published: (2023)
DreamStyle: A Unified Framework for Video Stylization
by: Li, Mengtian, et al.
Published: (2026)
Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control
by: Fujiwara, Haruo, et al.
Published: (2025)
Uni-Neur2Img: Unified Neural Signal-Guided Image Generation, Editing, and Stylization via Diffusion Transformers
by: Bai, Xiyue, et al.
Published: (2025)
PoseMaster: A Unified 3D Native Framework for Stylized Pose Generation
by: Yan, Hongyu, et al.
Published: (2025)
Training-free Stylized Text-to-Image Generation with Fast Inference
by: Ma, Xin, et al.
Published: (2025)
ActMVS: Active Scene Reconstruction with Monocular Multi-View Stereo
by: Pu, Guo, et al.
Published: (2026)
Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting
by: Liu, Hanxi, et al.
Published: (2025)
TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis
by: Xie, Yu, et al.
Published: (2025)
LayerFlow: A Unified Model for Layer-aware Video Generation
by: Ji, Sihui, et al.
Published: (2025)
Neural-Polyptych: Content Controllable Painting Recreation for Diverse Genres
by: Zhao, Yiming, et al.
Published: (2024)
Generative Human Motion Stylization in Latent Space
by: Guo, Chuan, et al.
Published: (2024)
Dynamic Texture Transfer using PatchMatch and Transformers
by: Pu, Guo, et al.
Published: (2024)
NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation
by: Ozturk, Serkan, et al.
Published: (2025)
Training-Free Diffusion Framework for Stylized Image Generation with Identity Preservation
by: Rezaei, Mohammad Ali, et al.
Published: (2025)
Neural Contrast: Leveraging Generative Editing for Graphic Design Recommendations
by: Lupascu, Marian, et al.
Published: (2024)
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
by: Pu, Yifan, et al.
Published: (2025)
StylizedGS: Controllable Stylization for 3D Gaussian Splatting
by: Zhang, Dingxi, et al.
Published: (2024)
TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
by: Yan, Zhenyu, et al.
Published: (2024)
TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting
by: Xiong, Bojun, et al.
Published: (2024)
OctFusion: Octree-based Diffusion Models for 3D Shape Generation
by: Xiong, Bojun, et al.
Published: (2024)
Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model
by: Yang, Liu, et al.
Published: (2025)
DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation
by: Xiong, Yuxuan, et al.
Published: (2025)
Towards Generalized Multi-Image Editing for Unified Multimodal Models
by: Xu, Pengcheng, et al.
Published: (2026)
InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow
by: Gong, Yiming, et al.
Published: (2025)
How Control Information Influences Multilingual Text Image Generation and Editing?
by: Zhang, Boqiang, et al.
Published: (2024)
MotionVerse: A Unified Multimodal Framework for Motion Comprehension, Generation and Editing
by: Hou, Ruibing, et al.
Published: (2025)
EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning
by: Ju, Xuan, et al.
Published: (2025)
DreamOmni: Unified Image Generation and Editing
by: Xia, Bin, et al.
Published: (2024)
En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
by: Men, Yifang, et al.
Published: (2024)
UM-Text: A Unified Multimodal Model for Image Understanding and Visual Text Editing
by: Ma, Lichen, et al.
Published: (2026)
ScribbleEdit: Synthetic Data for Image Editing with Scribbles and Text
by: Ji, Anya, et al.
Published: (2026)