:: Library Catalog

Bibliographic Details
Main Authors:	Zhao, Yiming; Gao, Yuanpeng; Luo, Yuxuan; Duan, Jiwei; Lin, Shisong; Xiong, Longfei; Lian, Zhouhui
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.20479

Similar Items

Beyond Patches: Global-aware Autoregressive Model for Multimodal Few-Shot Font Generation
by: Cai, Haonan, et al.
Published: (2026)

Pano2Room: Novel View Synthesis from a Single Indoor Panorama
by: Pu, Guo, et al.
Published: (2024)

RSUniVLM: A Unified Vision Language Model for Remote Sensing via Granularity-oriented Mixture of Experts
by: Liu, Xu, et al.
Published: (2024)

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning
by: Luo, Yuxuan, et al.
Published: (2025)

VecFontSDF: Learning to Reconstruct and Synthesize High-quality Vector Fonts via Signed Distance Functions
by: Xia, Zeqing, et al.
Published: (2023)

CalliReader: Contextualizing Chinese Calligraphy via an Embedding-Aligned Vision-Language Model
by: Luo, Yuxuan, et al.
Published: (2025)

CalliRewrite: Recovering Handwriting Behaviors from Calligraphy Images without Supervision
by: Luo, Yuxuan, et al.
Published: (2024)

HFH-Font: Few-shot Chinese Font Synthesis with Higher Quality, Faster Speed, and Higher Resolution
by: Li, Hua, et al.
Published: (2024)

StyleAdapter: A Unified Stylized Image Generation Model
by: Wang, Zhouxia, et al.
Published: (2023)

DreamStyle: A Unified Framework for Video Stylization
by: Li, Mengtian, et al.
Published: (2026)

Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control
by: Fujiwara, Haruo, et al.
Published: (2025)

Uni-Neur2Img: Unified Neural Signal-Guided Image Generation, Editing, and Stylization via Diffusion Transformers
by: Bai, Xiyue, et al.
Published: (2025)

PoseMaster: A Unified 3D Native Framework for Stylized Pose Generation
by: Yan, Hongyu, et al.
Published: (2025)

Training-free Stylized Text-to-Image Generation with Fast Inference
by: Ma, Xin, et al.
Published: (2025)

ActMVS: Active Scene Reconstruction with Monocular Multi-View Stereo
by: Pu, Guo, et al.
Published: (2026)

Creating Your Editable 3D Photorealistic Avatar with Tetrahedron-constrained Gaussian Splatting
by: Liu, Hanxi, et al.
Published: (2025)

TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis
by: Xie, Yu, et al.
Published: (2025)

LayerFlow: A Unified Model for Layer-aware Video Generation
by: Ji, Sihui, et al.
Published: (2025)

Neural-Polyptych: Content Controllable Painting Recreation for Diverse Genres
by: Zhao, Yiming, et al.
Published: (2024)

Generative Human Motion Stylization in Latent Space
by: Guo, Chuan, et al.
Published: (2024)

Dynamic Texture Transfer using PatchMatch and Transformers
by: Pu, Guo, et al.
Published: (2024)

NSYNC: Negative Synthetic Image Generation for Contrastive Training to Improve Stylized Text-To-Image Translation
by: Ozturk, Serkan, et al.
Published: (2025)

Training-Free Diffusion Framework for Stylized Image Generation with Identity Preservation
by: Rezaei, Mohammad Ali, et al.
Published: (2025)

Neural Contrast: Leveraging Generative Editing for Graphic Design Recommendations
by: Lupascu, Marian, et al.
Published: (2024)

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
by: Pu, Yifan, et al.
Published: (2025)

StylizedGS: Controllable Stylization for 3D Gaussian Splatting
by: Zhang, Dingxi, et al.
Published: (2024)

TextMaster: A Unified Framework for Realistic Text Editing via Glyph-Style Dual-Control
by: Yan, Zhenyu, et al.
Published: (2024)

TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting
by: Xiong, Bojun, et al.
Published: (2024)

OctFusion: Octree-based Diffusion Models for 3D Shape Generation
by: Xiong, Bojun, et al.
Published: (2024)

Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model
by: Yang, Liu, et al.
Published: (2025)

DualNeRF: Text-Driven 3D Scene Editing via Dual-Field Representation
by: Xiong, Yuxuan, et al.
Published: (2025)

Towards Generalized Multi-Image Editing for Unified Multimodal Models
by: Xu, Pengcheng, et al.
Published: (2026)

InstantEdit: Text-Guided Few-Step Image Editing with Piecewise Rectified Flow
by: Gong, Yiming, et al.
Published: (2025)

How Control Information Influences Multilingual Text Image Generation and Editing?
by: Zhang, Boqiang, et al.
Published: (2024)

MotionVerse: A Unified Multimodal Framework for Motion Comprehension, Generation and Editing
by: Hou, Ruibing, et al.
Published: (2025)

EditVerse: Unifying Image and Video Editing and Generation with In-Context Learning
by: Ju, Xuan, et al.
Published: (2025)

DreamOmni: Unified Image Generation and Editing
by: Xia, Bin, et al.
Published: (2024)

En3D: An Enhanced Generative Model for Sculpting 3D Humans from 2D Synthetic Data
by: Men, Yifang, et al.
Published: (2024)

UM-Text: A Unified Multimodal Model for Image Understanding and Visual Text Editing
by: Ma, Lichen, et al.
Published: (2026)

ScribbleEdit: Synthetic Data for Image Editing with Scribbles and Text
by: Ji, Anya, et al.
Published: (2026)