Saved in:
Bibliographic Details
Main Authors: Guo, Lanqing, Liu, Xi, Wang, Yufei, Li, Zhihao, Huang, Siyu
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.14443
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914331560509440
author Guo, Lanqing
Liu, Xi
Wang, Yufei
Li, Zhihao
Huang, Siyu
author_facet Guo, Lanqing
Liu, Xi
Wang, Yufei
Li, Zhihao
Huang, Siyu
contents Recent advances in image generation have achieved remarkable visual quality, while a fundamental challenge remains: Can image generation be controlled at the element level, enabling intuitive modifications such as adjusting shapes, altering colors, or adding and removing objects? In this work, we address this challenge by introducing layer-wise controllable generation through simplified vector graphics (VGs). Our approach first efficiently parses images into hierarchical VG representations that are semantic-aligned and structurally coherent. Building on this representation, we design a novel image synthesis framework guided by VGs, allowing users to freely modify elements and seamlessly translate these edits into photorealistic outputs. By leveraging the structural and semantic features of VGs in conjunction with noise prediction, our method provides precise control over geometry, color, and object semantics. Extensive experiments demonstrate the effectiveness of our approach in diverse applications, including image editing, object-level manipulation, and fine-grained content creation, establishing a new paradigm for controllable image generation. Project page: https://guolanqing.github.io/Vec2Pix/
format Preprint
id arxiv_https___arxiv_org_abs_2602_14443
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Controlling Your Image via Simplified Vector Graphics
Guo, Lanqing
Liu, Xi
Wang, Yufei
Li, Zhihao
Huang, Siyu
Computer Vision and Pattern Recognition
Recent advances in image generation have achieved remarkable visual quality, while a fundamental challenge remains: Can image generation be controlled at the element level, enabling intuitive modifications such as adjusting shapes, altering colors, or adding and removing objects? In this work, we address this challenge by introducing layer-wise controllable generation through simplified vector graphics (VGs). Our approach first efficiently parses images into hierarchical VG representations that are semantic-aligned and structurally coherent. Building on this representation, we design a novel image synthesis framework guided by VGs, allowing users to freely modify elements and seamlessly translate these edits into photorealistic outputs. By leveraging the structural and semantic features of VGs in conjunction with noise prediction, our method provides precise control over geometry, color, and object semantics. Extensive experiments demonstrate the effectiveness of our approach in diverse applications, including image editing, object-level manipulation, and fine-grained content creation, establishing a new paradigm for controllable image generation. Project page: https://guolanqing.github.io/Vec2Pix/
title Controlling Your Image via Simplified Vector Graphics
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2602.14443