Saved in:
Bibliographic Details
Main Authors: Wang, Wenda, Liu, Anqi, Yang, Junqi, He, Lei, Wang, Luying, Lu, Jiachen, Huang, Weixin
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.20733
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916030886969344
author Wang, Wenda
Liu, Anqi
Yang, Junqi
He, Lei
Wang, Luying
Lu, Jiachen
Huang, Weixin
author_facet Wang, Wenda
Liu, Anqi
Yang, Junqi
He, Lei
Wang, Luying
Lu, Jiachen
Huang, Weixin
contents Converting hand-drawn sketches into structured 3D geometries remains challenging due to the difficulty of representing non-Euclidean surfaces and maintaining topological consistency. Existing generative models such as GANs, NeRFs, and diffusion architectures often fail to produce editable manifolds directly usable in downstream design workflows. We present Sketch2MinSurf, a hybrid vision-language and geometric optimization framework that integrates vision-language guidance with minimal-surface theory to generate smooth and editable 3D surfaces from hand-drawn sketches. The core of our approach is a spatial-topological encoding that represents geometry as tuples of node coordinates and real/virtual edge skeletons, enabling stable topological control during generation. We further introduce the Sketch2MinSurf Structural Loss (S2MS-Loss), a reward-modulated objective that jointly constrains geometric reconstruction and topological coherence. On a test set of 100 sketches, Sketch2MinSurf achieves a topological similarity score of 0.844, outperforming existing sketch-to-shape baselines. The generated manifolds are directly editable and free from non-manifold artifacts. A public art installation at a university showcases the method's potential for human-intent-driven 3D form generation. The dataset and code are available at https://anonymous.4open.science/r/Sketch2MinSurf/.
format Preprint
id arxiv_https___arxiv_org_abs_2605_20733
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Sketch2MinSurf: Vision-Language Guided Generation of Editable Minimal Surfaces from Hand-Drawn Sketches
Wang, Wenda
Liu, Anqi
Yang, Junqi
He, Lei
Wang, Luying
Lu, Jiachen
Huang, Weixin
Computer Vision and Pattern Recognition
Converting hand-drawn sketches into structured 3D geometries remains challenging due to the difficulty of representing non-Euclidean surfaces and maintaining topological consistency. Existing generative models such as GANs, NeRFs, and diffusion architectures often fail to produce editable manifolds directly usable in downstream design workflows. We present Sketch2MinSurf, a hybrid vision-language and geometric optimization framework that integrates vision-language guidance with minimal-surface theory to generate smooth and editable 3D surfaces from hand-drawn sketches. The core of our approach is a spatial-topological encoding that represents geometry as tuples of node coordinates and real/virtual edge skeletons, enabling stable topological control during generation. We further introduce the Sketch2MinSurf Structural Loss (S2MS-Loss), a reward-modulated objective that jointly constrains geometric reconstruction and topological coherence. On a test set of 100 sketches, Sketch2MinSurf achieves a topological similarity score of 0.844, outperforming existing sketch-to-shape baselines. The generated manifolds are directly editable and free from non-manifold artifacts. A public art installation at a university showcases the method's potential for human-intent-driven 3D form generation. The dataset and code are available at https://anonymous.4open.science/r/Sketch2MinSurf/.
title Sketch2MinSurf: Vision-Language Guided Generation of Editable Minimal Surfaces from Hand-Drawn Sketches
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2605.20733