Saved in:
Bibliographic Details
Main Authors: Hsiao, Teng-Fang, Ruan, Bo-Kai, Liu, Yu-Lun, Shuai, Hong-Han
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.04349
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914522894172160
author Hsiao, Teng-Fang
Ruan, Bo-Kai
Liu, Yu-Lun
Shuai, Hong-Han
author_facet Hsiao, Teng-Fang
Ruan, Bo-Kai
Liu, Yu-Lun
Shuai, Hong-Han
contents 3D editing has emerged as a critical research area to provide users with flexible control over 3D assets. While current editing approaches predominantly focus on 3D Gaussian Splatting or multi-view images, the direct editing of 3D meshes remains underexplored. Prior attempts, such as VoxHammer, rely on voxel-based representations that suffer from limited resolution and necessitate labor-intensive 3D mask. To address these limitations, we propose \textbf{VecSet-Edit}, the first pipeline that leverages the high-fidelity VecSet Large Reconstruction Model (LRM) as a backbone for mesh editing. Our approach is grounded on a analysis of the spatial properties in VecSet tokens, revealing that token subsets govern distinct geometric regions. Based on this insight, we introduce Mask-guided Token Seeding and Attention-aligned Token Gating strategies to precisely localize target regions using only 2D image conditions. Also, considering the difference between VecSet diffusion process versus voxel we design a Drift-aware Token Pruning to reject geometric outliers during the denoising process. Finally, our Detail-preserving Texture Baking module ensures that we not only preserve the geometric details of original mesh but also the textural information. More details can be found in our project page: https://github.com/BlueDyee/VecSet-Edit/tree/main
format Preprint
id arxiv_https___arxiv_org_abs_2602_04349
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image
Hsiao, Teng-Fang
Ruan, Bo-Kai
Liu, Yu-Lun
Shuai, Hong-Han
Computer Vision and Pattern Recognition
Artificial Intelligence
3D editing has emerged as a critical research area to provide users with flexible control over 3D assets. While current editing approaches predominantly focus on 3D Gaussian Splatting or multi-view images, the direct editing of 3D meshes remains underexplored. Prior attempts, such as VoxHammer, rely on voxel-based representations that suffer from limited resolution and necessitate labor-intensive 3D mask. To address these limitations, we propose \textbf{VecSet-Edit}, the first pipeline that leverages the high-fidelity VecSet Large Reconstruction Model (LRM) as a backbone for mesh editing. Our approach is grounded on a analysis of the spatial properties in VecSet tokens, revealing that token subsets govern distinct geometric regions. Based on this insight, we introduce Mask-guided Token Seeding and Attention-aligned Token Gating strategies to precisely localize target regions using only 2D image conditions. Also, considering the difference between VecSet diffusion process versus voxel we design a Drift-aware Token Pruning to reject geometric outliers during the denoising process. Finally, our Detail-preserving Texture Baking module ensures that we not only preserve the geometric details of original mesh but also the textural information. More details can be found in our project page: https://github.com/BlueDyee/VecSet-Edit/tree/main
title VecSet-Edit: Unleashing Pre-trained LRM for Mesh Editing from Single Image
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2602.04349