:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Hara, Takayuki, Harada, Tatsuya
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.00345
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control
by: Fujiwara, Haruo, et al.
Published: (2025)

Detection Based Part-level Articulated Object Reconstruction from Single RGBD Image
by: Kawana, Yuki, et al.
Published: (2025)

RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images
by: Cui, Ziteng, et al.
Published: (2024)

Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
by: Fujiwara, Haruo, et al.
Published: (2024)

Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment
by: Cui, Ziteng, et al.
Published: (2025)

Discovering an Image-Adaptive Coordinate System for Photography Processing
by: Cui, Ziteng, et al.
Published: (2025)

CapTalk: Text-Guided Stylization and Speech-Driven 3D Head Animation
by: Chu, Xuangeng, et al.
Published: (2026)

RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images and A Benchmark
by: Cui, Ziteng, et al.
Published: (2025)

SceneProp: Combining Neural Network and Markov Random Field for Scene-Graph Grounding
by: Otani, Keita, et al.
Published: (2025)

Generalizable and Animatable Gaussian Head Avatar
by: Chu, Xuangeng, et al.
Published: (2024)

ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model
by: Chu, Xuangeng, et al.
Published: (2025)

Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
by: Etchegaray, Djamahl, et al.
Published: (2024)

Combining inherent knowledge of vision-language models with unsupervised domain adaptation through strong-weak guidance
by: Westfechtel, Thomas, et al.
Published: (2023)

OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects
by: Li, Bing, et al.
Published: (2025)

Text-Image Conditioned 3D Generation
by: Cen, Jiazhong, et al.
Published: (2026)

MaPa: Text-driven Photorealistic Material Painting for 3D Shapes
by: Zhang, Shangzan, et al.
Published: (2024)

GPAvatar: Generalizable and Precise Head Avatar from Image(s)
by: Chu, Xuangeng, et al.
Published: (2024)

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
by: Zhou, Kangneng, et al.
Published: (2023)

DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering
by: Katsube, Toshiki, et al.
Published: (2025)

Unifying Color and Lightness Correction with View-Adaptive Curve Adjustment for Robust 3D Novel View Synthesis
by: Cui, Ziteng, et al.
Published: (2026)

3D Object Manipulation in a Single Image using Generative Models
by: Zhao, Ruisi, et al.
Published: (2025)

Learning Continuous 3D Words for Text-to-Image Generation
by: Cheng, Ta-Ying, et al.
Published: (2024)

OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting
by: Gao, Penglei, et al.
Published: (2024)

Manipulating Vehicle 3D Shapes through Latent Space Editing
by: Miao, JiangDong, et al.
Published: (2024)

ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
by: Höllein, Lukas, et al.
Published: (2024)

PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion
by: Liu, Ying-Tian, et al.
Published: (2023)

DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation
by: Yang, Haibo, et al.
Published: (2024)

Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding
by: Torimi, Kohei, et al.
Published: (2025)

LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation
by: Wang, Jingjing, et al.
Published: (2026)

Frequency-aware Feature Fusion for Dense Image Prediction
by: Chen, Linwei, et al.
Published: (2024)

3D Space as a Scratchpad for Editable Text-to-Image Generation
by: Saha, Oindrila, et al.
Published: (2026)

SIC3D: Style Image Conditioned Text-to-3D Gaussian Splatting Generation
by: He, Ming, et al.
Published: (2026)

OmniText: A Training-Free Generalist for Controllable Text-Image Manipulation
by: Gunawan, Agus, et al.
Published: (2025)

DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
by: Lyu, Yueming, et al.
Published: (2023)

I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions
by: Liu, Shuhong, et al.
Published: (2025)

Prompt Augmentation for Self-supervised Text-guided Image Manipulation
by: Bodur, Rumeysa, et al.
Published: (2024)

Detecting Text Manipulation in Images using Vision Language Models
by: Vidit, Vidit, et al.
Published: (2025)

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
by: Zhang, Jinlu, et al.
Published: (2024)

Text-to-3D Shape Generation
by: Lee, Han-Hung, et al.
Published: (2024)

PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation
by: Liang, Yiqi, et al.
Published: (2024)