Saved in:
| Main Authors: | Hara, Takayuki, Harada, Tatsuya |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.00345 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control
by: Fujiwara, Haruo, et al.
Published: (2025)
by: Fujiwara, Haruo, et al.
Published: (2025)
Detection Based Part-level Articulated Object Reconstruction from Single RGBD Image
by: Kawana, Yuki, et al.
Published: (2025)
by: Kawana, Yuki, et al.
Published: (2025)
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images
by: Cui, Ziteng, et al.
Published: (2024)
by: Cui, Ziteng, et al.
Published: (2024)
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
by: Fujiwara, Haruo, et al.
Published: (2024)
by: Fujiwara, Haruo, et al.
Published: (2024)
Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment
by: Cui, Ziteng, et al.
Published: (2025)
by: Cui, Ziteng, et al.
Published: (2025)
Discovering an Image-Adaptive Coordinate System for Photography Processing
by: Cui, Ziteng, et al.
Published: (2025)
by: Cui, Ziteng, et al.
Published: (2025)
CapTalk: Text-Guided Stylization and Speech-Driven 3D Head Animation
by: Chu, Xuangeng, et al.
Published: (2026)
by: Chu, Xuangeng, et al.
Published: (2026)
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images and A Benchmark
by: Cui, Ziteng, et al.
Published: (2025)
by: Cui, Ziteng, et al.
Published: (2025)
SceneProp: Combining Neural Network and Markov Random Field for Scene-Graph Grounding
by: Otani, Keita, et al.
Published: (2025)
by: Otani, Keita, et al.
Published: (2025)
Generalizable and Animatable Gaussian Head Avatar
by: Chu, Xuangeng, et al.
Published: (2024)
by: Chu, Xuangeng, et al.
Published: (2024)
ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model
by: Chu, Xuangeng, et al.
Published: (2025)
by: Chu, Xuangeng, et al.
Published: (2025)
Find n' Propagate: Open-Vocabulary 3D Object Detection in Urban Environments
by: Etchegaray, Djamahl, et al.
Published: (2024)
by: Etchegaray, Djamahl, et al.
Published: (2024)
Combining inherent knowledge of vision-language models with unsupervised domain adaptation through strong-weak guidance
by: Westfechtel, Thomas, et al.
Published: (2023)
by: Westfechtel, Thomas, et al.
Published: (2023)
OBJVanish: Physically Realizable Text-to-3D Adv. Generation of LiDAR-Invisible Objects
by: Li, Bing, et al.
Published: (2025)
by: Li, Bing, et al.
Published: (2025)
Text-Image Conditioned 3D Generation
by: Cen, Jiazhong, et al.
Published: (2026)
by: Cen, Jiazhong, et al.
Published: (2026)
MaPa: Text-driven Photorealistic Material Painting for 3D Shapes
by: Zhang, Shangzan, et al.
Published: (2024)
by: Zhang, Shangzan, et al.
Published: (2024)
GPAvatar: Generalizable and Precise Head Avatar from Image(s)
by: Chu, Xuangeng, et al.
Published: (2024)
by: Chu, Xuangeng, et al.
Published: (2024)
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
by: Zhou, Kangneng, et al.
Published: (2023)
by: Zhou, Kangneng, et al.
Published: (2023)
DEJIMA: A Novel Large-scale Japanese Dataset for Image Captioning and Visual Question Answering
by: Katsube, Toshiki, et al.
Published: (2025)
by: Katsube, Toshiki, et al.
Published: (2025)
Unifying Color and Lightness Correction with View-Adaptive Curve Adjustment for Robust 3D Novel View Synthesis
by: Cui, Ziteng, et al.
Published: (2026)
by: Cui, Ziteng, et al.
Published: (2026)
3D Object Manipulation in a Single Image using Generative Models
by: Zhao, Ruisi, et al.
Published: (2025)
by: Zhao, Ruisi, et al.
Published: (2025)
Learning Continuous 3D Words for Text-to-Image Generation
by: Cheng, Ta-Ying, et al.
Published: (2024)
by: Cheng, Ta-Ying, et al.
Published: (2024)
OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting
by: Gao, Penglei, et al.
Published: (2024)
by: Gao, Penglei, et al.
Published: (2024)
Manipulating Vehicle 3D Shapes through Latent Space Editing
by: Miao, JiangDong, et al.
Published: (2024)
by: Miao, JiangDong, et al.
Published: (2024)
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
by: Höllein, Lukas, et al.
Published: (2024)
by: Höllein, Lukas, et al.
Published: (2024)
PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion
by: Liu, Ying-Tian, et al.
Published: (2023)
by: Liu, Ying-Tian, et al.
Published: (2023)
DreamMesh: Jointly Manipulating and Texturing Triangle Meshes for Text-to-3D Generation
by: Yang, Haibo, et al.
Published: (2024)
by: Yang, Haibo, et al.
Published: (2024)
Text-guided Synthetic Geometric Augmentation for Zero-shot 3D Understanding
by: Torimi, Kohei, et al.
Published: (2025)
by: Torimi, Kohei, et al.
Published: (2025)
LAMP: Lift Image-Editing as General 3D Priors for Open-world Manipulation
by: Wang, Jingjing, et al.
Published: (2026)
by: Wang, Jingjing, et al.
Published: (2026)
Frequency-aware Feature Fusion for Dense Image Prediction
by: Chen, Linwei, et al.
Published: (2024)
by: Chen, Linwei, et al.
Published: (2024)
3D Space as a Scratchpad for Editable Text-to-Image Generation
by: Saha, Oindrila, et al.
Published: (2026)
by: Saha, Oindrila, et al.
Published: (2026)
SIC3D: Style Image Conditioned Text-to-3D Gaussian Splatting Generation
by: He, Ming, et al.
Published: (2026)
by: He, Ming, et al.
Published: (2026)
OmniText: A Training-Free Generalist for Controllable Text-Image Manipulation
by: Gunawan, Agus, et al.
Published: (2025)
by: Gunawan, Agus, et al.
Published: (2025)
DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation
by: Lyu, Yueming, et al.
Published: (2023)
by: Lyu, Yueming, et al.
Published: (2023)
I2-NeRF: Learning Neural Radiance Fields Under Physically-Grounded Media Interactions
by: Liu, Shuhong, et al.
Published: (2025)
by: Liu, Shuhong, et al.
Published: (2025)
Prompt Augmentation for Self-supervised Text-guided Image Manipulation
by: Bodur, Rumeysa, et al.
Published: (2024)
by: Bodur, Rumeysa, et al.
Published: (2024)
Detecting Text Manipulation in Images using Vision Language Models
by: Vidit, Vidit, et al.
Published: (2025)
by: Vidit, Vidit, et al.
Published: (2025)
Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
by: Zhang, Jinlu, et al.
Published: (2024)
by: Zhang, Jinlu, et al.
Published: (2024)
Text-to-3D Shape Generation
by: Lee, Han-Hung, et al.
Published: (2024)
by: Lee, Han-Hung, et al.
Published: (2024)
PortraVec: Image-Based Portrait Vectorization with Text-Guided Manipulation
by: Liang, Yiqi, et al.
Published: (2024)
by: Liang, Yiqi, et al.
Published: (2024)
Similar Items
-
Improved 3D Scene Stylization via Text-Guided Generative Image Editing with Region-Based Control
by: Fujiwara, Haruo, et al.
Published: (2025) -
Detection Based Part-level Articulated Object Reconstruction from Single RGBD Image
by: Kawana, Yuki, et al.
Published: (2025) -
RAW-Adapter: Adapting Pre-trained Visual Model to Camera RAW Images
by: Cui, Ziteng, et al.
Published: (2024) -
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
by: Fujiwara, Haruo, et al.
Published: (2024) -
Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve Adjustment
by: Cui, Ziteng, et al.
Published: (2025)