| Main Authors: | Alharbi, Yazeed, Wonka, Peter |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.12585 |
Similar Items
PartEdit: Fine-Grained Image Editing using Pre-Trained Diffusion Models
by: Cvejic, Aleksandar, et al.
Published: (2025)
EditCLIP: Representation Learning for Image Editing
by: Wang, Qian, et al.
Published: (2025)
LatentMan: Generating Consistent Animated Characters using Image Diffusion Models
by: Eldesokey, Abdelrahman, et al.
Published: (2023)
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation
by: Eldesokey, Abdelrahman, et al.
Published: (2024)
PrEditor3D: Fast and Precise 3D Shape Editing
by: Erkoç, Ziya, et al.
Published: (2024)
Latent Inversion with Timestep-aware Sampling for Training-free Non-rigid Editing
by: Jung, Yunji, et al.
Published: (2024)
SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing
by: Chen, Songyan, et al.
Published: (2024)
Training-free Geometric Image Editing on Diffusion Models
by: Zhu, Hanshen, et al.
Published: (2025)
Isharah: A Large-Scale Multi-Scene Dataset for Continuous Sign Language Recognition
by: Alyami, Sarah, et al.
Published: (2025)
No Mesh, No Problem: Estimating Coral Volume and Surface from Sparse Multi-View Images
by: Farchione, Diego Eustachio, et al.
Published: (2025)
PatchRefiner V2: Fast and Lightweight Real-Domain High-Resolution Metric Depth Estimation
by: Li, Zhenyu, et al.
Published: (2025)
Training-Free Image Editing with Visual Context Integration and Concept Alignment
by: Song, Rui, et al.
Published: (2026)
Geometry without Position? When Positional Embeddings Help and Hurt Spatial Reasoning
by: Shi, Jian, et al.
Published: (2026)
LaGeM: A Large Geometry Model for 3D Representation Learning and Diffusion
by: Zhang, Biao, et al.
Published: (2024)
efunc: An Efficient Function Representation without Neural Networks
by: Zhang, Biao, et al.
Published: (2025)
LaRI: Layered Ray Intersections for Single-view 3D Geometric Reasoning
by: Li, Rui, et al.
Published: (2025)
Training-Free Disentangled Text-Guided Image Editing via Sparse Latent Constraints
by: Shabrina, Mutiara, et al.
Published: (2025)
Training-free Stylized Text-to-Image Generation with Fast Inference
by: Ma, Xin, et al.
Published: (2025)
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying
by: Bai, Chengyu, et al.
Published: (2025)
AvatarMMC: 3D Head Avatar Generation and Editing with Multi-Modal Conditioning
by: Para, Wamiq Reyaz, et al.
Published: (2024)
FastEdit: Fast Text-Guided Single-Image Editing via Semantic-Aware Diffusion Fine-Tuning
by: Chen, Zhi, et al.
Published: (2024)
Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing
by: Hu, Taihang, et al.
Published: (2025)
ImmersePro: End-to-End Stereo Video Synthesis Via Implicit Disparity Learning
by: Shi, Jian, et al.
Published: (2024)
Generative Human Geometry Distribution
by: Tang, Xiangjun, et al.
Published: (2025)
SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images
by: Barel, Nir, et al.
Published: (2024)
Image-to-Image Translation with Disentangled Latent Vectors for Face Editing
by: Dalva, Yusuf, et al.
Published: (2023)
Fast Training-free Perceptual Image Compression
by: Zhu, Ziran, et al.
Published: (2025)
Deep Learning-based Image and Video Inpainting: A Survey
by: Quan, Weize, et al.
Published: (2024)
PatchAlign3D: Local Feature Alignment for Dense 3D Shape understanding
by: Hadgi, Souhail, et al.
Published: (2026)
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
by: Gani, Hanan, et al.
Published: (2023)
SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing
by: Xiao, Yicheng, et al.
Published: (2026)
PatchRefiner: Leveraging Synthetic Data for Real-Domain High-Resolution Monocular Metric Depth Estimation
by: Li, Zhenyu, et al.
Published: (2024)
Geometry Distributions
by: Zhang, Biao, et al.
Published: (2024)
Back to 3D: Few-Shot 3D Keypoint Detection with Back-Projected 2D Features
by: Wimmer, Thomas, et al.
Published: (2023)
WinSyn: A High Resolution Testbed for Synthetic Data
by: Kelly, Tom, et al.
Published: (2023)
Human Geometry Distribution for 3D Animation Generation
by: Tang, Xiangjun, et al.
Published: (2025)
Training-free Mixed-Resolution Latent Upsampling for Spatially Accelerated Diffusion Transformers
by: Jeong, Wongi, et al.
Published: (2025)
Zero-Shot Video Semantic Segmentation based on Pre-Trained Diffusion Models
by: Wang, Qian, et al.
Published: (2024)
Multimodal-Conditioned Latent Diffusion Models for Fashion Image Editing
by: Baldrati, Alberto, et al.
Published: (2024)
PIXELS: Progressive Image Xemplar-based Editing with Latent Surgery
by: Biswas, Shristi Das, et al.
Published: (2025)