Saved in:
| Main Authors: | Tao, Xinhao, Qiu, Tianyuan, Cao, Junyan, Niu, Li |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.15481 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling
by: Tao, Xinhao, et al.
Published: (2023)
by: Tao, Xinhao, et al.
Published: (2023)
Shadow Generation for Composite Image Using Diffusion model
by: Liu, Qingyang, et al.
Published: (2024)
by: Liu, Qingyang, et al.
Published: (2024)
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
by: Li, Xinhao, et al.
Published: (2023)
by: Li, Xinhao, et al.
Published: (2023)
Unbiased Object Detection Beyond Frequency with Visually Prompted Image Synthesis
by: Cai, Xinhao, et al.
Published: (2025)
by: Cai, Xinhao, et al.
Published: (2025)
Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
by: Su, Xiongfei, et al.
Published: (2025)
by: Su, Xiongfei, et al.
Published: (2025)
Learning to Customize Text-to-Image Diffusion In Diverse Context
by: Kim, Taewook, et al.
Published: (2024)
by: Kim, Taewook, et al.
Published: (2024)
IHF-Harmony: Multi-Modality Magnetic Resonance Images Harmonization using Invertible Hierarchy Flow Model
by: Zhu, Pengli, et al.
Published: (2026)
by: Zhu, Pengli, et al.
Published: (2026)
Retrieval Augmented Image Harmonization
by: Wang, Haolin, et al.
Published: (2024)
by: Wang, Haolin, et al.
Published: (2024)
Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation
by: Zhong, Xinhao, et al.
Published: (2024)
by: Zhong, Xinhao, et al.
Published: (2024)
SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
by: Zhang, Jiaming, et al.
Published: (2025)
by: Zhang, Jiaming, et al.
Published: (2025)
ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting
by: Zhang, Junyan, et al.
Published: (2025)
by: Zhang, Junyan, et al.
Published: (2025)
VISTAR:A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation
by: Jiang, Kaiyuan, et al.
Published: (2025)
by: Jiang, Kaiyuan, et al.
Published: (2025)
MVG4D: Image Matrix-Based Multi-View and Motion Generation for 4D Content Creation from a Single Image
by: Yin, DongFu, et al.
Published: (2025)
by: Yin, DongFu, et al.
Published: (2025)
Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories
by: Zhang, Xiaohan, et al.
Published: (2024)
by: Zhang, Xiaohan, et al.
Published: (2024)
Bridging the Intention-Expression Gap: Aligning Multi-Dimensional Preferences via Hierarchical Relevance Feedback in Text-to-Image Diffusion
by: Wang, Wenxi, et al.
Published: (2026)
by: Wang, Wenxi, et al.
Published: (2026)
Controllable Generation of Large-Scale 3D Urban Layouts with Semantic and Structural Guidance
by: Niu, Mengyuan, et al.
Published: (2025)
by: Niu, Mengyuan, et al.
Published: (2025)
PRIOR: Prototype Representation Joint Learning from Medical Images and Reports
by: Cheng, Pujin, et al.
Published: (2023)
by: Cheng, Pujin, et al.
Published: (2023)
The Diffusion Duet: Harmonizing Dual Channels with Wavelet Suppression for Image Separation
by: Li, Jingwei, et al.
Published: (2026)
by: Li, Jingwei, et al.
Published: (2026)
PFPs: Prompt-guided Flexible Pathological Segmentation for Diverse Potential Outcomes Using Large Vision and Language Models
by: Cui, Can, et al.
Published: (2024)
by: Cui, Can, et al.
Published: (2024)
Diverse and Tailored Image Generation for Zero-shot Multi-label Classification
by: Zhang, Kaixin, et al.
Published: (2024)
by: Zhang, Kaixin, et al.
Published: (2024)
HarmonPaint: Harmonized Training-Free Diffusion Inpainting
by: Li, Ying, et al.
Published: (2025)
by: Li, Ying, et al.
Published: (2025)
FakeVLM-R1: Internalizing Physical Laws via CoT for Synthetic Image Detection
by: Zhu, Leqi, et al.
Published: (2026)
by: Zhu, Leqi, et al.
Published: (2026)
Diagnosing Urban Street Vitality via a Visual-Semantic and Spatiotemporal Framework for Street-Level Economics
by: Zhuo, Xinxin, et al.
Published: (2026)
by: Zhuo, Xinxin, et al.
Published: (2026)
GenClaw: Code-Driven Agentic Image Generation
by: Ye, Junyan, et al.
Published: (2026)
by: Ye, Junyan, et al.
Published: (2026)
PHASE-Net: Physics-Grounded Harmonic Attention System for Efficient Remote Photoplethysmography Measurement
by: Zhao, Bo, et al.
Published: (2025)
by: Zhao, Bo, et al.
Published: (2025)
SSiT: Saliency-guided Self-supervised Image Transformer for Diabetic Retinopathy Grading
by: Huang, Yijin, et al.
Published: (2022)
by: Huang, Yijin, et al.
Published: (2022)
TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
by: Zhang, Zhongwei, et al.
Published: (2024)
by: Zhang, Zhongwei, et al.
Published: (2024)
Image Harmonization using Robust Restricted CDF Matching
by: Stoklasa, Roman
Published: (2024)
by: Stoklasa, Roman
Published: (2024)
Zero-Shot Image Harmonization with Generative Model Prior
by: Chen, Jianqi, et al.
Published: (2023)
by: Chen, Jianqi, et al.
Published: (2023)
DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation
by: Yang, Ying, et al.
Published: (2025)
by: Yang, Ying, et al.
Published: (2025)
OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild
by: Guo, Yuncheng, et al.
Published: (2025)
by: Guo, Yuncheng, et al.
Published: (2025)
DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning
by: Yuan, Tianyuan, et al.
Published: (2025)
by: Yuan, Tianyuan, et al.
Published: (2025)
DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data
by: Wu, Ruiqi, et al.
Published: (2025)
by: Wu, Ruiqi, et al.
Published: (2025)
OSInsert: Towards High-authenticity and High-fidelity Image Composition
by: Wang, Jingyuan, et al.
Published: (2026)
by: Wang, Jingyuan, et al.
Published: (2026)
Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
by: Ye, Junyan, et al.
Published: (2024)
by: Ye, Junyan, et al.
Published: (2024)
IDPruner: Harmonizing Importance and Diversity in Visual Token Pruning for MLLMs
by: Tan, Yifan, et al.
Published: (2026)
by: Tan, Yifan, et al.
Published: (2026)
Automatic Image Unfolding and Stitching Framework for Esophageal Lining Video Based on Density-Weighted Feature Matching
by: Li, Muyang, et al.
Published: (2024)
by: Li, Muyang, et al.
Published: (2024)
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
by: Wu, Size, et al.
Published: (2025)
by: Wu, Size, et al.
Published: (2025)
LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors
by: Dalva, Yusuf, et al.
Published: (2024)
by: Dalva, Yusuf, et al.
Published: (2024)
PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
by: Huang, Dingbang, et al.
Published: (2025)
by: Huang, Dingbang, et al.
Published: (2025)
Similar Items
-
Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling
by: Tao, Xinhao, et al.
Published: (2023) -
Shadow Generation for Composite Image Using Diffusion model
by: Liu, Qingyang, et al.
Published: (2024) -
ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
by: Li, Xinhao, et al.
Published: (2023) -
Unbiased Object Detection Beyond Frequency with Visually Prompted Image Synthesis
by: Cai, Xinhao, et al.
Published: (2025) -
Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
by: Su, Xiongfei, et al.
Published: (2025)