:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tao, Xinhao, Qiu, Tianyuan, Cao, Junyan, Niu, Li
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2407.15481
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Shadow Generation with Decomposed Mask Prediction and Attentive Shadow Filling
by: Tao, Xinhao, et al.
Published: (2023)

Shadow Generation for Composite Image Using Diffusion model
by: Liu, Qingyang, et al.
Published: (2024)

ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video
by: Li, Xinhao, et al.
Published: (2023)

Unbiased Object Detection Beyond Frequency with Visually Prompted Image Synthesis
by: Cai, Xinhao, et al.
Published: (2025)

Prior-guided Hierarchical Harmonization Network for Efficient Image Dehazing
by: Su, Xiongfei, et al.
Published: (2025)

Learning to Customize Text-to-Image Diffusion In Diverse Context
by: Kim, Taewook, et al.
Published: (2024)

IHF-Harmony: Multi-Modality Magnetic Resonance Images Harmonization using Invertible Hierarchy Flow Model
by: Zhu, Pengli, et al.
Published: (2026)

Retrieval Augmented Image Harmonization
by: Wang, Haolin, et al.
Published: (2024)

Hierarchical Features Matter: A Deep Exploration of Progressive Parameterization Method for Dataset Distillation
by: Zhong, Xinhao, et al.
Published: (2024)

SteadyDancer: Harmonized and Coherent Human Image Animation with First-Frame Preservation
by: Zhang, Jiaming, et al.
Published: (2025)

ESDiff: Encoding Strategy-inspired Diffusion Model with Few-shot Learning for Color Image Inpainting
by: Zhang, Junyan, et al.
Published: (2025)

VISTAR:A User-Centric and Role-Driven Benchmark for Text-to-Image Evaluation
by: Jiang, Kaiyuan, et al.
Published: (2025)

MVG4D: Image Matrix-Based Multi-View and Motion Generation for 4D Content Creation from a Single Image
by: Yin, DongFu, et al.
Published: (2025)

Toy-GS: Assembling Local Gaussians for Precisely Rendering Large-Scale Free Camera Trajectories
by: Zhang, Xiaohan, et al.
Published: (2024)

Bridging the Intention-Expression Gap: Aligning Multi-Dimensional Preferences via Hierarchical Relevance Feedback in Text-to-Image Diffusion
by: Wang, Wenxi, et al.
Published: (2026)

Controllable Generation of Large-Scale 3D Urban Layouts with Semantic and Structural Guidance
by: Niu, Mengyuan, et al.
Published: (2025)

PRIOR: Prototype Representation Joint Learning from Medical Images and Reports
by: Cheng, Pujin, et al.
Published: (2023)

The Diffusion Duet: Harmonizing Dual Channels with Wavelet Suppression for Image Separation
by: Li, Jingwei, et al.
Published: (2026)

PFPs: Prompt-guided Flexible Pathological Segmentation for Diverse Potential Outcomes Using Large Vision and Language Models
by: Cui, Can, et al.
Published: (2024)

Diverse and Tailored Image Generation for Zero-shot Multi-label Classification
by: Zhang, Kaixin, et al.
Published: (2024)

HarmonPaint: Harmonized Training-Free Diffusion Inpainting
by: Li, Ying, et al.
Published: (2025)

FakeVLM-R1: Internalizing Physical Laws via CoT for Synthetic Image Detection
by: Zhu, Leqi, et al.
Published: (2026)

Diagnosing Urban Street Vitality via a Visual-Semantic and Spatiotemporal Framework for Street-Level Economics
by: Zhuo, Xinxin, et al.
Published: (2026)

GenClaw: Code-Driven Agentic Image Generation
by: Ye, Junyan, et al.
Published: (2026)

PHASE-Net: Physics-Grounded Harmonic Attention System for Efficient Remote Photoplethysmography Measurement
by: Zhao, Bo, et al.
Published: (2025)

SSiT: Saliency-guided Self-supervised Image Transformer for Diabetic Retinopathy Grading
by: Huang, Yijin, et al.
Published: (2022)

TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models
by: Zhang, Zhongwei, et al.
Published: (2024)

Image Harmonization using Robust Restricted CDF Matching
by: Stoklasa, Roman
Published: (2024)

Zero-Shot Image Harmonization with Generative Model Prior
by: Chen, Jianqi, et al.
Published: (2023)

DiverseAR: Boosting Diversity in Bitwise Autoregressive Image Generation
by: Yang, Ying, et al.
Published: (2025)

OmniAID: Decoupling Semantic and Artifacts for Universal AI-Generated Image Detection in the Wild
by: Guo, Yuncheng, et al.
Published: (2025)

DepthVLA: Enhancing Vision-Language-Action Models with Depth-Aware Spatial Reasoning
by: Yuan, Tianyuan, et al.
Published: (2025)

DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data
by: Wu, Ruiqi, et al.
Published: (2025)

OSInsert: Towards High-authenticity and High-fidelity Image Composition
by: Wang, Jingyuan, et al.
Published: (2026)

Leveraging BEV Paradigm for Ground-to-Aerial Image Synthesis
by: Ye, Junyan, et al.
Published: (2024)

IDPruner: Harmonizing Importance and Diversity in Visual Token Pruning for MLLMs
by: Tan, Yifan, et al.
Published: (2026)

Automatic Image Unfolding and Stitching Framework for Esophageal Lining Video Based on Density-Weighted Feature Matching
by: Li, Muyang, et al.
Published: (2024)

Harmonizing Visual Representations for Unified Multimodal Understanding and Generation
by: Wu, Size, et al.
Published: (2025)

LayerFusion: Harmonized Multi-Layer Text-to-Image Generation with Generative Priors
by: Dalva, Yusuf, et al.
Published: (2024)

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment
by: Huang, Dingbang, et al.
Published: (2025)