| Main Authors: | Zhou, Hang; Zuo, Xinxin; Wang, Sen; Cheng, Li |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.06873 |
Similar Items
PICS: Pipeline for Image Captioning and Search
by: Rosario, Grant, et al.
Published: (2024)
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers
by: Zhou, Hang, et al.
Published: (2025)
PICS in Pics: Physics Informed Contour Selection for Rapid Image Segmentation
by: Dwivedi, Vikas, et al.
Published: (2023)
FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors
by: Zhang, Yabo, et al.
Published: (2025)
Highly Efficient 3D Human Pose Tracking from Events with Spiking Spatiotemporal Transformer
by: Zou, Shihao, et al.
Published: (2023)
Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning
by: Wang, Yibin, et al.
Published: (2025)
Multimodal Information Interaction for Medical Image Segmentation
by: Fan, Xinxin, et al.
Published: (2024)
Pairwise Similarity Regularization for Semi-supervised Graph Medical Image Segmentation
by: Zhou, Jialu, et al.
Published: (2025)
Learning Spatially Decoupled Color Representations for Facial Image Colorization
by: Zhu, Hangyan, et al.
Published: (2024)
NOAH: Learning Pairwise Object Category Attentions for Image Classification
by: Li, Chao, et al.
Published: (2024)
Image Copy-Move Forgery Detection via Deep PatchMatch and Pairwise Ranking Learning
by: Li, Yuanman, et al.
Published: (2024)
MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion
by: Li, Sen, et al.
Published: (2024)
Diverse Generation while Maintaining Semantic Coordination: A Diffusion-Based Data Augmentation Method for Object Detection
by: Nie, Sen, et al.
Published: (2024)
PSR: Scaling Multi-Subject Personalized Image Generation with Pairwise Subject-Consistency Rewards
by: Wang, Shulei, et al.
Published: (2025)
DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition
by: Liu, Jiacheng, et al.
Published: (2024)
MVS-TTA: Test-Time Adaptation for Multi-View Stereo via Meta-Auxiliary Learning
by: Zhang, Hannuo, et al.
Published: (2025)
Activating Wider Areas in Image Super-Resolution
by: Cheng, Cheng, et al.
Published: (2024)
ACE: Anti-Editing Concept Erasure in Text-to-Image Models
by: Wang, Zihao, et al.
Published: (2025)
Pairwise Alignment & Compatibility for Arbitrarily Irregular Image Fragments
by: Shahar, Ofir Itzhak, et al.
Published: (2025)
BEV-VAE: Multi-view Image Generation with Spatial Consistency for Autonomous Driving
by: Chen, Zeming, et al.
Published: (2025)
InterCoG: Towards Spatially Precise Image Editing with Interleaved Chain-of-Grounding Reasoning
by: Wan, Yecong, et al.
Published: (2026)
PointMAC: Meta-Learned Adaptation for Robust Test-Time Point Cloud Completion
by: Jiang, Linlian, et al.
Published: (2025)
SUIT: Spatial-Spectral Union-Intersection Interaction Network for Hyperspectral Object Tracking
by: Xiong, Fengchao, et al.
Published: (2025)
ALDI-ray: Adapting the ALDI Framework for Security X-ray Object Detection
by: Heidari, Omid Reza, et al.
Published: (2025)
Generative Human Motion Stylization in Latent Space
by: Guo, Chuan, et al.
Published: (2024)
DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers
by: Wang, Zitong, et al.
Published: (2025)
ModeTv2: GPU-accelerated Motion Decomposition Transformer for Pairwise Optimization in Medical Image Registration
by: Wang, Haiqiao, et al.
Published: (2024)
UniRestorer: Universal Image Restoration via Adaptively Estimating Image Degradation at Proper Granularity
by: Lin, Jingbo, et al.
Published: (2024)
PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data
by: Huang, Shijie, et al.
Published: (2025)
Noise Calibration and Spatial-Frequency Interactive Network for STEM Image Enhancement
by: Li, Hesong, et al.
Published: (2025)
Dual-Path Coupled Image Deraining Network via Spatial-Frequency Interaction
by: He, Yuhong, et al.
Published: (2024)
Sketch2PoseNet: Efficient and Generalized Sketch to 3D Human Pose Prediction
by: Wang, Li, et al.
Published: (2025)
Multi-hop Relational Contrastive Learning: Extending Spatial Contrastive Pre-training Beyond Pairwise Relations
by: Ahmed, Sheikh Tanvir, et al.
Published: (2026)
RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation
by: Pang, Lexi, et al.
Published: (2025)
iDiff: Interpretable Difference-aware Framework for Pairwise Image Quality Assessment
by: Yue, Xinli, et al.
Published: (2026)
Universal Medical Image Representation Learning with Compositional Decoders
by: Wang, Kaini, et al.
Published: (2024)
GenHOI: Towards Object-Consistent Hand-Object Interaction with Temporally Balanced and Spatially Selective Object Injection
by: Huang, Xuan, et al.
Published: (2026)
FACT-GS: Frequency-Aligned Complexity-Aware Texture Reparameterization for 2D Gaussian Splatting
by: Xie, Tianhao, et al.
Published: (2025)
Learning Semi-Supervised Medical Image Segmentation from Spatial Registration
by: Liu, Qianying, et al.
Published: (2024)
ONE-SHOT: Compositional Human-Environment Video Synthesis via Spatial-Decoupled Motion Injection and Hybrid Context Integration
by: Yang, Fengyuan, et al.
Published: (2026)