:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wu, You, Liu, Kean, Mi, Xiaoyue, Tang, Fan, Cao, Juan, Li, Jintao
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.20231
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Visual-Friendly Concept Protection via Selective Adversarial Perturbations
by: Mi, Xiaoyue, et al.
Published: (2024)

Interactive Visual Assessment for Text-to-Image Generation Models
by: Mi, Xiaoyue, et al.
Published: (2024)

Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
by: Lin, Jiaqi, et al.
Published: (2025)

Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation
by: Mi, Xiaoyue, et al.
Published: (2023)

ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model
by: Chen, Binghui, et al.
Published: (2024)

MoSA: Motion-Coherent Human Video Generation via Structure-Appearance Decoupling
by: Wang, Haoyu, et al.
Published: (2025)

GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generation
by: Zhang, Hao, et al.
Published: (2026)

MAD: Motion Appearance Decoupling for efficient Driving World Models
by: Rahimi, Ahmad, et al.
Published: (2026)

OptiSAR-Net++: A Large-Scale Benchmark and Transformer-Free Framework for Cross-Domain Remote Sensing Visual Grounding
by: Tang, Xiaoyu, et al.
Published: (2026)

Mema: Memory-Augmented Adapter for Enhanced Vision-Language Understanding
by: Liu, Ying, et al.
Published: (2026)

PEGAsus: 3D Personalization of Geometry and Appearance
by: Hu, Jingyu, et al.
Published: (2026)

VAP-Diffusion: Enriching Descriptions with MLLMs for Enhanced Medical Image Generation
by: Huang, Peng, et al.
Published: (2025)

DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
by: Nam, Jisu, et al.
Published: (2024)

3D-UIR: 3D Gaussian for Underwater 3D Scene Reconstruction via Physics Based Appearance-Medium Decoupling
by: Yuan, Jieyu, et al.
Published: (2025)

Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield
by: Liu, Dongyang, et al.
Published: (2025)

V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping
by: Lee, Hyunkoo, et al.
Published: (2025)

Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation
by: Chen, Yingjie, et al.
Published: (2026)

Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm
by: Wu, Yi, et al.
Published: (2024)

In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
by: Xu, Yu, et al.
Published: (2025)

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning
by: Chen, Chubin, et al.
Published: (2025)

Parameter-Efficient Semantic Augmentation for Enhancing Open-Vocabulary Object Detection
by: Cao, Weihao, et al.
Published: (2026)

Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
by: Huang, Ziyao, et al.
Published: (2024)

Personal Visual Context Learning in Large Multimodal Models
by: Xue, Zihui, et al.
Published: (2026)

Improving Adversarial Robustness via Decoupled Visual Representation Masking
by: Liu, Decheng, et al.
Published: (2024)

Adjustable Visual Appearance for Generalizable Novel View Synthesis
by: Bengtson, Josef, et al.
Published: (2023)

Beyond Appearance: Transformer-based Person Identification from Conversational Dynamics
by: Chapariniya, Masoumeh, et al.
Published: (2025)

One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control
by: Mi, Zhenxing, et al.
Published: (2025)

AnchorCrafter: Animate Cyber-Anchors Selling Your Products via Human-Object Interacting Video Generation
by: Xu, Ziyi, et al.
Published: (2024)

EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision
by: Cao, Yifei, et al.
Published: (2025)

Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving
by: Savkin, Artem, et al.
Published: (2025)

YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection
by: Tang, Xiaoyu, et al.
Published: (2024)

FROMAT: Multiview Material Appearance Transfer via Few-Shot Self-Attention Adaptation
by: Kompanowski, Hubert, et al.
Published: (2025)

VLA-IAP: Training-Free Visual Token Pruning via Interaction Alignment for Vision-Language-Action Models
by: Cheng, Jintao, et al.
Published: (2026)

Harnessing Weak Pair Uncertainty for Text-based Person Search
by: Sun, Jintao, et al.
Published: (2026)

SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference
by: Khaki, Samir, et al.
Published: (2025)

Joint Geometry-Appearance Human Reconstruction in a Unified Latent Space via Bridge Diffusion
by: Tang, Yingzhi, et al.
Published: (2026)

Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification
by: Dong, Neng, et al.
Published: (2025)

Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting
by: Xu, Jingyi, et al.
Published: (2024)

Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions
by: Wu, Tianxu, et al.
Published: (2023)

Integrating Language-Derived Appearance Elements with Visual Cues in Pedestrian Detection
by: Park, Sungjune, et al.
Published: (2023)