Saved in:
| Main Authors: | Wu, You, Liu, Kean, Mi, Xiaoyue, Tang, Fan, Cao, Juan, Li, Jintao |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.20231 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Visual-Friendly Concept Protection via Selective Adversarial Perturbations
by: Mi, Xiaoyue, et al.
Published: (2024)
by: Mi, Xiaoyue, et al.
Published: (2024)
Interactive Visual Assessment for Text-to-Image Generation Models
by: Mi, Xiaoyue, et al.
Published: (2024)
by: Mi, Xiaoyue, et al.
Published: (2024)
Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
by: Lin, Jiaqi, et al.
Published: (2025)
by: Lin, Jiaqi, et al.
Published: (2025)
Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation
by: Mi, Xiaoyue, et al.
Published: (2023)
by: Mi, Xiaoyue, et al.
Published: (2023)
ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model
by: Chen, Binghui, et al.
Published: (2024)
by: Chen, Binghui, et al.
Published: (2024)
MoSA: Motion-Coherent Human Video Generation via Structure-Appearance Decoupling
by: Wang, Haoyu, et al.
Published: (2025)
by: Wang, Haoyu, et al.
Published: (2025)
GA-Drive: Geometry-Appearance Decoupled Modeling for Free-viewpoint Driving Scene Generation
by: Zhang, Hao, et al.
Published: (2026)
by: Zhang, Hao, et al.
Published: (2026)
MAD: Motion Appearance Decoupling for efficient Driving World Models
by: Rahimi, Ahmad, et al.
Published: (2026)
by: Rahimi, Ahmad, et al.
Published: (2026)
OptiSAR-Net++: A Large-Scale Benchmark and Transformer-Free Framework for Cross-Domain Remote Sensing Visual Grounding
by: Tang, Xiaoyu, et al.
Published: (2026)
by: Tang, Xiaoyu, et al.
Published: (2026)
Mema: Memory-Augmented Adapter for Enhanced Vision-Language Understanding
by: Liu, Ying, et al.
Published: (2026)
by: Liu, Ying, et al.
Published: (2026)
PEGAsus: 3D Personalization of Geometry and Appearance
by: Hu, Jingyu, et al.
Published: (2026)
by: Hu, Jingyu, et al.
Published: (2026)
VAP-Diffusion: Enriching Descriptions with MLLMs for Enhanced Medical Image Generation
by: Huang, Peng, et al.
Published: (2025)
by: Huang, Peng, et al.
Published: (2025)
DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
by: Nam, Jisu, et al.
Published: (2024)
by: Nam, Jisu, et al.
Published: (2024)
3D-UIR: 3D Gaussian for Underwater 3D Scene Reconstruction via Physics Based Appearance-Medium Decoupling
by: Yuan, Jieyu, et al.
Published: (2025)
by: Yuan, Jieyu, et al.
Published: (2025)
Decoupled DMD: CFG Augmentation as the Spear, Distribution Matching as the Shield
by: Liu, Dongyang, et al.
Published: (2025)
by: Liu, Dongyang, et al.
Published: (2025)
V-Warper: Appearance-Consistent Video Diffusion Personalization via Value Warping
by: Lee, Hyunkoo, et al.
Published: (2025)
by: Lee, Hyunkoo, et al.
Published: (2025)
Identity as Presence: Towards Appearance and Voice Personalized Joint Audio-Video Generation
by: Chen, Yingjie, et al.
Published: (2026)
by: Chen, Yingjie, et al.
Published: (2026)
Infinite-ID: Identity-preserved Personalization via ID-semantics Decoupling Paradigm
by: Wu, Yi, et al.
Published: (2024)
by: Wu, Yi, et al.
Published: (2024)
In-Context Brush: Zero-shot Customized Subject Insertion with Context-Aware Latent Space Manipulation
by: Xu, Yu, et al.
Published: (2025)
by: Xu, Yu, et al.
Published: (2025)
Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning
by: Chen, Chubin, et al.
Published: (2025)
by: Chen, Chubin, et al.
Published: (2025)
Parameter-Efficient Semantic Augmentation for Enhancing Open-Vocabulary Object Detection
by: Cao, Weihao, et al.
Published: (2026)
by: Cao, Weihao, et al.
Published: (2026)
Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework
by: Huang, Ziyao, et al.
Published: (2024)
by: Huang, Ziyao, et al.
Published: (2024)
Personal Visual Context Learning in Large Multimodal Models
by: Xue, Zihui, et al.
Published: (2026)
by: Xue, Zihui, et al.
Published: (2026)
Improving Adversarial Robustness via Decoupled Visual Representation Masking
by: Liu, Decheng, et al.
Published: (2024)
by: Liu, Decheng, et al.
Published: (2024)
Adjustable Visual Appearance for Generalizable Novel View Synthesis
by: Bengtson, Josef, et al.
Published: (2023)
by: Bengtson, Josef, et al.
Published: (2023)
Beyond Appearance: Transformer-based Person Identification from Conversational Dynamics
by: Chapariniya, Masoumeh, et al.
Published: (2025)
by: Chapariniya, Masoumeh, et al.
Published: (2025)
One4D: Unified 4D Generation and Reconstruction via Decoupled LoRA Control
by: Mi, Zhenxing, et al.
Published: (2025)
by: Mi, Zhenxing, et al.
Published: (2025)
AnchorCrafter: Animate Cyber-Anchors Selling Your Products via Human-Object Interacting Video Generation
by: Xu, Ziyi, et al.
Published: (2024)
by: Xu, Ziyi, et al.
Published: (2024)
EAGLE: Episodic Appearance- and Geometry-aware Memory for Unified 2D-3D Visual Query Localization in Egocentric Vision
by: Cao, Yifei, et al.
Published: (2025)
by: Cao, Yifei, et al.
Published: (2025)
Adversarial Appearance Learning in Augmented Cityscapes for Pedestrian Recognition in Autonomous Driving
by: Savkin, Artem, et al.
Published: (2025)
by: Savkin, Artem, et al.
Published: (2025)
YOLO-Ant: A Lightweight Detector via Depthwise Separable Convolutional and Large Kernel Design for Antenna Interference Source Detection
by: Tang, Xiaoyu, et al.
Published: (2024)
by: Tang, Xiaoyu, et al.
Published: (2024)
FROMAT: Multiview Material Appearance Transfer via Few-Shot Self-Attention Adaptation
by: Kompanowski, Hubert, et al.
Published: (2025)
by: Kompanowski, Hubert, et al.
Published: (2025)
VLA-IAP: Training-Free Visual Token Pruning via Interaction Alignment for Vision-Language-Action Models
by: Cheng, Jintao, et al.
Published: (2026)
by: Cheng, Jintao, et al.
Published: (2026)
Harnessing Weak Pair Uncertainty for Text-based Person Search
by: Sun, Jintao, et al.
Published: (2026)
by: Sun, Jintao, et al.
Published: (2026)
SparseVILA: Decoupling Visual Sparsity for Efficient VLM Inference
by: Khaki, Samir, et al.
Published: (2025)
by: Khaki, Samir, et al.
Published: (2025)
Joint Geometry-Appearance Human Reconstruction in a Unified Latent Space via Bridge Diffusion
by: Tang, Yingzhi, et al.
Published: (2026)
by: Tang, Yingzhi, et al.
Published: (2026)
Diverse Semantics-Guided Feature Alignment and Decoupling for Visible-Infrared Person Re-Identification
by: Dong, Neng, et al.
Published: (2025)
by: Dong, Neng, et al.
Published: (2025)
Spatiotemporal Decoupling for Efficient Vision-Based Occupancy Forecasting
by: Xu, Jingyi, et al.
Published: (2024)
by: Xu, Jingyi, et al.
Published: (2024)
Detail Reinforcement Diffusion Model: Augmentation Fine-Grained Visual Categorization in Few-Shot Conditions
by: Wu, Tianxu, et al.
Published: (2023)
by: Wu, Tianxu, et al.
Published: (2023)
Integrating Language-Derived Appearance Elements with Visual Cues in Pedestrian Detection
by: Park, Sungjune, et al.
Published: (2023)
by: Park, Sungjune, et al.
Published: (2023)
Similar Items
-
Visual-Friendly Concept Protection via Selective Adversarial Perturbations
by: Mi, Xiaoyue, et al.
Published: (2024) -
Interactive Visual Assessment for Text-to-Image Generation Models
by: Mi, Xiaoyue, et al.
Published: (2024) -
Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
by: Lin, Jiaqi, et al.
Published: (2025) -
Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation
by: Mi, Xiaoyue, et al.
Published: (2023) -
ShoeModel: Learning to Wear on the User-specified Shoes via Diffusion Model
by: Chen, Binghui, et al.
Published: (2024)