| Main Authors: | Zhang, Qilong; Sun, Youheng; Zhang, Chaoning; Li, Chaoqun; Wang, Xuanhan; Song, Jingkuan; Gao, Lianli |
|---|---|
| Format: | Preprint |
| Published: | 2022 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2203.04607 |
Similar Items
Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection
by: Sun, Youheng, et al.
Published: (2024)
Learning Generalizable and Efficient Image Watermarking via Hierarchical Two-Stage Optimization
by: Liu, Ke, et al.
Published: (2025)
Scale-Aware Pre-Training for Human-Centric Visual Perception: Enabling Lightweight and Generalizable Models
by: Wang, Xuanhan, et al.
Published: (2025)
Dynamic Pattern Alignment Learning for Pretraining Lightweight Human-Centric Vision Models
by: Wang, Xuanhan, et al.
Published: (2025)
Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training
by: Jiang, Jiahao, et al.
Published: (2025)
Black-box Targeted Adversarial Attack on Segment Anything (SAM)
by: Zheng, Sheng, et al.
Published: (2023)
F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis
by: Su, Sitong, et al.
Published: (2023)
TIMI: Training-Free Image-to-3D Multi-Instance Generation with Spatial Fidelity
by: Cai, Xiao, et al.
Published: (2026)
Training-Free Semantic Video Composition via Pre-trained Diffusion Model
by: Guo, Jiaqi, et al.
Published: (2024)
Reversible Inversion for Training-Free Exemplar-guided Image Editing
by: Li, Yuke, et al.
Published: (2025)
Debiased Orthogonal Boundary-Driven Efficient Noise Mitigation
by: Li, Hao, et al.
Published: (2024)
DePT: Decoupled Prompt Tuning
by: Zhang, Ji, et al.
Published: (2023)
Towards Generalized and Training-Free Text-Guided Semantic Manipulation
by: Hong, Yu, et al.
Published: (2025)
Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach
by: Yin, Xiaoran, et al.
Published: (2025)
Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval
by: Li, Hao, et al.
Published: (2023)
Reliable Few-shot Learning under Dual Noises
by: Zhang, Ji, et al.
Published: (2025)
Benchmarking Few-shot Transferability of Pre-trained Models with Improved Evaluation Protocols
by: Luo, Xu, et al.
Published: (2026)
AICL: Action In-Context Learning for Video Diffusion Model
by: Liu, Jianzhi, et al.
Published: (2024)
Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization
by: Lyu, Xinyu, et al.
Published: (2024)
Attention Hijackers: Detect and Disentangle Attention Hijacking in LVLMs for Hallucination Mitigation
by: Chen, Beitao, et al.
Published: (2025)
CFReID: Continual Few-shot Person Re-Identification
by: Ni, Hao, et al.
Published: (2025)
Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves
by: Wu, Shihan, et al.
Published: (2024)
A Closer Look at Conditional Prompt Tuning for Vision-Language Models
by: Zhang, Ji, et al.
Published: (2025)
MiVLA: Towards Generalizable Vision-Language-Action Model with Human-Robot Mutual Imitation Pre-training
by: Yin, Zhenhan, et al.
Published: (2025)
From Channel Bias to Feature Redundancy: Uncovering the "Less is More" Principle in Few-Shot Learning
by: Zhang, Ji, et al.
Published: (2023)
Text-Video Retrieval with Global-Local Semantic Consistent Learning
by: Zhang, Haonan, et al.
Published: (2024)
CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
by: Chen, Cheng, et al.
Published: (2024)
FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models
by: Yuan, Shengming, et al.
Published: (2025)
SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism
by: Chen, Beitao, et al.
Published: (2025)
SeMv-3D: Towards Concurrency of Semantic and Multi-view Consistency in General Text-to-3D Generation
by: Cai, Xiao, et al.
Published: (2024)
Towards Understanding Dual BN In Hybrid Adversarial Training
by: Zhang, Chenshuang, et al.
Published: (2024)
Structure-aware Prompt Adaptation from Seen to Unseen for Open-Vocabulary Compositional Zero-Shot Learning
by: Duan, Yihang, et al.
Published: (2026)
From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion
by: Chen, Cheng, et al.
Published: (2026)
Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation
by: Xing, Youguang, et al.
Published: (2025)
ALF: Adaptive Label Finetuning for Scene Graph Generation
by: Chen, Qishen, et al.
Published: (2023)
ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval
by: Fang, Kaipeng, et al.
Published: (2023)
GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark
by: Cai, Xiao, et al.
Published: (2024)
SAM Meets UAP: Attacking Segment Anything Model With Universal Adversarial Perturbation
by: Han, Dongshen, et al.
Published: (2023)
Black-box Adversarial Attacks Against Image Quality Assessment Models
by: Ran, Yu, et al.
Published: (2024)
Distillation-Enhanced Physical Adversarial Attacks
by: Liu, Wei, et al.
Published: (2025)