:: Library Catalog

Bibliographic Details
Main Authors:	Zhang, Qilong; Sun, Youheng; Zhang, Chaoning; Li, Chaoqun; Wang, Xuanhan; Song, Jingkuan; Gao, Lianli
Format:	Preprint
Published:	2022
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2203.04607

Similar Items

Any Target Can be Offense: Adversarial Example Generation via Generalized Latent Infection
by: Sun, Youheng, et al.
Published: (2024)

Learning Generalizable and Efficient Image Watermarking via Hierarchical Two-Stage Optimization
by: Liu, Ke, et al.
Published: (2025)

Scale-Aware Pre-Training for Human-Centric Visual Perception: Enabling Lightweight and Generalizable Models
by: Wang, Xuanhan, et al.
Published: (2025)

Dynamic Pattern Alignment Learning for Pretraining Lightweight Human-Centric Vision Models
by: Wang, Xuanhan, et al.
Published: (2025)

Pseudo-Label Refinement for Robust Wheat Head Segmentation via Two-Stage Hybrid Training
by: Jiang, Jiahao, et al.
Published: (2025)

Black-box Targeted Adversarial Attack on Segment Anything (SAM)
by: Zheng, Sheng, et al.
Published: (2023)

F3-Pruning: A Training-Free and Generalized Pruning Strategy towards Faster and Finer Text-to-Video Synthesis
by: Su, Sitong, et al.
Published: (2023)

TIMI: Training-Free Image-to-3D Multi-Instance Generation with Spatial Fidelity
by: Cai, Xiao, et al.
Published: (2026)

Training-Free Semantic Video Composition via Pre-trained Diffusion Model
by: Guo, Jiaqi, et al.
Published: (2024)

Reversible Inversion for Training-Free Exemplar-guided Image Editing
by: Li, Yuke, et al.
Published: (2025)

Debiased Orthogonal Boundary-Driven Efficient Noise Mitigation
by: Li, Hao, et al.
Published: (2024)

DePT: Decoupled Prompt Tuning
by: Zhang, Ji, et al.
Published: (2023)

Towards Generalized and Training-Free Text-Guided Semantic Manipulation
by: Hong, Yu, et al.
Published: (2025)

Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach
by: Yin, Xiaoran, et al.
Published: (2025)

Prototype-based Aleatoric Uncertainty Quantification for Cross-modal Retrieval
by: Li, Hao, et al.
Published: (2023)

Reliable Few-shot Learning under Dual Noises
by: Zhang, Ji, et al.
Published: (2025)

Benchmarking Few-shot Transferability of Pre-trained Models with Improved Evaluation Protocols
by: Luo, Xu, et al.
Published: (2026)

AICL: Action In-Context Learning for Video Diffusion Model
by: Liu, Jianzhi, et al.
Published: (2024)

Alleviating Hallucinations in Large Vision-Language Models through Hallucination-Induced Optimization
by: Lyu, Xinyu, et al.
Published: (2024)

Attention Hijackers: Detect and Disentangle Attention Hijacking in LVLMs for Hallucination Mitigation
by: Chen, Beitao, et al.
Published: (2025)

CFReID: Continual Few-shot Person Re-Identification
by: Ni, Hao, et al.
Published: (2025)

Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves
by: Wu, Shihan, et al.
Published: (2024)

A Closer Look at Conditional Prompt Tuning for Vision-Language Models
by: Zhang, Ji, et al.
Published: (2025)

MiVLA: Towards Generalizable Vision-Language-Action Model with Human-Robot Mutual Imitation Pre-training
by: Yin, Zhenhan, et al.
Published: (2025)

From Channel Bias to Feature Redundancy: Uncovering the "Less is More" Principle in Few-Shot Learning
by: Zhang, Ji, et al.
Published: (2023)

Text-Video Retrieval with Global-Local Semantic Consistent Learning
by: Zhang, Haonan, et al.
Published: (2024)

CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model
by: Chen, Cheng, et al.
Published: (2024)

FlexAC: Towards Flexible Control of Associative Reasoning in Multimodal Large Language Models
by: Yuan, Shengming, et al.
Published: (2025)

SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism
by: Chen, Beitao, et al.
Published: (2025)

SeMv-3D: Towards Concurrency of Semantic and Multi-view Consistency in General Text-to-3D Generation
by: Cai, Xiao, et al.
Published: (2024)

Towards Understanding Dual BN In Hybrid Adversarial Training
by: Zhang, Chenshuang, et al.
Published: (2024)

Structure-aware Prompt Adaptation from Seen to Unseen for Open-Vocabulary Compositional Zero-Shot Learning
by: Duan, Yihang, et al.
Published: (2026)

From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion
by: Chen, Cheng, et al.
Published: (2026)

Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation
by: Xing, Youguang, et al.
Published: (2025)

ALF: Adaptive Label Finetuning for Scene Graph Generation
by: Chen, Qishen, et al.
Published: (2023)

ProS: Prompting-to-simulate Generalized knowledge for Universal Cross-Domain Retrieval
by: Fang, Kaipeng, et al.
Published: (2023)

GT23D-Bench: A Comprehensive General Text-to-3D Generation Benchmark
by: Cai, Xiao, et al.
Published: (2024)

SAM Meets UAP: Attacking Segment Anything Model With Universal Adversarial Perturbation
by: Han, Dongshen, et al.
Published: (2023)

Black-box Adversarial Attacks Against Image Quality Assessment Models
by: Ran, Yu, et al.
Published: (2024)

Distillation-Enhanced Physical Adversarial Attacks
by: Liu, Wei, et al.
Published: (2025)