:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wan, Cong, Guo, Zeyu, Li, Jiangyang, Dong, SongLin, Bai, Yifan, Peng, Lin, Ma, Zhiheng, Gong, Yihong
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2603.00461
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Trajectory-Diversity-Driven Robust Vision-and-Language Navigation
by: Li, Jiangyang, et al.
Published: (2026)

Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restoration
by: He, Haisen, et al.
Published: (2026)

VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection
by: Wang, Qiang, et al.
Published: (2025)

DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype
by: Wang, Qiang, et al.
Published: (2025)

P2L-CA: An Effective Parameter Tuning Framework for Rehearsal-Free Multi-Label Class-Incremental Learning
by: Dong, Songlin, et al.
Published: (2026)

Unleashing the Potential of All Test Samples: Mean-Shift Guided Test-Time Adaptation
by: Han, Jizhou, et al.
Published: (2025)

Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)

Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs
by: Zhao, Jianchao, et al.
Published: (2026)

ProSR: Process-Shaped Spatial Reasoning for Reliable Chain-of-Thought in VLMs
by: Li, Jiangyang, et al.
Published: (2026)

GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)

Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2025)

Beyond World-Frame Action Heads: Motion-Centric Action Frames for Vision-Language-Action Models
by: Yang, Huoren, et al.
Published: (2026)

Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models
by: Dong, Songlin, et al.
Published: (2025)

Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models
by: Wan, Cong, et al.
Published: (2024)

ReMoMask: Retrieval-Augmented Masked Motion Generation
by: Li, Zhengdao, et al.
Published: (2025)

Few-shot Online Anomaly Detection and Segmentation
by: Wei, Shenxing, et al.
Published: (2024)

Shared & Domain Self-Adaptive Experts with Frequency-Aware Discrimination for Continual Test-Time Adaptation
by: Zhao, JianChao, et al.
Published: (2025)

ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe
by: Bai, Yifan, et al.
Published: (2023)

MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation
by: Wang, Hongpeng, et al.
Published: (2026)

Curriculum Dataset Distillation
by: Ma, Zhiheng, et al.
Published: (2024)

Grid: Omni Visual Generation
by: Wan, Cong, et al.
Published: (2024)

CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization
by: Bai, Detao, et al.
Published: (2025)

MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
by: Lin, Chenguo, et al.
Published: (2025)

MSP-ReID: Hairstyle-Robust Cloth-Changing Person Re-Identification
by: He, Xiangyang, et al.
Published: (2026)

Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation
by: Zhao, Zeyang, et al.
Published: (2024)

ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
by: Gao, Jiayi, et al.
Published: (2025)

IOTA: Corrective Knowledge-Guided Prompt Learning via Black-White Box Framework
by: Wang, Shaokun, et al.
Published: (2026)

MoTiC: Momentum Tightness and Contrast for Few-Shot Class-Incremental Learning
by: He, Zeyu, et al.
Published: (2025)

Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification
by: Li, Jiachen, et al.
Published: (2023)

H-MoRe: Learning Human-centric Motion Representation for Action Analysis
by: Huang, Zhanbo, et al.
Published: (2025)

SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance
by: Cong, Peishan, et al.
Published: (2025)

MoReFun: Past-Movement Guided Motion Representation Learning for Future Motion Prediction and Understanding
by: Shi, Junyu, et al.
Published: (2024)

MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis
by: Bai, Xiangyu, et al.
Published: (2025)

Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models
by: Liu, Haoyun, et al.
Published: (2026)

SafeMo: Linguistically Grounded Unlearning for Trustworthy Text-to-Motion Generation
by: Wang, Yiling, et al.
Published: (2026)

Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning
by: Zhu, Minghao, et al.
Published: (2023)

SiCL: Silhouette-Driven Contrastive Learning for Unsupervised Person Re-Identification with Clothes Change
by: Li, Mingkun, et al.
Published: (2023)

Re$^2$MoGen: Open-Vocabulary Motion Generation via LLM Reasoning and Physics-Aware Refinement
by: Zheng, Jiakun, et al.
Published: (2026)

Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning
by: Gao, Xinyuan, et al.
Published: (2024)

Gramformer: Learning Crowd Counting via Graph-Modulated Transformer
by: Lin, Hui, et al.
Published: (2024)