Saved in:
| Main Authors: | Wan, Cong, Guo, Zeyu, Li, Jiangyang, Dong, SongLin, Bai, Yifan, Peng, Lin, Ma, Zhiheng, Gong, Yihong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.00461 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Trajectory-Diversity-Driven Robust Vision-and-Language Navigation
by: Li, Jiangyang, et al.
Published: (2026)
by: Li, Jiangyang, et al.
Published: (2026)
Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restoration
by: He, Haisen, et al.
Published: (2026)
by: He, Haisen, et al.
Published: (2026)
VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection
by: Wang, Qiang, et al.
Published: (2025)
by: Wang, Qiang, et al.
Published: (2025)
DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype
by: Wang, Qiang, et al.
Published: (2025)
by: Wang, Qiang, et al.
Published: (2025)
P2L-CA: An Effective Parameter Tuning Framework for Rehearsal-Free Multi-Label Class-Incremental Learning
by: Dong, Songlin, et al.
Published: (2026)
by: Dong, Songlin, et al.
Published: (2026)
Unleashing the Potential of All Test Samples: Mean-Shift Guided Test-Time Adaptation
by: Han, Jizhou, et al.
Published: (2025)
by: Han, Jizhou, et al.
Published: (2025)
Learning Like Humans: Analogical Concept Learning for Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)
by: Han, Jizhou, et al.
Published: (2026)
Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs
by: Zhao, Jianchao, et al.
Published: (2026)
by: Zhao, Jianchao, et al.
Published: (2026)
ProSR: Process-Shaped Spatial Reasoning for Reliable Chain-of-Thought in VLMs
by: Li, Jiangyang, et al.
Published: (2026)
by: Li, Jiangyang, et al.
Published: (2026)
GOAL: Geometrically Optimal Alignment for Continual Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2026)
by: Han, Jizhou, et al.
Published: (2026)
Consistent Supervised-Unsupervised Alignment for Generalized Category Discovery
by: Han, Jizhou, et al.
Published: (2025)
by: Han, Jizhou, et al.
Published: (2025)
Beyond World-Frame Action Heads: Motion-Centric Action Frames for Vision-Language-Action Models
by: Yang, Huoren, et al.
Published: (2026)
by: Yang, Huoren, et al.
Published: (2026)
Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models
by: Dong, Songlin, et al.
Published: (2025)
by: Dong, Songlin, et al.
Published: (2025)
Prompt-Agnostic Adversarial Perturbation for Customized Diffusion Models
by: Wan, Cong, et al.
Published: (2024)
by: Wan, Cong, et al.
Published: (2024)
ReMoMask: Retrieval-Augmented Masked Motion Generation
by: Li, Zhengdao, et al.
Published: (2025)
by: Li, Zhengdao, et al.
Published: (2025)
Few-shot Online Anomaly Detection and Segmentation
by: Wei, Shenxing, et al.
Published: (2024)
by: Wei, Shenxing, et al.
Published: (2024)
Shared & Domain Self-Adaptive Experts with Frequency-Aware Discrimination for Continual Test-Time Adaptation
by: Zhao, JianChao, et al.
Published: (2025)
by: Zhao, JianChao, et al.
Published: (2025)
ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe
by: Bai, Yifan, et al.
Published: (2023)
by: Bai, Yifan, et al.
Published: (2023)
MoRL: Reinforced Reasoning for Unified Motion Understanding and Generation
by: Wang, Hongpeng, et al.
Published: (2026)
by: Wang, Hongpeng, et al.
Published: (2026)
Curriculum Dataset Distillation
by: Ma, Zhiheng, et al.
Published: (2024)
by: Ma, Zhiheng, et al.
Published: (2024)
Grid: Omni Visual Generation
by: Wan, Cong, et al.
Published: (2024)
by: Wan, Cong, et al.
Published: (2024)
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization
by: Bai, Detao, et al.
Published: (2025)
by: Bai, Detao, et al.
Published: (2025)
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second
by: Lin, Chenguo, et al.
Published: (2025)
by: Lin, Chenguo, et al.
Published: (2025)
MSP-ReID: Hairstyle-Robust Cloth-Changing Person Re-Identification
by: He, Xiangyang, et al.
Published: (2026)
by: He, Xiangyang, et al.
Published: (2026)
Projecting Points to Axes: Oriented Object Detection via Point-Axis Representation
by: Zhao, Zeyang, et al.
Published: (2024)
by: Zhao, Zeyang, et al.
Published: (2024)
ConMo: Controllable Motion Disentanglement and Recomposition for Zero-Shot Motion Transfer
by: Gao, Jiayi, et al.
Published: (2025)
by: Gao, Jiayi, et al.
Published: (2025)
IOTA: Corrective Knowledge-Guided Prompt Learning via Black-White Box Framework
by: Wang, Shaokun, et al.
Published: (2026)
by: Wang, Shaokun, et al.
Published: (2026)
MoTiC: Momentum Tightness and Contrast for Few-Shot Class-Incremental Learning
by: He, Zeyu, et al.
Published: (2025)
by: He, Zeyu, et al.
Published: (2025)
Prototypical Contrastive Learning-based CLIP Fine-tuning for Object Re-identification
by: Li, Jiachen, et al.
Published: (2023)
by: Li, Jiachen, et al.
Published: (2023)
H-MoRe: Learning Human-centric Motion Representation for Action Analysis
by: Huang, Zhanbo, et al.
Published: (2025)
by: Huang, Zhanbo, et al.
Published: (2025)
SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance
by: Cong, Peishan, et al.
Published: (2025)
by: Cong, Peishan, et al.
Published: (2025)
MoReFun: Past-Movement Guided Motion Representation Learning for Future Motion Prediction and Understanding
by: Shi, Junyu, et al.
Published: (2024)
by: Shi, Junyu, et al.
Published: (2024)
MoReGen: Multi-Agent Motion-Reasoning Engine for Code-based Text-to-Video Synthesis
by: Bai, Xiangyu, et al.
Published: (2025)
by: Bai, Xiangyu, et al.
Published: (2025)
Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models
by: Liu, Haoyun, et al.
Published: (2026)
by: Liu, Haoyun, et al.
Published: (2026)
SafeMo: Linguistically Grounded Unlearning for Trustworthy Text-to-Motion Generation
by: Wang, Yiling, et al.
Published: (2026)
by: Wang, Yiling, et al.
Published: (2026)
Fine-Grained Spatiotemporal Motion Alignment for Contrastive Video Representation Learning
by: Zhu, Minghao, et al.
Published: (2023)
by: Zhu, Minghao, et al.
Published: (2023)
SiCL: Silhouette-Driven Contrastive Learning for Unsupervised Person Re-Identification with Clothes Change
by: Li, Mingkun, et al.
Published: (2023)
by: Li, Mingkun, et al.
Published: (2023)
Re$^2$MoGen: Open-Vocabulary Motion Generation via LLM Reasoning and Physics-Aware Refinement
by: Zheng, Jiakun, et al.
Published: (2026)
by: Zheng, Jiakun, et al.
Published: (2026)
Beyond Prompt Learning: Continual Adapter for Efficient Rehearsal-Free Continual Learning
by: Gao, Xinyuan, et al.
Published: (2024)
by: Gao, Xinyuan, et al.
Published: (2024)
Gramformer: Learning Crowd Counting via Graph-Modulated Transformer
by: Lin, Hui, et al.
Published: (2024)
by: Lin, Hui, et al.
Published: (2024)
Similar Items
-
Trajectory-Diversity-Driven Robust Vision-and-Language Navigation
by: Li, Jiangyang, et al.
Published: (2026) -
Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restoration
by: He, Haisen, et al.
Published: (2026) -
VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection
by: Wang, Qiang, et al.
Published: (2025) -
DualCP: Rehearsal-Free Domain-Incremental Learning via Dual-Level Concept Prototype
by: Wang, Qiang, et al.
Published: (2025) -
P2L-CA: An Effective Parameter Tuning Framework for Rehearsal-Free Multi-Label Class-Incremental Learning
by: Dong, Songlin, et al.
Published: (2026)