:: Library Catalog

Bibliographic Details
Main Authors:	Feng, Yongchao; Liu, Yajie; Yang, Shuai; Cai, Wenrui; Zhang, Jinqing; Zhan, Qiqi; Huang, Ziyue; Yan, Hongxi; Wan, Qiao; Liu, Chenguang; Wang, Junzhe; Lv, Jiahui; Liu, Ziqi; Shi, Tengyuan; Liu, Qingjie; Wang, Yunhong
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2504.09480

Similar Items

OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
by: Huang, Ziyue, et al.
Published: (2025)

MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
by: Huang, Ziyue, et al.
Published: (2024)

PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection
by: Liu, Chenguang, et al.
Published: (2025)

Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality
by: Cai, Wenrui, et al.
Published: (2026)

A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
by: Huang, Ziyue, et al.
Published: (2025)

HIPTrack: Visual Tracking with Historical Prompts
by: Cai, Wenrui, et al.
Published: (2023)

SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
by: Cai, Wenrui, et al.
Published: (2025)

Lightweight Spatial Embedding for Vision-based 3D Occupancy Prediction
by: Zhang, Jinqing, et al.
Published: (2024)

YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images
by: Liu, Chenguang, et al.
Published: (2024)

EntroCut: Entropy-Guided Adaptive Truncation for Efficient Chain-of-Thought Reasoning in Small-scale Large Reasoning Models
by: Yan, Hongxi, et al.
Published: (2026)

AttriPrompt: Dynamic Prompt Composition Learning for CLIP
by: Zhan, Qiqi, et al.
Published: (2025)

Incremental Object Detection with CLIP
by: Huang, Ziyue, et al.
Published: (2023)

Beyond Open Vocabulary: Multimodal Prompting for Object Detection in Remote Sensing Images
by: Yang, Shuai, et al.
Published: (2026)

SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation
by: Zhang, Zongye, et al.
Published: (2025)

DSD-DA: Distillation-based Source Debiasing for Domain Adaptive Object Detection
by: Feng, Yongchao, et al.
Published: (2023)

GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection
by: Zhang, Jinqing, et al.
Published: (2024)

De-Simplifying Pseudo Labels to Enhancing Domain Adaptive Object Detection
by: Fu, Zehua, et al.
Published: (2025)

FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
by: Jiang, Zheng, et al.
Published: (2024)

Semantic Enhanced Few-shot Object Detection
by: Wang, Zheng, et al.
Published: (2024)

ResWorld: Temporal Residual World Model for End-to-End Autonomous Driving
by: Zhang, Jinqing, et al.
Published: (2026)

Generic Knowledge Boosted Pre-training For Remote Sensing Images
by: Huang, Ziyue, et al.
Published: (2024)

Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM
by: Liu, Yibai, et al.
Published: (2025)

CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding
by: Zhang, Mingming, et al.
Published: (2023)

HiT: Building Mapping with Hierarchical Transformers
by: Zhang, Mingming, et al.
Published: (2023)

A Survey on Data Synthesis and Augmentation for Large Language Models
by: Wang, Ke, et al.
Published: (2024)

Context-Enhanced Detector For Building Detection From Remote Sensing Images
by: Huang, Ziyue, et al.
Published: (2023)

Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision
by: Liu, Yajie, et al.
Published: (2024)

Towards Robust and Controllable Text-to-Motion via Masked Autoregressive Diffusion
by: Zhang, Zongye, et al.
Published: (2025)

SeeDNorm: Self-Rescaled Dynamic Normalization
by: Cai, Wenrui, et al.
Published: (2025)

ONER: Online Experience Replay for Incremental Anomaly Detection
by: Jin, Yizhou, et al.
Published: (2024)

Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision
by: Jin, Yizhou, et al.
Published: (2026)

TimeGMM: Single-Pass Probabilistic Forecasting via Adaptive Gaussian Mixture Models with Reversible Normalization
by: Liu, Lei, et al.
Published: (2026)

Diffusion Trajectory-guided Policy for Long-horizon Robot Manipulation
by: Fan, Shichao, et al.
Published: (2025)

LIBERO-X: Robustness Litmus for Vision-Language-Action Models
by: Wang, Guodong, et al.
Published: (2026)

ActiveDC: Distribution Calibration for Active Finetuning
by: Xu, Wenshuai, et al.
Published: (2023)

ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations
by: Lei, Yiming, et al.
Published: (2025)

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
by: Zhang, Chenkai, et al.
Published: (2025)

GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art
by: Lei, Yiming, et al.
Published: (2025)

On the pancyclicity of $2$-connected $[5,3]$-graphs
by: Liu, Feng, et al.
Published: (2025)

Phys-Diff: A Physics-Inspired Latent Diffusion Model for Tropical Cyclone Forecasting
by: Liu, Lei, et al.
Published: (2026)