Saved in:
| Main Authors: | Feng, Yongchao, Liu, Yajie, Yang, Shuai, Cai, Wenrui, Zhang, Jinqing, Zhan, Qiqi, Huang, Ziyue, Yan, Hongxi, Wan, Qiao, Liu, Chenguang, Wang, Junzhe, Lv, Jiahui, Liu, Ziqi, Shi, Tengyuan, Liu, Qingjie, Wang, Yunhong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.09480 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
by: Huang, Ziyue, et al.
Published: (2025)
by: Huang, Ziyue, et al.
Published: (2025)
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
by: Huang, Ziyue, et al.
Published: (2024)
by: Huang, Ziyue, et al.
Published: (2024)
PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection
by: Liu, Chenguang, et al.
Published: (2025)
by: Liu, Chenguang, et al.
Published: (2025)
Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality
by: Cai, Wenrui, et al.
Published: (2026)
by: Cai, Wenrui, et al.
Published: (2026)
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
by: Huang, Ziyue, et al.
Published: (2025)
by: Huang, Ziyue, et al.
Published: (2025)
HIPTrack: Visual Tracking with Historical Prompts
by: Cai, Wenrui, et al.
Published: (2023)
by: Cai, Wenrui, et al.
Published: (2023)
SPMTrack: Spatio-Temporal Parameter-Efficient Fine-Tuning with Mixture of Experts for Scalable Visual Tracking
by: Cai, Wenrui, et al.
Published: (2025)
by: Cai, Wenrui, et al.
Published: (2025)
Lightweight Spatial Embedding for Vision-based 3D Occupancy Prediction
by: Zhang, Jinqing, et al.
Published: (2024)
by: Zhang, Jinqing, et al.
Published: (2024)
YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images
by: Liu, Chenguang, et al.
Published: (2024)
by: Liu, Chenguang, et al.
Published: (2024)
EntroCut: Entropy-Guided Adaptive Truncation for Efficient Chain-of-Thought Reasoning in Small-scale Large Reasoning Models
by: Yan, Hongxi, et al.
Published: (2026)
by: Yan, Hongxi, et al.
Published: (2026)
AttriPrompt: Dynamic Prompt Composition Learning for CLIP
by: Zhan, Qiqi, et al.
Published: (2025)
by: Zhan, Qiqi, et al.
Published: (2025)
Incremental Object Detection with CLIP
by: Huang, Ziyue, et al.
Published: (2023)
by: Huang, Ziyue, et al.
Published: (2023)
Beyond Open Vocabulary: Multimodal Prompting for Object Detection in Remote Sensing Images
by: Yang, Shuai, et al.
Published: (2026)
by: Yang, Shuai, et al.
Published: (2026)
SkeletonX: Data-Efficient Skeleton-based Action Recognition via Cross-sample Feature Aggregation
by: Zhang, Zongye, et al.
Published: (2025)
by: Zhang, Zongye, et al.
Published: (2025)
DSD-DA: Distillation-based Source Debiasing for Domain Adaptive Object Detection
by: Feng, Yongchao, et al.
Published: (2023)
by: Feng, Yongchao, et al.
Published: (2023)
GeoBEV: Learning Geometric BEV Representation for Multi-view 3D Object Detection
by: Zhang, Jinqing, et al.
Published: (2024)
by: Zhang, Jinqing, et al.
Published: (2024)
De-Simplifying Pseudo Labels to Enhancing Domain Adaptive Object Detection
by: Fu, Zehua, et al.
Published: (2025)
by: Fu, Zehua, et al.
Published: (2025)
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection
by: Jiang, Zheng, et al.
Published: (2024)
by: Jiang, Zheng, et al.
Published: (2024)
Semantic Enhanced Few-shot Object Detection
by: Wang, Zheng, et al.
Published: (2024)
by: Wang, Zheng, et al.
Published: (2024)
ResWorld: Temporal Residual World Model for End-to-End Autonomous Driving
by: Zhang, Jinqing, et al.
Published: (2026)
by: Zhang, Jinqing, et al.
Published: (2026)
Generic Knowledge Boosted Pre-training For Remote Sensing Images
by: Huang, Ziyue, et al.
Published: (2024)
by: Huang, Ziyue, et al.
Published: (2024)
Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM
by: Liu, Yibai, et al.
Published: (2025)
by: Liu, Yibai, et al.
Published: (2025)
CtxMIM: Context-Enhanced Masked Image Modeling for Remote Sensing Image Understanding
by: Zhang, Mingming, et al.
Published: (2023)
by: Zhang, Mingming, et al.
Published: (2023)
HiT: Building Mapping with Hierarchical Transformers
by: Zhang, Mingming, et al.
Published: (2023)
by: Zhang, Mingming, et al.
Published: (2023)
A Survey on Data Synthesis and Augmentation for Large Language Models
by: Wang, Ke, et al.
Published: (2024)
by: Wang, Ke, et al.
Published: (2024)
Context-Enhanced Detector For Building Detection From Remote Sensing Images
by: Huang, Ziyue, et al.
Published: (2023)
by: Huang, Ziyue, et al.
Published: (2023)
Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision
by: Liu, Yajie, et al.
Published: (2024)
by: Liu, Yajie, et al.
Published: (2024)
Towards Robust and Controllable Text-to-Motion via Masked Autoregressive Diffusion
by: Zhang, Zongye, et al.
Published: (2025)
by: Zhang, Zongye, et al.
Published: (2025)
SeeDNorm: Self-Rescaled Dynamic Normalization
by: Cai, Wenrui, et al.
Published: (2025)
by: Cai, Wenrui, et al.
Published: (2025)
ONER: Online Experience Replay for Incremental Anomaly Detection
by: Jin, Yizhou, et al.
Published: (2024)
by: Jin, Yizhou, et al.
Published: (2024)
Reasoning-Driven Anomaly Detection and Localization with Image-Level Supervision
by: Jin, Yizhou, et al.
Published: (2026)
by: Jin, Yizhou, et al.
Published: (2026)
TimeGMM: Single-Pass Probabilistic Forecasting via Adaptive Gaussian Mixture Models with Reversible Normalization
by: Liu, Lei, et al.
Published: (2026)
by: Liu, Lei, et al.
Published: (2026)
Diffusion Trajectory-guided Policy for Long-horizon Robot Manipulation
by: Fan, Shichao, et al.
Published: (2025)
by: Fan, Shichao, et al.
Published: (2025)
LIBERO-X: Robustness Litmus for Vision-Language-Action Models
by: Wang, Guodong, et al.
Published: (2026)
by: Wang, Guodong, et al.
Published: (2026)
ActiveDC: Distribution Calibration for Active Finetuning
by: Xu, Wenshuai, et al.
Published: (2023)
by: Xu, Wenshuai, et al.
Published: (2023)
ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations
by: Lei, Yiming, et al.
Published: (2025)
by: Lei, Yiming, et al.
Published: (2025)
SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding
by: Zhang, Chenkai, et al.
Published: (2025)
by: Zhang, Chenkai, et al.
Published: (2025)
GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art
by: Lei, Yiming, et al.
Published: (2025)
by: Lei, Yiming, et al.
Published: (2025)
On the pancyclicity of $2$-connected $[5,3]$-graphs
by: Liu, Feng, et al.
Published: (2025)
by: Liu, Feng, et al.
Published: (2025)
Phys-Diff: A Physics-Inspired Latent Diffusion Model for Tropical Cyclone Forecasting
by: Liu, Lei, et al.
Published: (2026)
by: Liu, Lei, et al.
Published: (2026)
Similar Items
-
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images
by: Huang, Ziyue, et al.
Published: (2025) -
MutDet: Mutually Optimizing Pre-training for Remote Sensing Object Detection
by: Huang, Ziyue, et al.
Published: (2024) -
PACF: Prototype Augmented Compact Features for Improving Domain Adaptive Object Detection
by: Liu, Chenguang, et al.
Published: (2025) -
Uni-MDTrack: Learning Decoupled Memory and Dynamic States for Parameter-Efficient Visual Tracking in All Modality
by: Cai, Wenrui, et al.
Published: (2026) -
A Survey on Remote Sensing Foundation Models: From Vision to Multimodality
by: Huang, Ziyue, et al.
Published: (2025)