Saved in:
| Main Authors: | Zhang, Jinyue, Zhang, Xiangrong, Huang, Zhongjian, Zhang, Tianyang, Jiang, Yifei, Jiao, Licheng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.10278 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing
by: Komurcu, Kursat, et al.
Published: (2026)
by: Komurcu, Kursat, et al.
Published: (2026)
Diffusion Features for Zero-Shot 6DoF Object Pose Estimation
by: Von Gimborn, Bernd, et al.
Published: (2024)
by: Von Gimborn, Bernd, et al.
Published: (2024)
BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection
by: Zhang, Jian, et al.
Published: (2024)
by: Zhang, Jian, et al.
Published: (2024)
Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset
by: Zinnen, Mathias, et al.
Published: (2025)
by: Zinnen, Mathias, et al.
Published: (2025)
Architectural Insights into Knowledge Distillation for Object Detection: A Comprehensive Review
by: Golizadeh, Mahdi, et al.
Published: (2025)
by: Golizadeh, Mahdi, et al.
Published: (2025)
Neighborhood Feature Pooling for Remote Sensing Image Classification
by: Nia, Fahimeh Orvati, et al.
Published: (2025)
by: Nia, Fahimeh Orvati, et al.
Published: (2025)
Explaining What Machines See: XAI Strategies in Deep Object Detection Models
by: Seyedmomeni, FatemehSadat, et al.
Published: (2025)
by: Seyedmomeni, FatemehSadat, et al.
Published: (2025)
Feature Based Methods in Domain Adaptation for Object Detection: A Review Paper
by: Mohamadi, Helia, et al.
Published: (2024)
by: Mohamadi, Helia, et al.
Published: (2024)
Enhancing Small Object Detection with YOLO: A Novel Framework for Improved Accuracy and Efficiency
by: Moghadami, Mahila, et al.
Published: (2025)
by: Moghadami, Mahila, et al.
Published: (2025)
CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)
by: Raoufi, Behnam, et al.
Published: (2025)
Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models
by: Gautam, Sushant, et al.
Published: (2025)
by: Gautam, Sushant, et al.
Published: (2025)
A Novel Compression Framework for YOLOv8: Achieving Real-Time Aerial Object Detection on Edge Devices via Structured Pruning and Channel-Wise Distillation
by: Sabaghian, Melika, et al.
Published: (2025)
by: Sabaghian, Melika, et al.
Published: (2025)
Reference-based Category Discovery: Unsupervised Object Detection with Category Awareness
by: Li, Yichen, et al.
Published: (2026)
by: Li, Yichen, et al.
Published: (2026)
UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes
by: Lentsch, Ted, et al.
Published: (2024)
by: Lentsch, Ted, et al.
Published: (2024)
Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
by: Zhang, Junbin, et al.
Published: (2022)
by: Zhang, Junbin, et al.
Published: (2022)
An Earth Rover dataset recorded at the ICRA@40 party
by: Zhang, Qi, et al.
Published: (2024)
by: Zhang, Qi, et al.
Published: (2024)
Position and Altitude of the Nao Camera Head from Two Points on the Soccer Field plus the Gravitational Direction
by: Oomes, Stijn, et al.
Published: (2024)
by: Oomes, Stijn, et al.
Published: (2024)
Improving Object Detection for Time-Lapse Imagery Using Temporal Features in Wildlife Monitoring
by: Jenkins, Marcus, et al.
Published: (2024)
by: Jenkins, Marcus, et al.
Published: (2024)
Hierarchical Point-Patch Fusion with Adaptive Patch Codebook for 3D Shape Anomaly Detection
by: Kang, Xueyang, et al.
Published: (2026)
by: Kang, Xueyang, et al.
Published: (2026)
GazeD: Context-Aware Diffusion for Accurate 3D Gaze Estimation
by: Catalini, Riccardo, et al.
Published: (2026)
by: Catalini, Riccardo, et al.
Published: (2026)
Decoder Generates Manufacturable Structures: A Framework for 3D-Printable Object Synthesis
by: Kumar, Abhishek
Published: (2026)
by: Kumar, Abhishek
Published: (2026)
Learning through Creation: A Hash-Free Framework for On-the-Fly Category Discovery
by: Zhang, Bohan, et al.
Published: (2026)
by: Zhang, Bohan, et al.
Published: (2026)
METER: Multi-modal Evidence-based Thinking and Explainable Reasoning -- Algorithm and Benchmark
by: Yang, Xu, et al.
Published: (2025)
by: Yang, Xu, et al.
Published: (2025)
From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation
by: Chen, Jingkun, et al.
Published: (2025)
by: Chen, Jingkun, et al.
Published: (2025)
Detecting 3D Line Segments for 6DoF Pose Estimation with Limited Data
by: Mok, Matej, et al.
Published: (2026)
by: Mok, Matej, et al.
Published: (2026)
Pixel-Wise Multimodal Contrastive Learning for Remote Sensing Images
by: Stival, Leandro, et al.
Published: (2026)
by: Stival, Leandro, et al.
Published: (2026)
Archival Faces: Detection of Faces in Digitized Historical Documents
by: Vaško, Marek, et al.
Published: (2025)
by: Vaško, Marek, et al.
Published: (2025)
ReFlow6D: Refraction-Guided Transparent Object 6D Pose Estimation via Intermediate Representation Learning
by: Gupta, Hrishikesh, et al.
Published: (2024)
by: Gupta, Hrishikesh, et al.
Published: (2024)
Bayesian Differentiable Physics for Cloth Digitalization
by: Gong, Deshan, et al.
Published: (2024)
by: Gong, Deshan, et al.
Published: (2024)
CADE 2.5 - ZeResFDG: Frequency-Decoupled, Rescaled and Zero-Projected Guidance for SD/SDXL Latent Diffusion Models
by: Rychkovskiy, Denis
Published: (2025)
by: Rychkovskiy, Denis
Published: (2025)
Automated Defect Detection for Mass-Produced Electronic Components Based on YOLO Object Detection Models
by: Mao, Wei-Lung, et al.
Published: (2025)
by: Mao, Wei-Lung, et al.
Published: (2025)
Comparison of Neural Models for X-ray Image Classification in COVID-19 Detection
by: Togni, Jimi, et al.
Published: (2025)
by: Togni, Jimi, et al.
Published: (2025)
LRCP: Low-Rank Compressibility Guided Visual Token Pruning for Efficient LVLMs
by: Lu, Hongyu, et al.
Published: (2026)
by: Lu, Hongyu, et al.
Published: (2026)
QSilk: Micrograin Stabilization and Adaptive Quantile Clipping for Detail-Friendly Latent Diffusion
by: Rychkovskiy, Denis
Published: (2025)
by: Rychkovskiy, Denis
Published: (2025)
Short-Window Sliding Learning for Real-Time Violence Detection via LLM-based Auto-Labeling
by: Jung, Seoik, et al.
Published: (2025)
by: Jung, Seoik, et al.
Published: (2025)
Real Time Human Detection by Unmanned Aerial Vehicles
by: Guettala, Walid, et al.
Published: (2024)
by: Guettala, Walid, et al.
Published: (2024)
The Impact of Image Resolution on Face Detection: A Comparative Analysis of MTCNN, YOLOv XI and YOLOv XII models
by: Ömercikoğlu, Ahmet Can, et al.
Published: (2025)
by: Ömercikoğlu, Ahmet Can, et al.
Published: (2025)
WildfireVLM: AI-powered Analysis for Early Wildfire Detection and Risk Assessment Using Satellite Imagery
by: Ayanzadeh, Aydin, et al.
Published: (2026)
by: Ayanzadeh, Aydin, et al.
Published: (2026)
OpenFusion++: An Open-vocabulary Real-time Scene Understanding System
by: Jin, Xiaofeng, et al.
Published: (2025)
by: Jin, Xiaofeng, et al.
Published: (2025)
DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation
by: Mohammad, Noor Islam S., et al.
Published: (2025)
by: Mohammad, Noor Islam S., et al.
Published: (2025)
Similar Items
-
Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing
by: Komurcu, Kursat, et al.
Published: (2026) -
Diffusion Features for Zero-Shot 6DoF Object Pose Estimation
by: Von Gimborn, Bernd, et al.
Published: (2024) -
BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection
by: Zhang, Jian, et al.
Published: (2024) -
Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset
by: Zinnen, Mathias, et al.
Published: (2025) -
Architectural Insights into Knowledge Distillation for Object Detection: A Comprehensive Review
by: Golizadeh, Mahdi, et al.
Published: (2025)