Saved in:
| Main Authors: | Lehavi, David, Osserman, Brian |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.13058 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis
by: S, Kamal Basha, et al.
Published: (2024)
by: S, Kamal Basha, et al.
Published: (2024)
ROI-GS: Interest-based Local Quality 3D Gaussian Splatting
by: Bui, Quoc-Anh, et al.
Published: (2025)
by: Bui, Quoc-Anh, et al.
Published: (2025)
ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition
by: Bui, Quoc-Anh, et al.
Published: (2025)
by: Bui, Quoc-Anh, et al.
Published: (2025)
Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset
by: Zinnen, Mathias, et al.
Published: (2025)
by: Zinnen, Mathias, et al.
Published: (2025)
LRCP: Low-Rank Compressibility Guided Visual Token Pruning for Efficient LVLMs
by: Lu, Hongyu, et al.
Published: (2026)
by: Lu, Hongyu, et al.
Published: (2026)
Some computational aspects of spectral sequences in Čech cohomology
by: Zach, Matthias
Published: (2025)
by: Zach, Matthias
Published: (2025)
ViFiCon: Vision and Wireless Association Via Self-Supervised Contrastive Learning
by: Meegan, Nicholas, et al.
Published: (2022)
by: Meegan, Nicholas, et al.
Published: (2022)
TSPE-GS: Probabilistic Depth Extraction for Semi-Transparent Surface Reconstruction via 3D Gaussian Splatting
by: Xu, Zhiyuan, et al.
Published: (2025)
by: Xu, Zhiyuan, et al.
Published: (2025)
Progressive Cross Attention Network for Flood Segmentation using Multispectral Satellite Imagery
by: Feliren, Vicky, et al.
Published: (2025)
by: Feliren, Vicky, et al.
Published: (2025)
Rethinking VLMs for Image Forgery Detection and Localization
by: Guo, Shaofeng, et al.
Published: (2026)
by: Guo, Shaofeng, et al.
Published: (2026)
ALERT-Transformer: Bridging Asynchronous and Synchronous Machine Learning for Real-Time Event-based Spatio-Temporal Data
by: Martin-Turrero, Carmen, et al.
Published: (2024)
by: Martin-Turrero, Carmen, et al.
Published: (2024)
Event-ECC: Asynchronous Tracking of Events with Continuous Optimization
by: Zafeiri, Maria, et al.
Published: (2024)
by: Zafeiri, Maria, et al.
Published: (2024)
BG-YOLO: A Bidirectional-Guided Method for Underwater Object Detection
by: Zhang, Jian, et al.
Published: (2024)
by: Zhang, Jian, et al.
Published: (2024)
VDPP: Video Depth Post-Processing for Speed and Scalability
by: Yoon, Daewon, et al.
Published: (2026)
by: Yoon, Daewon, et al.
Published: (2026)
From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation
by: Chen, Jingkun, et al.
Published: (2025)
by: Chen, Jingkun, et al.
Published: (2025)
DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation
by: Mohammad, Noor Islam S., et al.
Published: (2025)
by: Mohammad, Noor Islam S., et al.
Published: (2025)
Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
by: Mohammad, Noor Islam S.
Published: (2025)
by: Mohammad, Noor Islam S.
Published: (2025)
VLM-NCD:Novel Class Discovery with Vision-Based Large Language Models
by: Su, Yuetong, et al.
Published: (2025)
by: Su, Yuetong, et al.
Published: (2025)
CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)
by: Raoufi, Behnam, et al.
Published: (2025)
Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing
by: Komurcu, Kursat, et al.
Published: (2026)
by: Komurcu, Kursat, et al.
Published: (2026)
Corn Ear Detection and Orientation Estimation Using Deep Learning
by: Sprague, Nathan, et al.
Published: (2024)
by: Sprague, Nathan, et al.
Published: (2024)
Gr-IoU: Ground-Intersection over Union for Robust Multi-Object Tracking with 3D Geometric Constraints
by: Toida, Keisuke, et al.
Published: (2024)
by: Toida, Keisuke, et al.
Published: (2024)
Reconstruction of curves from their theta hyperplanes in genera $6$ and $7$
by: Çelik, Türkü Özlüm, et al.
Published: (2024)
by: Çelik, Türkü Özlüm, et al.
Published: (2024)
OpenFusion++: An Open-vocabulary Real-time Scene Understanding System
by: Jin, Xiaofeng, et al.
Published: (2025)
by: Jin, Xiaofeng, et al.
Published: (2025)
Experimental Evaluation of Road-Crossing Decisions by Autonomous Wheelchairs against Environmental Factors
by: Corradini, Franca, et al.
Published: (2024)
by: Corradini, Franca, et al.
Published: (2024)
Polarization-Based Eye Tracking with Personalized Siamese Architectures
by: Kalkanli, Beyza, et al.
Published: (2026)
by: Kalkanli, Beyza, et al.
Published: (2026)
3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model
by: Ko, Hyun-kyu, et al.
Published: (2026)
by: Ko, Hyun-kyu, et al.
Published: (2026)
Person detection and re-identification in open-world settings of retail stores and public spaces
by: Brkljač, Branko, et al.
Published: (2025)
by: Brkljač, Branko, et al.
Published: (2025)
Do Generative Metrics Predict YOLO Performance? An Evaluation Across Models, Augmentation Ratios, and Dataset Complexity
by: Marian, Vasile, et al.
Published: (2026)
by: Marian, Vasile, et al.
Published: (2026)
Safe Road-Crossing by Autonomous Wheelchairs: a Novel Dataset and its Experimental Evaluation
by: Grigioni, Carlo, et al.
Published: (2024)
by: Grigioni, Carlo, et al.
Published: (2024)
μ-Net: A Deep Learning-Based Architecture for μ-CT Segmentation
by: Bruno, Pierangela, et al.
Published: (2024)
by: Bruno, Pierangela, et al.
Published: (2024)
A large-scale, physically-based synthetic dataset for satellite pose estimation
by: Velkei, Szabolcs, et al.
Published: (2025)
by: Velkei, Szabolcs, et al.
Published: (2025)
Computing $A$-resultants via direct images
by: Groh, Friedemann, et al.
Published: (2026)
by: Groh, Friedemann, et al.
Published: (2026)
ARTPS: Depth-Enhanced Hybrid Anomaly Detection and Learnable Curiosity Score for Autonomous Rover Target Prioritization
by: Baydemir, Poyraz
Published: (2025)
by: Baydemir, Poyraz
Published: (2025)
TRACES: Temporal Recall with Contextual Embeddings for Real-Time Video Anomaly Detection
by: Siddiqui, Yousuf Ahmed, et al.
Published: (2025)
by: Siddiqui, Yousuf Ahmed, et al.
Published: (2025)
Semantic2Graph: Graph-based Multi-modal Feature Fusion for Action Segmentation in Videos
by: Zhang, Junbin, et al.
Published: (2022)
by: Zhang, Junbin, et al.
Published: (2022)
PCRI: Measuring Context Robustness in Multimodal Models for Enterprise Applications
by: Patel, Hitesh Laxmichand, et al.
Published: (2025)
by: Patel, Hitesh Laxmichand, et al.
Published: (2025)
FeudalNav: A Simple Framework for Visual Navigation
by: Johnson, Faith, et al.
Published: (2026)
by: Johnson, Faith, et al.
Published: (2026)
HEDGE: Hallucination Estimation via Dense Geometric Entropy for VQA with Vision-Language Models
by: Gautam, Sushant, et al.
Published: (2025)
by: Gautam, Sushant, et al.
Published: (2025)
Point, Detect, Count: Multi-Task Medical Image Understanding with Instruction-Tuned Vision-Language Models
by: Gautam, Sushant, et al.
Published: (2025)
by: Gautam, Sushant, et al.
Published: (2025)
Similar Items
-
S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image Analysis
by: S, Kamal Basha, et al.
Published: (2024) -
ROI-GS: Interest-based Local Quality 3D Gaussian Splatting
by: Bui, Quoc-Anh, et al.
Published: (2025) -
ROI-NeRFs: Hi-Fi Visualization of Objects of Interest within a Scene by NeRFs Composition
by: Bui, Quoc-Anh, et al.
Published: (2025) -
Smelly, dense, and spreaded: The Object Detection for Olfactory References (ODOR) dataset
by: Zinnen, Mathias, et al.
Published: (2025) -
LRCP: Low-Rank Compressibility Guided Visual Token Pruning for Efficient LVLMs
by: Lu, Hongyu, et al.
Published: (2026)