:: Library Catalog

Bibliographic Details
Main Authors:	Flores, Erick Andrew Bustamante; Olivera, Harley Vera; Valencia, Ivan Cesar Medrano; Cubas, Carlos Fernando Montoya
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence; 68T10; I.2.10
Online Access:	https://arxiv.org/abs/2502.00939

Similar Items

Banana Ripeness Level Classification using a Simple CNN Model Trained with Real and Synthetic Datasets
by: Chuquimarca, Luis, et al.
Published: (2025)

Sequence Matters: Harnessing Video Models in 3D Super-Resolution
by: Ko, Hyun-kyu, et al.
Published: (2024)

Learning Association via Track-Detection Matching for Multi-Object Tracking
by: Adžemović, Momir
Published: (2025)

MaP-AVR: A Meta-Action Planner for Agents Leveraging Vision Language Models and Retrieval-Augmented Generation
by: Guo, Zhenglong, et al.
Published: (2025)

Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
by: Li, Jinhao, et al.
Published: (2024)

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation
by: Allakhverdov, Eduard, et al.
Published: (2025)

Image Reconstruction as a Tool for Feature Analysis
by: Allakhverdov, Eduard, et al.
Published: (2025)

Predicting Pedestrian Crossing Behavior in Germany and Japan: Insights into Model Transferability
by: Zhang, Chi, et al.
Published: (2024)

Tricks and Plug-ins for Gradient Boosting in Image Classification
by: Fang, Biyi, et al.
Published: (2025)

Learning Discriminative Spatio-temporal Representations for Semi-supervised Action Recognition
by: Wang, Yu, et al.
Published: (2024)

Meaning over Motion: A Semantic-First Approach to 360° Viewport Prediction
by: Khah, Arman Nik, et al.
Published: (2026)

DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation
by: Mohammad, Noor Islam S., et al.
Published: (2025)

Hierarchical Spatial Algorithms for High-Resolution Image Quantization and Feature Extraction
by: Mohammad, Noor Islam S.
Published: (2025)

Combined Hyperbolic and Euclidean Soft Triple Loss Beyond the Single Space Deep Metric Learning
by: Saeki, Shozo, et al.
Published: (2025)

Learning Sign Language Representation using CNN LSTM, 3DCNN, CNN RNN LSTM and CCN TD
by: Louison, Nikita, et al.
Published: (2024)

Video-STR: Reinforcing MLLMs in Video Spatio-Temporal Reasoning with Relation Graph
by: Wang, Wentao, et al.
Published: (2025)

SurgΣ: A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence
by: Zeng, Zhitao, et al.
Published: (2026)

DOD-SA: Infrared-Visible Decoupled Object Detection with Single-Modality Annotations
by: Jin, Hang, et al.
Published: (2025)

Non-Robust Features are Not Always Useful in One-Class Classification
by: Lau, Matthew, et al.
Published: (2024)

SynthRender and IRIS: Open-Source Framework and Dataset for Bidirectional Sim-Real Transfer in Industrial Object Perception
by: Araya-Martinez, Jose Moises, et al.
Published: (2026)

CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)

Edged USLAM: Edge-Aware Event-Based SLAM with Learning-Based Depth Priors
by: Sarıözkan, Şebnem, et al.
Published: (2026)

Fast 3D point clouds retrieval for Large-scale 3D Place Recognition
by: Zede, Chahine-Nicolas, et al.
Published: (2025)

An M-Health Algorithmic Approach to Identify and Assess Physiotherapy Exercises in Real Time
by: Kandylakis, Stylianos, et al.
Published: (2025)

DeepFusionNet: Autoencoder-Based Low-Light Image Enhancement and Super-Resolution
by: Çalışkan, Halil Hüseyin, et al.
Published: (2025)

EventFlow: Real-Time Neuromorphic Event-Driven Classification of Two-Phase Boiling Flow Regimes
by: Chang, Sanghyeon, et al.
Published: (2025)

Evaluating Visual Mathematics in Multimodal LLMs: A Multilingual Benchmark Based on the Kangaroo Tests
by: Sáez, Arnau Igualde, et al.
Published: (2025)

CADE 2.5 - ZeResFDG: Frequency-Decoupled, Rescaled and Zero-Projected Guidance for SD/SDXL Latent Diffusion Models
by: Rychkovskiy, Denis
Published: (2025)

Sat-JEPA-Diff: Bridging Self-Supervised Learning and Generative Diffusion for Remote Sensing
by: Komurcu, Kursat, et al.
Published: (2026)

A Labeled Array Distance Metric for Measuring Image Segmentation Quality
by: Berijanian, Maryam, et al.
Published: (2024)

MeshPose: Unifying DensePose and 3D Body Mesh reconstruction
by: Lê, Eric-Tuan, et al.
Published: (2024)

A large-scale, physically-based synthetic dataset for satellite pose estimation
by: Velkei, Szabolcs, et al.
Published: (2025)

PathFormer: A Transformer with 3D Grid Constraints for Digital Twin Robot-Arm Trajectory Generation
by: Alanazi, Ahmed, et al.
Published: (2025)

Beyond RGB: Leveraging Vision Transformers for Thermal Weapon Segmentation
by: Kambhatla, Akhila, et al.
Published: (2025)

ShapBPT: Image Feature Attributions Using Data-Aware Binary Partition Trees
by: Rashid, Muhammad, et al.
Published: (2026)

TerraSeg: Self-Supervised Ground Segmentation for Any LiDAR
by: Lentsch, Ted, et al.
Published: (2026)

UNION: Unsupervised 3D Object Detection using Object Appearance-based Pseudo-Classes
by: Lentsch, Ted, et al.
Published: (2024)

Do Generative Metrics Predict YOLO Performance? An Evaluation Across Models, Augmentation Ratios, and Dataset Complexity
by: Marian, Vasile, et al.
Published: (2026)

MB-DSMIL-CL-PL: Scalable Weakly Supervised Ovarian Cancer Subtype Classification and Localisation Using Contrastive and Prototype Learning with Frozen Patch Features
by: Jenkins, Marcus, et al.
Published: (2026)

Explainable AI for Analyzing Person-Specific Patterns in Facial Recognition Tasks
by: Borsukiewicz, Paweł Jakub, et al.
Published: (2025)