:: Library Catalog

Bibliographic Details
Main Authors:	Badatya, Bikash Kumar; Baghel, Vipul; Hegde, Ravi
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; I.2.10; I.5.4
Online Access:	https://arxiv.org/abs/2508.19647

Similar Items

Motion-Guided Semantic Alignment with Negative Prompts for Zero-Shot Video Action Recognition
by: Wang, Yiming, et al.
Published: (2026)

Quantized Vision-Language Models for Damage Assessment: A Comparative Study of LLaVA-1.5-7B Quantization Levels
by: Yasuno, Takato
Published: (2026)

Learning Association via Track-Detection Matching for Multi-Object Tracking
by: Adžemović, Momir
Published: (2025)

FAME: Feature Activation Map Explanation on Image Classification and Face Recognition
by: Zhang, Xinyi, et al.
Published: (2026)

Habitat Classification from Ground-Level Imagery Using Deep Neural Networks
by: Shi, Hongrui, et al.
Published: (2025)

FeedbackSTS-Det: Sparse Frames-Based Spatio-Temporal Semantic Feedback Network for Moving Infrared Small Target Detection
by: Huang, Yian, et al.
Published: (2026)

OmniFall: From Staged Through Synthetic to Wild, A Unified Multi-Domain Dataset for Robust Fall Detection
by: Schneider, David, et al.
Published: (2025)

Learning from Semantic Dictionaries: Discriminative Codebook Contrastive Learning for Unified Visual Representation and Generation
by: Estepa, Imanol G., et al.
Published: (2026)

All4One: Symbiotic Neighbour Contrastive Learning via Self-Attention and Redundancy Reduction
by: Estepa, Imanol G., et al.
Published: (2023)

Mistake Attribution: Fine-Grained Mistake Understanding in Egocentric Videos
by: Li, Yayuan, et al.
Published: (2025)

HSDA: High-frequency Shuffle Data Augmentation for Bird's-Eye-View Map Segmentation
by: Glisson, Calvin, et al.
Published: (2024)

SelvaBox: A high-resolution dataset for tropical tree crown detection
by: Baudchon, Hugo, et al.
Published: (2025)

Reference Dataset and Benchmark for Reconstructing Laser Parameters from On-axis Video in Powder Bed Fusion of Bulk Stainless Steel
by: Blanc, Cyril, et al.
Published: (2024)

Semi-Supervised Segmentation via Embedding Matching
by: Xie, Weiyi, et al.
Published: (2024)

DeltaVLM: Interactive Remote Sensing Image Change Analysis via Instruction-guided Difference Perception
by: Deng, Pei, et al.
Published: (2025)

Image-Based Leopard Seal Recognition: Approaches and Challenges in Current Automated Systems
by: Salazar, Jorge Yero, et al.
Published: (2024)

Dense Motion Captioning
by: Xu, Shiyao, et al.
Published: (2025)

A Novel Dataset for Flood Detection Robust to Seasonal Changes in Satellite Imagery
by: Jang, Youngsun, et al.
Published: (2025)

TD3Net: A temporal densely connected multi-dilated convolutional network for lipreading
by: Lee, Byung Hoon, et al.
Published: (2025)

CoMatcher: Multi-View Collaborative Feature Matching
by: Zhang, Jintao, et al.
Published: (2025)

MoDE: Mixture of Diffusion Experts for Any Occluded Face Recognition
by: Fan, Qiannan, et al.
Published: (2025)

NeuroGaze-Distill: Brain-informed Distillation and Depression-Inspired Geometric Priors for Robust Facial Emotion Recognition
by: Li, Zilin, et al.
Published: (2025)

Prompt Sensitivity in Vision-Language Grounding: How Small Changes in Wording Affect Object Detection
by: Deka, Dawar Jyoti, et al.
Published: (2026)

Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
by: Semenov, Andrei, et al.
Published: (2024)

Efficient Temporally-Aware DeepFake Detection using H.264 Motion Vectors
by: Grönquist, Peter, et al.
Published: (2023)

Detecting AI-Generated Videos with Spiking Neural Networks
by: Jang, Minsuk, et al.
Published: (2026)

CG-HOI: Contact-Guided 3D Human-Object Interaction Generation
by: Diller, Christian, et al.
Published: (2023)

Pan-Arctic Permafrost Landform and Human-built Infrastructure Feature Detection with Vision Transformers and Location Embeddings
by: Perera, Amal S., et al.
Published: (2025)

Canonical Space Representation for 4D Panoptic Segmentation of Articulated Objects
by: Gomes, Manuel, et al.
Published: (2025)

Parameter-efficient fine-tuning (PEFT) of Vision Foundation Models for Atypical Mitotic Figure Classification
by: Ramchandani, Lavish, et al.
Published: (2025)

Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation
by: Opra, Balázs, et al.
Published: (2024)

SelvaMask: Segmenting Trees in Tropical Forests and Beyond
by: Duguay, Simon-Olivier, et al.
Published: (2026)

Capacity Constraint Analysis Using Object Detection for Smart Manufacturing
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)

SH17: A Dataset for Human Safety and Personal Protective Equipment Detection in Manufacturing Industry
by: Ahmad, Hafiz Mughees, et al.
Published: (2024)

From Latent to Engine Manifolds: Analyzing ImageBind's Multimodal Embedding Space
by: Hamara, Andrew, et al.
Published: (2024)

Exploring Visual Embedding Spaces Induced by Vision Transformers for Online Auto Parts Marketplaces
by: Armijo, Cameron, et al.
Published: (2025)

Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis
by: Estepa, Imanol G., et al.
Published: (2025)

GIQ: Benchmarking 3D Geometric Reasoning of Vision Foundation Models with Simulated and Real Polyhedra
by: Michalkiewicz, Mateusz, et al.
Published: (2025)

Fairness Without Labels: Pseudo-Balancing for Bias Mitigation in Face Gender Classification
by: Dong, Haohua, et al.
Published: (2025)

HY-Himmel Technical Report: Hierarchical Interleaved Multi-stream Motion Encoding for Long Video Understanding
by: Jin, Haopeng, et al.
Published: (2026)