:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pakdamansavoji, Sajjad, Jha, Kumar Vaibhav, Abdulhai, Baher, Elder, James H
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2511.12342
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion
by: Pakdamansavoji, Sajjad, et al.
Published: (2025)

Box6D : Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes
by: Ma, Yintao, et al.
Published: (2025)

VRU-CIPI: Crossing Intention Prediction at Intersections for Improving Vulnerable Road Users Safety
by: Abdelrahman, Ahmed S., et al.
Published: (2025)

Improving Robustness of Spectrogram Classifiers with Neural Stochastic Differential Equations
by: Brogan, Joel, et al.
Published: (2024)

uTRAND: Unsupervised Anomaly Detection in Traffic Trajectories
by: D'Amicantonio, Giacomo, et al.
Published: (2024)

Constructing Fair Latent Space for Intersection of Fairness and Explainability
by: Joo, Hyungjun, et al.
Published: (2024)

Peer-Ranked Precision: Creating a Foundational Dataset for Fine-Tuning Vision Models from DataSeeds' Annotated Imagery
by: Abdoli, Sajjad, et al.
Published: (2025)

Distracted Robot: How Visual Clutter Undermine Robotic Manipulation
by: Rasouli, Amir, et al.
Published: (2025)

Ensemble Deep Learning and LLM-Assisted Reporting for Automated Skin Lesion Diagnosis
by: Khan, Sher, et al.
Published: (2025)

ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation
by: Hosseini, Hesam, et al.
Published: (2024)

Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)

Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
by: Wan, David, et al.
Published: (2024)

BRo-JEPA: Learning Modular Arithmetic in Latent Space
by: Jha, Divyansh, et al.
Published: (2026)

Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts
by: Choi, Jihye, et al.
Published: (2024)

Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision
by: Cao, Shengcao, et al.
Published: (2024)

Data-Driven Analysis of Intersectional Bias in Image Classification: A Framework with Bias-Weighted Augmentation
by: Yesmin, Farjana
Published: (2025)

Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework
by: Patra, Aswini Kumar, et al.
Published: (2024)

TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning
by: Dinh, Quang Minh, et al.
Published: (2024)

POSESTITCH-SLT: Linguistically Inspired Pose-Stitching for End-to-End Sign Language Translation
by: Joshi, Abhinav, et al.
Published: (2025)

MRI Plane Orientation Detection using a Context-Aware 2.5D Model
by: Kim, SangHyuk, et al.
Published: (2025)

GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization
by: Sidorov, Gennady, et al.
Published: (2024)

Grounded Object Centric Learning
by: Kori, Avinash, et al.
Published: (2023)

MRD-LiNet: A Novel Lightweight Hybrid CNN with Gradient-Guided Unlearning for Improved Drought Stress Identification
by: Patra, Aswini Kumar, et al.
Published: (2025)

Learning Traffic Anomalies from Generative Models on Real-Time Observations
by: Giasemis, Fotis I., et al.
Published: (2025)

Improved Classification of Nitrogen Stress Severity in Plants Under Combined Stress Conditions Using Spatio-Temporal Deep Learning Framework
by: Patra, Aswini Kumar, et al.
Published: (2025)

Uncertainty-aware Diffusion and Reinforcement Learning for Joint Plane Localization and Anomaly Diagnosis in 3D Ultrasound
by: Huang, Yuhao, et al.
Published: (2025)

Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
by: Jeong, Hyeonho, et al.
Published: (2023)

ASTM :Autonomous Smart Traffic Management System Using Artificial Intelligence CNN and LSTM
by: Goenawan, Christofel Rio
Published: (2024)

Handloom Design Generation Using Generative Networks
by: Bhattacharjee, Rajat Kanti, et al.
Published: (2025)

Uncertainty Propagation in XAI: A Comparison of Analytical and Empirical Estimators
by: Chiaburu, Teodor, et al.
Published: (2025)

Analytical Uncertainty-Based Loss Weighting in Multi-Task Learning
by: Kirchdorfer, Lukas, et al.
Published: (2024)

Intelligent Traffic Monitoring with YOLOv11: A Case Study in Real-Time Vehicle Detection
by: Sherifi, Shkelqim
Published: (2026)

Moving Off-the-Grid: Scene-Grounded Video Representations
by: van Steenkiste, Sjoerd, et al.
Published: (2024)

Cortex-Grounded Diffusion Models for Brain Image Generation
by: Bongratz, Fabian, et al.
Published: (2026)

VG3T: Visual Geometry Grounded Gaussian Transformer
by: Kim, Junho, et al.
Published: (2025)

Visual Test-time Scaling for GUI Agent Grounding
by: Luo, Tiange, et al.
Published: (2025)

Grounding Continuous Representations in Geometry: Equivariant Neural Fields
by: Wessels, David R, et al.
Published: (2024)

Sliced Wasserstein with Random-Path Projecting Directions
by: Nguyen, Khai, et al.
Published: (2024)

Multi-hop Upstream Anticipatory Traffic Signal Control with Deep Reinforcement Learning
by: Li, Xiaocan, et al.
Published: (2024)

PhyGround: Benchmarking Physical Reasoning in Generative World Models
by: Lin, Juyi, et al.
Published: (2026)