Saved in:
| Main Authors: | Pakdamansavoji, Sajjad, Jha, Kumar Vaibhav, Abdulhai, Baher, Elder, James H |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.12342 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion
by: Pakdamansavoji, Sajjad, et al.
Published: (2025)
by: Pakdamansavoji, Sajjad, et al.
Published: (2025)
Box6D : Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes
by: Ma, Yintao, et al.
Published: (2025)
by: Ma, Yintao, et al.
Published: (2025)
VRU-CIPI: Crossing Intention Prediction at Intersections for Improving Vulnerable Road Users Safety
by: Abdelrahman, Ahmed S., et al.
Published: (2025)
by: Abdelrahman, Ahmed S., et al.
Published: (2025)
Improving Robustness of Spectrogram Classifiers with Neural Stochastic Differential Equations
by: Brogan, Joel, et al.
Published: (2024)
by: Brogan, Joel, et al.
Published: (2024)
uTRAND: Unsupervised Anomaly Detection in Traffic Trajectories
by: D'Amicantonio, Giacomo, et al.
Published: (2024)
by: D'Amicantonio, Giacomo, et al.
Published: (2024)
Constructing Fair Latent Space for Intersection of Fairness and Explainability
by: Joo, Hyungjun, et al.
Published: (2024)
by: Joo, Hyungjun, et al.
Published: (2024)
Peer-Ranked Precision: Creating a Foundational Dataset for Fine-Tuning Vision Models from DataSeeds' Annotated Imagery
by: Abdoli, Sajjad, et al.
Published: (2025)
by: Abdoli, Sajjad, et al.
Published: (2025)
Distracted Robot: How Visual Clutter Undermine Robotic Manipulation
by: Rasouli, Amir, et al.
Published: (2025)
by: Rasouli, Amir, et al.
Published: (2025)
Ensemble Deep Learning and LLM-Assisted Reporting for Automated Skin Lesion Diagnosis
by: Khan, Sher, et al.
Published: (2025)
by: Khan, Sher, et al.
Published: (2025)
ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation
by: Hosseini, Hesam, et al.
Published: (2024)
by: Hosseini, Hesam, et al.
Published: (2024)
Parameter Reduction Improves Vision Transformers: A Comparative Study of Sharing and Width Reduction
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
by: Kumar, Anantha Padmanaban Krishna
Published: (2025)
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
by: Wan, David, et al.
Published: (2024)
by: Wan, David, et al.
Published: (2024)
BRo-JEPA: Learning Modular Arithmetic in Latent Space
by: Jha, Divyansh, et al.
Published: (2026)
by: Jha, Divyansh, et al.
Published: (2026)
Adaptive Concept Bottleneck for Foundation Models Under Distribution Shifts
by: Choi, Jihye, et al.
Published: (2024)
by: Choi, Jihye, et al.
Published: (2024)
Emergent Visual Grounding in Large Multimodal Models Without Grounding Supervision
by: Cao, Shengcao, et al.
Published: (2024)
by: Cao, Shengcao, et al.
Published: (2024)
Data-Driven Analysis of Intersectional Bias in Image Classification: A Framework with Bias-Weighted Augmentation
by: Yesmin, Farjana
Published: (2025)
by: Yesmin, Farjana
Published: (2025)
Improved Cotton Leaf Disease Classification Using Parameter-Efficient Deep Learning Framework
by: Patra, Aswini Kumar, et al.
Published: (2024)
by: Patra, Aswini Kumar, et al.
Published: (2024)
TrafficVLM: A Controllable Visual Language Model for Traffic Video Captioning
by: Dinh, Quang Minh, et al.
Published: (2024)
by: Dinh, Quang Minh, et al.
Published: (2024)
POSESTITCH-SLT: Linguistically Inspired Pose-Stitching for End-to-End Sign Language Translation
by: Joshi, Abhinav, et al.
Published: (2025)
by: Joshi, Abhinav, et al.
Published: (2025)
MRI Plane Orientation Detection using a Context-Aware 2.5D Model
by: Kim, SangHyuk, et al.
Published: (2025)
by: Kim, SangHyuk, et al.
Published: (2025)
GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization
by: Sidorov, Gennady, et al.
Published: (2024)
by: Sidorov, Gennady, et al.
Published: (2024)
Grounded Object Centric Learning
by: Kori, Avinash, et al.
Published: (2023)
by: Kori, Avinash, et al.
Published: (2023)
MRD-LiNet: A Novel Lightweight Hybrid CNN with Gradient-Guided Unlearning for Improved Drought Stress Identification
by: Patra, Aswini Kumar, et al.
Published: (2025)
by: Patra, Aswini Kumar, et al.
Published: (2025)
Learning Traffic Anomalies from Generative Models on Real-Time Observations
by: Giasemis, Fotis I., et al.
Published: (2025)
by: Giasemis, Fotis I., et al.
Published: (2025)
Improved Classification of Nitrogen Stress Severity in Plants Under Combined Stress Conditions Using Spatio-Temporal Deep Learning Framework
by: Patra, Aswini Kumar, et al.
Published: (2025)
by: Patra, Aswini Kumar, et al.
Published: (2025)
Uncertainty-aware Diffusion and Reinforcement Learning for Joint Plane Localization and Anomaly Diagnosis in 3D Ultrasound
by: Huang, Yuhao, et al.
Published: (2025)
by: Huang, Yuhao, et al.
Published: (2025)
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
by: Jeong, Hyeonho, et al.
Published: (2023)
by: Jeong, Hyeonho, et al.
Published: (2023)
ASTM :Autonomous Smart Traffic Management System Using Artificial Intelligence CNN and LSTM
by: Goenawan, Christofel Rio
Published: (2024)
by: Goenawan, Christofel Rio
Published: (2024)
Handloom Design Generation Using Generative Networks
by: Bhattacharjee, Rajat Kanti, et al.
Published: (2025)
by: Bhattacharjee, Rajat Kanti, et al.
Published: (2025)
Uncertainty Propagation in XAI: A Comparison of Analytical and Empirical Estimators
by: Chiaburu, Teodor, et al.
Published: (2025)
by: Chiaburu, Teodor, et al.
Published: (2025)
Analytical Uncertainty-Based Loss Weighting in Multi-Task Learning
by: Kirchdorfer, Lukas, et al.
Published: (2024)
by: Kirchdorfer, Lukas, et al.
Published: (2024)
Intelligent Traffic Monitoring with YOLOv11: A Case Study in Real-Time Vehicle Detection
by: Sherifi, Shkelqim
Published: (2026)
by: Sherifi, Shkelqim
Published: (2026)
Moving Off-the-Grid: Scene-Grounded Video Representations
by: van Steenkiste, Sjoerd, et al.
Published: (2024)
by: van Steenkiste, Sjoerd, et al.
Published: (2024)
Cortex-Grounded Diffusion Models for Brain Image Generation
by: Bongratz, Fabian, et al.
Published: (2026)
by: Bongratz, Fabian, et al.
Published: (2026)
VG3T: Visual Geometry Grounded Gaussian Transformer
by: Kim, Junho, et al.
Published: (2025)
by: Kim, Junho, et al.
Published: (2025)
Visual Test-time Scaling for GUI Agent Grounding
by: Luo, Tiange, et al.
Published: (2025)
by: Luo, Tiange, et al.
Published: (2025)
Grounding Continuous Representations in Geometry: Equivariant Neural Fields
by: Wessels, David R, et al.
Published: (2024)
by: Wessels, David R, et al.
Published: (2024)
Sliced Wasserstein with Random-Path Projecting Directions
by: Nguyen, Khai, et al.
Published: (2024)
by: Nguyen, Khai, et al.
Published: (2024)
Multi-hop Upstream Anticipatory Traffic Signal Control with Deep Reinforcement Learning
by: Li, Xiaocan, et al.
Published: (2024)
by: Li, Xiaocan, et al.
Published: (2024)
PhyGround: Benchmarking Physical Reasoning in Generative World Models
by: Lin, Juyi, et al.
Published: (2026)
by: Lin, Juyi, et al.
Published: (2026)
Similar Items
-
WALDO: Where Unseen Model-based 6D Pose Estimation Meets Occlusion
by: Pakdamansavoji, Sajjad, et al.
Published: (2025) -
Box6D : Zero-shot Category-level 6D Pose Estimation of Warehouse Boxes
by: Ma, Yintao, et al.
Published: (2025) -
VRU-CIPI: Crossing Intention Prediction at Intersections for Improving Vulnerable Road Users Safety
by: Abdelrahman, Ahmed S., et al.
Published: (2025) -
Improving Robustness of Spectrogram Classifiers with Neural Stochastic Differential Equations
by: Brogan, Joel, et al.
Published: (2024) -
uTRAND: Unsupervised Anomaly Detection in Traffic Trajectories
by: D'Amicantonio, Giacomo, et al.
Published: (2024)