:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gupta, Sandeep, Passerone, Roberto
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2508.16225
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Robust Vision Systems for Connected and Autonomous Vehicles: Security Challenges and Attack Vectors
by: Gupta, Sandeep, et al.
Published: (2026)

Fast & Efficient Normalizing Flows and Applications of Image Generative Models
by: Nagar, Sandeep
Published: (2025)

AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
by: Mai, Zheda, et al.
Published: (2025)

Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models
by: Zhou, Andy, et al.
Published: (2023)

BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models
by: Gupta, Sunny, et al.
Published: (2026)

Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets
by: Nezhurina, Marianna, et al.
Published: (2025)

First On-Orbit Demonstration of a Geospatial Foundation Model
by: Du, Andrew, et al.
Published: (2025)

VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models
by: Barreto, Jesimon, et al.
Published: (2025)

Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
by: Yao, David Yifan, et al.
Published: (2025)

Directional Gradient Projection for Robust Fine-Tuning of Foundation Models
by: Huang, Chengyue, et al.
Published: (2025)

Specialized Foundation Models Struggle to Beat Supervised Baselines
by: Xu, Zongzhe, et al.
Published: (2024)

Foundation Visual Encoders Are Secretly Few-Shot Anomaly Detectors
by: Zhai, Guangyao, et al.
Published: (2025)

Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks?
by: Kassaw, Kaleb, et al.
Published: (2024)

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
by: Lu, Pan, et al.
Published: (2023)

A Robust Prototype-Based Network with Interpretable RBF Classifier Foundations
by: Saralajew, Sascha, et al.
Published: (2024)

Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
by: Hu, Zixuan, et al.
Published: (2024)

Can VLMs Reason Robustly? A Neuro-Symbolic Investigation
by: Chen, Weixin, et al.
Published: (2026)

ImitDiff: Transferring Foundation-Model Priors for Distraction Robust Visuomotor Policy
by: Dong, Yuhang, et al.
Published: (2025)

Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study
by: Wang, Chenguang, et al.
Published: (2024)

Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models
by: Karamcheti, Siddharth, et al.
Published: (2024)

Investigating the Corruption Robustness of Image Classifiers with Random Lp-norm Corruptions
by: Siedel, Georg, et al.
Published: (2023)

RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection
by: Balakrishnan, Thennarasi, et al.
Published: (2024)

Grounding Foundational Vision Models with 3D Human Poses for Robust Action Recognition
by: Babey, Nicholas, et al.
Published: (2025)

Leveraging Foundation Models for Causal Generative Modeling
by: Komanduri, Aneesh, et al.
Published: (2026)

Revisiting Model Stitching In the Foundation Model Era
by: Mai, Zheda, et al.
Published: (2026)

Experience with Single Domain Generalization in Real World Medical Imaging Deployments
by: Banerjee, Ayan, et al.
Published: (2026)

ExpVG: Investigating the Design Space of Visual Grounding in Multimodal Large Language Model
by: Kang, Weitai, et al.
Published: (2025)

(Almost) Free Modality Stitching of Foundation Models
by: Singh, Jaisidh, et al.
Published: (2025)

PRISM: Distributed Inference for Foundation Models at Edge
by: Qazi, Muhammad Azlan, et al.
Published: (2025)

3D-LFM: Lifting Foundation Model
by: Dabhi, Mosam, et al.
Published: (2023)

Domain-Aware Fine-Tuning of Foundation Models
by: Kaplan, Ugur Ali, et al.
Published: (2024)

A Comprehensive Survey of Foundation Models in Medicine
by: Khan, Wasif, et al.
Published: (2024)

Research on the Spatial Data Intelligent Foundation Model
by: Wang, Shaohua, et al.
Published: (2024)

X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection
by: Kim, Youngseo, et al.
Published: (2026)

VFMF: World Modeling by Forecasting Vision Foundation Model Features
by: Boduljak, Gabrijel, et al.
Published: (2025)

When Better Eyes Lead to Blindness: A Diagnostic Study of the Information Bottleneck in CNN-LSTM Image Captioning Models
by: Gupta, Hitesh Kumar
Published: (2025)

Training Video Foundation Models with NVIDIA NeMo
by: Patel, Zeeshan, et al.
Published: (2025)

CFM: Language-aligned Concept Foundation Model for Vision
by: Wittenmayer, Kai, et al.
Published: (2026)

Collaborating Foundation Models for Domain Generalized Semantic Segmentation
by: Benigmim, Yasser, et al.
Published: (2023)

Bridging Remote Sensors with Multisensor Geospatial Foundation Models
by: Han, Boran, et al.
Published: (2024)