| Main Authors: | Gupta, Sandeep, Passerone, Roberto |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2508.16225 |
Similar Items
Robust Vision Systems for Connected and Autonomous Vehicles: Security Challenges and Attack Vectors
by: Gupta, Sandeep, et al.
Published: (2026)
Fast & Efficient Normalizing Flows and Applications of Image Generative Models
by: Nagar, Sandeep
Published: (2025)
AVA-Bench: Atomic Visual Ability Benchmark for Vision Foundation Models
by: Mai, Zheda, et al.
Published: (2025)
Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models
by: Zhou, Andy, et al.
Published: (2023)
BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models
by: Gupta, Sunny, et al.
Published: (2026)
Scaling Laws for Robust Comparison of Open Foundation Language-Vision Models and Datasets
by: Nezhurina, Marianna, et al.
Published: (2025)
First On-Orbit Demonstration of a Geospatial Foundation Model
by: Du, Andrew, et al.
Published: (2025)
VESSA: Video-based objEct-centric Self-Supervised Adaptation for Visual Foundation Models
by: Barreto, Jesimon, et al.
Published: (2025)
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video
by: Yao, David Yifan, et al.
Published: (2025)
Directional Gradient Projection for Robust Fine-Tuning of Foundation Models
by: Huang, Chengyue, et al.
Published: (2025)
Specialized Foundation Models Struggle to Beat Supervised Baselines
by: Xu, Zongzhe, et al.
Published: (2024)
Foundation Visual Encoders Are Secretly Few-Shot Anomaly Detectors
by: Zhai, Guangyao, et al.
Published: (2025)
Are Deep Learning Models Robust to Partial Object Occlusion in Visual Recognition Tasks?
by: Kassaw, Kaleb, et al.
Published: (2024)
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
by: Lu, Pan, et al.
Published: (2023)
A Robust Prototype-Based Network with Interpretable RBF Classifier Foundations
by: Saralajew, Sascha, et al.
Published: (2024)
Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs
by: Hu, Zixuan, et al.
Published: (2024)
Can VLMs Reason Robustly? A Neuro-Symbolic Investigation
by: Chen, Weixin, et al.
Published: (2026)
ImitDiff: Transferring Foundation-Model Priors for Distraction Robust Visuomotor Policy
by: Dong, Yuhang, et al.
Published: (2025)
Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study
by: Wang, Chenguang, et al.
Published: (2024)
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models
by: Karamcheti, Siddharth, et al.
Published: (2024)
Investigating the Corruption Robustness of Image Classifiers with Random Lp-norm Corruptions
by: Siedel, Georg, et al.
Published: (2023)
RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection
by: Balakrishnan, Thennarasi, et al.
Published: (2024)
Grounding Foundational Vision Models with 3D Human Poses for Robust Action Recognition
by: Babey, Nicholas, et al.
Published: (2025)
Leveraging Foundation Models for Causal Generative Modeling
by: Komanduri, Aneesh, et al.
Published: (2026)
Revisiting Model Stitching In the Foundation Model Era
by: Mai, Zheda, et al.
Published: (2026)
Experience with Single Domain Generalization in Real World Medical Imaging Deployments
by: Banerjee, Ayan, et al.
Published: (2026)
ExpVG: Investigating the Design Space of Visual Grounding in Multimodal Large Language Model
by: Kang, Weitai, et al.
Published: (2025)
(Almost) Free Modality Stitching of Foundation Models
by: Singh, Jaisidh, et al.
Published: (2025)
PRISM: Distributed Inference for Foundation Models at Edge
by: Qazi, Muhammad Azlan, et al.
Published: (2025)
3D-LFM: Lifting Foundation Model
by: Dabhi, Mosam, et al.
Published: (2023)
Domain-Aware Fine-Tuning of Foundation Models
by: Kaplan, Ugur Ali, et al.
Published: (2024)
A Comprehensive Survey of Foundation Models in Medicine
by: Khan, Wasif, et al.
Published: (2024)
Research on the Spatial Data Intelligent Foundation Model
by: Wang, Shaohua, et al.
Published: (2024)
X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection
by: Kim, Youngseo, et al.
Published: (2026)
VFMF: World Modeling by Forecasting Vision Foundation Model Features
by: Boduljak, Gabrijel, et al.
Published: (2025)
When Better Eyes Lead to Blindness: A Diagnostic Study of the Information Bottleneck in CNN-LSTM Image Captioning Models
by: Gupta, Hitesh Kumar
Published: (2025)
Training Video Foundation Models with NVIDIA NeMo
by: Patel, Zeeshan, et al.
Published: (2025)
CFM: Language-aligned Concept Foundation Model for Vision
by: Wittenmayer, Kai, et al.
Published: (2026)
Collaborating Foundation Models for Domain Generalized Semantic Segmentation
by: Benigmim, Yasser, et al.
Published: (2023)
Bridging Remote Sensors with Multisensor Geospatial Foundation Models
by: Han, Boran, et al.
Published: (2024)