Saved in:
| Main Authors: | Maity, Soumyajit, Kamboj, Pranjal, Maity, Sneha, Singh, Rajat, Chatterjee, Sankhadeep |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.05600 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MedGemma Technical Report
by: Sellergren, Andrew, et al.
Published: (2025)
by: Sellergren, Andrew, et al.
Published: (2025)
MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images
by: Prottasha, Md. Sazzadul Islam, et al.
Published: (2025)
by: Prottasha, Md. Sazzadul Islam, et al.
Published: (2025)
Comparative Analysis of Object Detection Algorithms for Surface Defect Detection
by: Maity, Arpan, et al.
Published: (2025)
by: Maity, Arpan, et al.
Published: (2025)
The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers
by: Kamboj, Abhi
Published: (2024)
by: Kamboj, Abhi
Published: (2024)
Hybrid Deep Learning Framework for Enhanced Diabetic Retinopathy Detection: Integrating Traditional Features with AI-driven Insights
by: Maity, Arpan, et al.
Published: (2025)
by: Maity, Arpan, et al.
Published: (2025)
ToxVidLM: A Multimodal Framework for Toxicity Detection in Code-Mixed Videos
by: Maity, Krishanu, et al.
Published: (2024)
by: Maity, Krishanu, et al.
Published: (2024)
MedGemma 1.5 Technical Report
by: Sellergren, Andrew, et al.
Published: (2026)
by: Sellergren, Andrew, et al.
Published: (2026)
Evaluating Prompting Strategies with MedGemma for Medical Order Extraction
by: Balachandran, Abhinand, et al.
Published: (2025)
by: Balachandran, Abhinand, et al.
Published: (2025)
On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
by: Modi, Rajat, et al.
Published: (2024)
by: Modi, Rajat, et al.
Published: (2024)
Asynchronous Perception Machine For Efficient Test-Time-Training
by: Modi, Rajat, et al.
Published: (2024)
by: Modi, Rajat, et al.
Published: (2024)
AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems
by: Sahu, Ishan, et al.
Published: (2026)
by: Sahu, Ishan, et al.
Published: (2026)
A Brief Survey on Leveraging Large Scale Vision Models for Enhanced Robot Grasping
by: Kamboj, Abhi, et al.
Published: (2024)
by: Kamboj, Abhi, et al.
Published: (2024)
Real-Time Drivers' Drowsiness Detection and Analysis through Deep Learning
by: Zaman, ANK, et al.
Published: (2025)
by: Zaman, ANK, et al.
Published: (2025)
Normal-Abnormal Guided Generalist Anomaly Detection
by: Wang, Yuexin, et al.
Published: (2025)
by: Wang, Yuexin, et al.
Published: (2025)
Abnormal Event Detection In Videos Using Deep Embedding
by: Venkatrayappa, Darshan
Published: (2024)
by: Venkatrayappa, Darshan
Published: (2024)
City-Mesh3R: Simulation-Ready City-Scale 3D Mesh Reconstruction from Multi-View Images
by: Paul, Sayan, et al.
Published: (2026)
by: Paul, Sayan, et al.
Published: (2026)
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
by: Masry, Ahmed, et al.
Published: (2024)
by: Masry, Ahmed, et al.
Published: (2024)
PaliGemma-CXR: A Multi-task Multimodal Model for TB Chest X-ray Interpretation
by: Musinguzi, Denis, et al.
Published: (2025)
by: Musinguzi, Denis, et al.
Published: (2025)
MedNet-PVS: A MedNeXt-Based Deep Learning Model for Automated Segmentation of Perivascular Spaces
by: Low, Zhen Xuen Brandon, et al.
Published: (2025)
by: Low, Zhen Xuen Brandon, et al.
Published: (2025)
Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs
by: Zun, Lee Qi, et al.
Published: (2025)
by: Zun, Lee Qi, et al.
Published: (2025)
Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations
by: Ngo, Dinh An, et al.
Published: (2024)
by: Ngo, Dinh An, et al.
Published: (2024)
CrossMed: A Multimodal Cross-Task Benchmark for Compositional Generalization in Medical Imaging
by: Singh, Pooja, et al.
Published: (2025)
by: Singh, Pooja, et al.
Published: (2025)
A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity Recognition
by: Kamboj, Abhi, et al.
Published: (2024)
by: Kamboj, Abhi, et al.
Published: (2024)
Towards Achieving Perfect Multimodal Alignment
by: Kamboj, Abhi, et al.
Published: (2025)
by: Kamboj, Abhi, et al.
Published: (2025)
Robust Kidney Abnormality Segmentation: A Validation Study of an AI-Based Framework
by: de Boer, Sarah, et al.
Published: (2025)
by: de Boer, Sarah, et al.
Published: (2025)
Robult: Leveraging Redundancy and Modality Specific Features for Robust Multimodal Learning
by: Nguyen, Duy A., et al.
Published: (2025)
by: Nguyen, Duy A., et al.
Published: (2025)
Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
by: Wang, Zeqing, et al.
Published: (2024)
by: Wang, Zeqing, et al.
Published: (2024)
Detecting and Characterising Mobile App Metamorphosis in Google Play Store
by: Denipitiyage, D., et al.
Published: (2024)
by: Denipitiyage, D., et al.
Published: (2024)
PaliGemma: A versatile 3B VLM for transfer
by: Beyer, Lucas, et al.
Published: (2024)
by: Beyer, Lucas, et al.
Published: (2024)
Who Does Your Algorithm Fail? Investigating Age and Ethnic Bias in the MAMA-MIA Dataset
by: Parikh, Aditya, et al.
Published: (2025)
by: Parikh, Aditya, et al.
Published: (2025)
3D Modality-Aware Pre-training for Vision-Language Model in MRI Multi-organ Abnormality Detection
by: Zhu, Haowen, et al.
Published: (2026)
by: Zhu, Haowen, et al.
Published: (2026)
Implementing Edge Based Object Detection For Microplastic Debris
by: Singh, Amardeep, et al.
Published: (2023)
by: Singh, Amardeep, et al.
Published: (2023)
PlantDiseaseNet-RT50: A Fine-tuned ResNet50 Architecture for High-Accuracy Plant Disease Detection Beyond Standard CNNs
by: Sagnika, Santwana, et al.
Published: (2025)
by: Sagnika, Santwana, et al.
Published: (2025)
Automated Motion Artifact Check for MRI (AutoMAC-MRI): An Interpretable Framework for Motion Artifact Detection and Severity Assessment
by: Jerald, Antony, et al.
Published: (2025)
by: Jerald, Antony, et al.
Published: (2025)
MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
by: Sheikh, Tooba Tehreem, et al.
Published: (2025)
by: Sheikh, Tooba Tehreem, et al.
Published: (2025)
Surveillance Video-Based Traffic Accident Detection Using Transformer Architecture
by: Singh, Tanu, et al.
Published: (2025)
by: Singh, Tanu, et al.
Published: (2025)
DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
by: Van Landeghem, Jordy, et al.
Published: (2024)
by: Van Landeghem, Jordy, et al.
Published: (2024)
OrthoDiffusion: A Generalizable Multi-Task Diffusion Foundation Model for Musculoskeletal MRI Interpretation
by: Lan, Tian, et al.
Published: (2026)
by: Lan, Tian, et al.
Published: (2026)
Abnormalities and Disease Detection in Gastro-Intestinal Tract Images
by: Khan, Zeshan, et al.
Published: (2026)
by: Khan, Zeshan, et al.
Published: (2026)
DiffMorph: Text-less Image Morphing with Diffusion Models
by: Chatterjee, Shounak
Published: (2024)
by: Chatterjee, Shounak
Published: (2024)
Similar Items
-
MedGemma Technical Report
by: Sellergren, Andrew, et al.
Published: (2025) -
MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images
by: Prottasha, Md. Sazzadul Islam, et al.
Published: (2025) -
Comparative Analysis of Object Detection Algorithms for Surface Defect Detection
by: Maity, Arpan, et al.
Published: (2025) -
The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers
by: Kamboj, Abhi
Published: (2024) -
Hybrid Deep Learning Framework for Enhanced Diabetic Retinopathy Detection: Integrating Traditional Features with AI-driven Insights
by: Maity, Arpan, et al.
Published: (2025)