:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Maity, Soumyajit, Kamboj, Pranjal, Maity, Sneha, Singh, Rajat, Chatterjee, Sankhadeep
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.05600
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MedGemma Technical Report
by: Sellergren, Andrew, et al.
Published: (2025)

MedGemma vs GPT-4: Open-Source and Proprietary Zero-shot Medical Disease Classification from Images
by: Prottasha, Md. Sazzadul Islam, et al.
Published: (2025)

Comparative Analysis of Object Detection Algorithms for Surface Defect Detection
by: Maity, Arpan, et al.
Published: (2025)

The Progression of Transformers from Language to Vision to MOT: A Literature Review on Multi-Object Tracking with Transformers
by: Kamboj, Abhi
Published: (2024)

Hybrid Deep Learning Framework for Enhanced Diabetic Retinopathy Detection: Integrating Traditional Features with AI-driven Insights
by: Maity, Arpan, et al.
Published: (2025)

ToxVidLM: A Multimodal Framework for Toxicity Detection in Code-Mixed Videos
by: Maity, Krishanu, et al.
Published: (2024)

MedGemma 1.5 Technical Report
by: Sellergren, Andrew, et al.
Published: (2026)

Evaluating Prompting Strategies with MedGemma for Medical Order Extraction
by: Balachandran, Abhinand, et al.
Published: (2025)

On Occlusions in Video Action Detection: Benchmark Datasets And Training Recipes
by: Modi, Rajat, et al.
Published: (2024)

Asynchronous Perception Machine For Efficient Test-Time-Training
by: Modi, Rajat, et al.
Published: (2024)

AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems
by: Sahu, Ishan, et al.
Published: (2026)

A Brief Survey on Leveraging Large Scale Vision Models for Enhanced Robot Grasping
by: Kamboj, Abhi, et al.
Published: (2024)

Real-Time Drivers' Drowsiness Detection and Analysis through Deep Learning
by: Zaman, ANK, et al.
Published: (2025)

Normal-Abnormal Guided Generalist Anomaly Detection
by: Wang, Yuexin, et al.
Published: (2025)

Abnormal Event Detection In Videos Using Deep Embedding
by: Venkatrayappa, Darshan
Published: (2024)

City-Mesh3R: Simulation-Ready City-Scale 3D Mesh Reconstruction from Multi-View Images
by: Paul, Sayan, et al.
Published: (2026)

ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
by: Masry, Ahmed, et al.
Published: (2024)

PaliGemma-CXR: A Multi-task Multimodal Model for TB Chest X-ray Interpretation
by: Musinguzi, Denis, et al.
Published: (2025)

MedNet-PVS: A MedNeXt-Based Deep Learning Model for Automated Segmentation of Perivascular Spaces
by: Low, Zhen Xuen Brandon, et al.
Published: (2025)

Fine-Tuning MedGemma for Clinical Captioning to Enhance Multimodal RAG over Malaysia CPGs
by: Zun, Lee Qi, et al.
Published: (2025)

Examining Monitoring System: Detecting Abnormal Behavior In Online Examinations
by: Ngo, Dinh An, et al.
Published: (2024)

CrossMed: A Multimodal Cross-Task Benchmark for Compositional Generalization in Medical Imaging
by: Singh, Pooja, et al.
Published: (2025)

A Survey of IMU Based Cross-Modal Transfer Learning in Human Activity Recognition
by: Kamboj, Abhi, et al.
Published: (2024)

Towards Achieving Perfect Multimodal Alignment
by: Kamboj, Abhi, et al.
Published: (2025)

Robust Kidney Abnormality Segmentation: A Validation Study of an AI-Based Framework
by: de Boer, Sarah, et al.
Published: (2025)

Robult: Leveraging Redundancy and Modality Specific Features for Robust Multimodal Learning
by: Nguyen, Duy A., et al.
Published: (2025)

Is this Generated Person Existed in Real-world? Fine-grained Detecting and Calibrating Abnormal Human-body
by: Wang, Zeqing, et al.
Published: (2024)

Detecting and Characterising Mobile App Metamorphosis in Google Play Store
by: Denipitiyage, D., et al.
Published: (2024)

PaliGemma: A versatile 3B VLM for transfer
by: Beyer, Lucas, et al.
Published: (2024)

Who Does Your Algorithm Fail? Investigating Age and Ethnic Bias in the MAMA-MIA Dataset
by: Parikh, Aditya, et al.
Published: (2025)

3D Modality-Aware Pre-training for Vision-Language Model in MRI Multi-organ Abnormality Detection
by: Zhu, Haowen, et al.
Published: (2026)

Implementing Edge Based Object Detection For Microplastic Debris
by: Singh, Amardeep, et al.
Published: (2023)

PlantDiseaseNet-RT50: A Fine-tuned ResNet50 Architecture for High-Accuracy Plant Disease Detection Beyond Standard CNNs
by: Sagnika, Santwana, et al.
Published: (2025)

Automated Motion Artifact Check for MRI (AutoMAC-MRI): An Interpretable Framework for Motion Artifact Detection and Severity Assessment
by: Jerald, Antony, et al.
Published: (2025)

MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities
by: Sheikh, Tooba Tehreem, et al.
Published: (2025)

Surveillance Video-Based Traffic Accident Detection Using Transformer Architecture
by: Singh, Tanu, et al.
Published: (2025)

DistilDoc: Knowledge Distillation for Visually-Rich Document Applications
by: Van Landeghem, Jordy, et al.
Published: (2024)

OrthoDiffusion: A Generalizable Multi-Task Diffusion Foundation Model for Musculoskeletal MRI Interpretation
by: Lan, Tian, et al.
Published: (2026)

Abnormalities and Disease Detection in Gastro-Intestinal Tract Images
by: Khan, Zeshan, et al.
Published: (2026)

DiffMorph: Text-less Image Morphing with Diffusion Models
by: Chatterjee, Shounak
Published: (2024)