Saved in:
| Main Authors: | Baig, Mirza Samad Ahmed, Gillani, Syeda Anshrah, Khan, Abdul Akbar, Shah, Shahid Munir, Khan, Muhammad Omer |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.12088 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AI-based Wearable Vision Assistance System for the Visually Impaired: Integrating Real-Time Object Recognition and Contextual Understanding Using Large Vision-Language Models
by: Baig, Mirza Samad Ahmed, et al.
Published: (2024)
by: Baig, Mirza Samad Ahmed, et al.
Published: (2024)
TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision
by: Gillani, Syeda Anshrah, et al.
Published: (2025)
by: Gillani, Syeda Anshrah, et al.
Published: (2025)
Advancing Depression Detection on Social Media Platforms Through Fine-Tuned Large Language Models
by: Shah, Shahid Munir, et al.
Published: (2024)
by: Shah, Shahid Munir, et al.
Published: (2024)
Symmetry-Constrained Language-Guided Program Synthesis for Discovering Governing Equations from Noisy and Partial Observations
by: Baig, Mirza Samad Ahmed, et al.
Published: (2026)
by: Baig, Mirza Samad Ahmed, et al.
Published: (2026)
DAUNet: A Lightweight UNet Variant with Deformable Convolutions and Parameter-Free Attention for Medical Image Segmentation
by: Munir, Adnan, et al.
Published: (2025)
by: Munir, Adnan, et al.
Published: (2025)
Adaptive Image Restoration for Video Surveillance: A Real-Time Approach
by: Amin, Muhammad Awais, et al.
Published: (2025)
by: Amin, Muhammad Awais, et al.
Published: (2025)
TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation
by: Danish, Muhammad Sohail, et al.
Published: (2025)
by: Danish, Muhammad Sohail, et al.
Published: (2025)
O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models
by: Sharifdeen, Ashshak, et al.
Published: (2025)
by: Sharifdeen, Ashshak, et al.
Published: (2025)
CE-RS-SBCIT A Novel Channel Enhanced Hybrid CNN Transformer with Residual, Spatial, and Boundary-Aware Learning for Brain Tumor MRI Analysis
by: Zahoor, Mirza Mumtaz, et al.
Published: (2025)
by: Zahoor, Mirza Mumtaz, et al.
Published: (2025)
EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection
by: Khan, Muhammad Ahmed Ullah, et al.
Published: (2025)
by: Khan, Muhammad Ahmed Ullah, et al.
Published: (2025)
Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning
by: Munir, Ans, et al.
Published: (2024)
by: Munir, Ans, et al.
Published: (2024)
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks
by: Danish, Muhammad Sohail, et al.
Published: (2024)
by: Danish, Muhammad Sohail, et al.
Published: (2024)
Advanced Vision Transformers and Open-Set Learning for Robust Mosquito Classification: A Novel Approach to Entomological Studies
by: Karim, Ahmed Akib Jawad, et al.
Published: (2024)
by: Karim, Ahmed Akib Jawad, et al.
Published: (2024)
MaxViT-UNet: Multi-Axis Attention for Medical Image Segmentation
by: Khan, Abdul Rehman, et al.
Published: (2023)
by: Khan, Abdul Rehman, et al.
Published: (2023)
A Hybrid Approach for COVID-19 Detection: Combining Wasserstein GAN with Transfer Learning
by: Rounaq, Sumera, et al.
Published: (2024)
by: Rounaq, Sumera, et al.
Published: (2024)
Agentic AI for Remote Sensing: Technical Challenges and Research Directions
by: Munir, Muhammad Akhtar, et al.
Published: (2026)
by: Munir, Muhammad Akhtar, et al.
Published: (2026)
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition
by: Ullah, Hayat, et al.
Published: (2025)
by: Ullah, Hayat, et al.
Published: (2025)
A Permuted Autoregressive Approach to Word-Level Recognition for Urdu Digital Text
by: Mustafa, Ahmed, et al.
Published: (2024)
by: Mustafa, Ahmed, et al.
Published: (2024)
Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention
by: Khan, Zohaib, et al.
Published: (2024)
by: Khan, Zohaib, et al.
Published: (2024)
Streamline pathology foundation model by cross-magnification distillation
by: Su, Ziyu, et al.
Published: (2025)
by: Su, Ziyu, et al.
Published: (2025)
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification
by: Munir, Ans, et al.
Published: (2025)
by: Munir, Ans, et al.
Published: (2025)
Compositional Zero-Shot Learning: A Survey
by: Munir, Ans, et al.
Published: (2025)
by: Munir, Ans, et al.
Published: (2025)
ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks
by: Shabbir, Akashah, et al.
Published: (2025)
by: Shabbir, Akashah, et al.
Published: (2025)
Channel Boosted CNN-Transformer-based Multi-Level and Multi-Scale Nuclei Segmentation
by: Rauf, Zunaira, et al.
Published: (2024)
by: Rauf, Zunaira, et al.
Published: (2024)
AutoGen Driven Multi Agent Framework for Iterative Crime Data Analysis and Prediction
by: Fatima, Syeda Kisaa, et al.
Published: (2025)
by: Fatima, Syeda Kisaa, et al.
Published: (2025)
Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment
by: Danish, Muhammad Sohail, et al.
Published: (2024)
by: Danish, Muhammad Sohail, et al.
Published: (2024)
Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification
by: Khan, Muhammad Saif Ullah, et al.
Published: (2024)
by: Khan, Muhammad Saif Ullah, et al.
Published: (2024)
DenseSwinV2: Channel Attentive Dual Branch CNN Transformer Learning for Cassava Leaf Disease Classification
by: Saood, Shah, et al.
Published: (2026)
by: Saood, Shah, et al.
Published: (2026)
OpenEarthAgent: A Unified Framework for Tool-Augmented Geospatial Agents
by: Shabbir, Akashah, et al.
Published: (2026)
by: Shabbir, Akashah, et al.
Published: (2026)
Beyond Anatomy: Explainable ASD Classification from rs-fMRI via Functional Parcellation and Graph Attention Networks
by: Madani, Syeda Hareem, et al.
Published: (2026)
by: Madani, Syeda Hareem, et al.
Published: (2026)
OD-VIRAT: A Large-Scale Benchmark for Object Detection in Realistic Surveillance Environments
by: Ullah, Hayat, et al.
Published: (2025)
by: Ullah, Hayat, et al.
Published: (2025)
Robust and Label-Efficient Deep Waste Detection
by: Abid, Hassan, et al.
Published: (2025)
by: Abid, Hassan, et al.
Published: (2025)
Synergistic Neural Forecasting of Air Pollution with Stochastic Sampling
by: Abeysinghe, Yohan, et al.
Published: (2025)
by: Abeysinghe, Yohan, et al.
Published: (2025)
TAFM-Net: A Novel Approach to Skin Lesion Segmentation Using Transformer Attention and Focal Modulation
by: Khan, Tariq M, et al.
Published: (2024)
by: Khan, Tariq M, et al.
Published: (2024)
EoCD: Encoder only Remote Sensing Change Detection
by: Noman, Mubashir, et al.
Published: (2026)
by: Noman, Mubashir, et al.
Published: (2026)
Region Guided Attention Network for Retinal Vessel Segmentation
by: Javed, Syed, et al.
Published: (2024)
by: Javed, Syed, et al.
Published: (2024)
Depth Attention for Robust RGB Tracking
by: Liu, Yu, et al.
Published: (2024)
by: Liu, Yu, et al.
Published: (2024)
KNN and ANN-based Recognition of Handwritten Pashto Letters using Zoning Features
by: Khan, Sulaiman, et al.
Published: (2019)
by: Khan, Sulaiman, et al.
Published: (2019)
Unified Multi-Foundation-Model Slide Representation for Pan-Cancer Recognition and Text-Guided Tumor Localization
by: Wang, Tianyang, et al.
Published: (2026)
by: Wang, Tianyang, et al.
Published: (2026)
A Tumor Aware DenseNet Swin Hybrid Learning with Boosted and Hierarchical Feature Spaces for Large-Scale Brain MRI Classification
by: Shah, Muhammad Ali, et al.
Published: (2026)
by: Shah, Muhammad Ali, et al.
Published: (2026)
Similar Items
-
AI-based Wearable Vision Assistance System for the Visually Impaired: Integrating Real-Time Object Recognition and Contextual Understanding Using Large Vision-Language Models
by: Baig, Mirza Samad Ahmed, et al.
Published: (2024) -
TextPixs: Glyph-Conditioned Diffusion with Character-Aware Attention and OCR-Guided Supervision
by: Gillani, Syeda Anshrah, et al.
Published: (2025) -
Advancing Depression Detection on Social Media Platforms Through Fine-Tuned Large Language Models
by: Shah, Shahid Munir, et al.
Published: (2024) -
Symmetry-Constrained Language-Guided Program Synthesis for Discovering Governing Equations from Noisy and Partial Observations
by: Baig, Mirza Samad Ahmed, et al.
Published: (2026) -
DAUNet: A Lightweight UNet Variant with Deformable Convolutions and Parameter-Free Attention for Medical Image Segmentation
by: Munir, Adnan, et al.
Published: (2025)