:: Library Catalog

Bibliographic Details
Main Authors:	Rai, Avinash; Jana, Sandeep; Vijay, Vishal
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.23171

Similar Items

Make It Count: Text-to-Image Generation with an Accurate Number of Objects
by: Binyamin, Lital, et al.
Published: (2024)

Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras
by: Lu, Yipeng, et al.
Published: (2024)

Deep Neural Networks for Accurate Depth Estimation with Latent Space Features
by: Yasir, Siddiqui Muhammad, et al.
Published: (2025)

Fine-Tuning Without Forgetting: Adaptation of YOLOv8 Preserves COCO Performance
by: Gandhi, Vishal, et al.
Published: (2025)

Personalized Image Generation from an Author Writing Style
by: Gandhi, Sagar, et al.
Published: (2025)

Pruning the Paradox: How CLIP's Most Informative Heads Enhance Performance While Amplifying Bias
by: Madasu, Avinash, et al.
Published: (2025)

LS-GAN: Human Motion Synthesis with Latent-space GANs
by: Amballa, Avinash, et al.
Published: (2024)

ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models
by: Madasu, Avinash, et al.
Published: (2023)

CountQA: How Well Do MLLMs Count in the Wild?
by: Tamarapalli, Jayant Sravan, et al.
Published: (2025)

Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision
by: Kim, Jinhee, et al.
Published: (2024)

FruitProM-V2: Robust Probabilistic Maturity Estimation and Detection of Fruits and Vegetables
by: Cheppally, Rahul Harsha, et al.
Published: (2026)

Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection
by: Madani, Seyedehanita, et al.
Published: (2025)

Your Pre-trained Diffusion Model Secretly Knows Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2026)

Vision-based module for accurately reading linear scales in a laboratory
by: Saini, Parvesh, et al.
Published: (2025)

Selective Fine-Tuning for Targeted and Robust Concept Unlearning
by: Mansi, et al.
Published: (2026)

AgRegNet: A Deep Regression Network for Flower and Fruit Density Estimation, Localization, and Counting in Orchards
by: Bhattarai, Uddhav, et al.
Published: (2024)

CounterCount: A Diagnostic Framework for Counting Bias in Vision Language Models
by: Alzahrani, Reem, et al.
Published: (2026)

Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors
by: Adjel, Mohamed
Published: (2025)

Equivariant Spherical CNNs for Accurate Fiber Orientation Distribution Estimation in Neonatal Diffusion MRI with Reduced Acquisition Time
by: Snoussi, Haykel, et al.
Published: (2025)

SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision
by: Rai, Utsav, et al.
Published: (2025)

GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations
by: Chen, Boyuan, et al.
Published: (2026)

Transparent Visual Reasoning via Object-Centric Agent Collaboration
by: Teoh, Benjamin, et al.
Published: (2025)

Quantifying and Enabling the Interpretability of CLIP-like Models
by: Madasu, Avinash, et al.
Published: (2024)

SynthRAR: Ring Artifacts Reduction in CT with Unrolled Network and Synthetic Data Training
by: Yang, Hongxu, et al.
Published: (2026)

MTCNET: Multi-task Learning Paradigm for Crowd Count Estimation
by: Kumar, Abhay, et al.
Published: (2019)

Curriculum for Crowd Counting -- Is it Worthy?
by: Khan, Muhammad Asif, et al.
Published: (2024)

Count What You Want: Exemplar Identification and Few-shot Counting of Human Actions in the Wild
by: Huang, Yifeng, et al.
Published: (2023)

CountFormer: A Transformer Framework for Learning Visual Repetition and Structure in Class-Agnostic Object Counting
by: Hossain, Md Tanvir, et al.
Published: (2025)

Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
by: Guo, Xuyang, et al.
Published: (2025)

DiffRegCD: Integrated Registration and Change Detection with Diffusion Features
by: Madani, Seyedehanita, et al.
Published: (2025)

RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2025)

Open-World Object Counting in Videos
by: Amini-Naieni, Niki, et al.
Published: (2025)

NEURO-GUARD: Neuro-Symbolic Generalization and Unbiased Adaptive Routing for Diagnostics -- Explainable Medical AI
by: Urooj, Midhat, et al.
Published: (2025)

CAMBench-QR : A Structure-Aware Benchmark for Post-Hoc Explanations with QR Understanding
by: Chakraborty, Ritabrata, et al.
Published: (2025)

XAI-MeD: Explainable Knowledge Guided Neuro-Symbolic Framework for Domain Generalization and Rare Class Detection in Medical Imaging
by: Urooj, Midhat, et al.
Published: (2026)

Fast & Efficient Normalizing Flows and Applications of Image Generative Models
by: Nagar, Sandeep
Published: (2025)

Accurate and Fast Compressed Video Captioning
by: Shen, Yaojie, et al.
Published: (2023)

Can Deep Learning Trigger Alerts from Mobile-Captured Images?
by: Sarkar, Pritisha, et al.
Published: (2025)

Bound Tightening Network for Robust Crowd Counting
by: Wu, Qiming
Published: (2024)

Privacy Preserving Ordinal-Meta Learning with VLMs for Fine-Grained Fruit Quality Prediction
by: Jain, Riddhi, et al.
Published: (2025)