Saved in:
| Main Authors: | Rai, Avinash, Jana, Sandeep, Vijay, Vishal |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.23171 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Make It Count: Text-to-Image Generation with an Accurate Number of Objects
by: Binyamin, Lital, et al.
Published: (2024)
by: Binyamin, Lital, et al.
Published: (2024)
Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras
by: Lu, Yipeng, et al.
Published: (2024)
by: Lu, Yipeng, et al.
Published: (2024)
Deep Neural Networks for Accurate Depth Estimation with Latent Space Features
by: Yasir, Siddiqui Muhammad, et al.
Published: (2025)
by: Yasir, Siddiqui Muhammad, et al.
Published: (2025)
Fine-Tuning Without Forgetting: Adaptation of YOLOv8 Preserves COCO Performance
by: Gandhi, Vishal, et al.
Published: (2025)
by: Gandhi, Vishal, et al.
Published: (2025)
Personalized Image Generation from an Author Writing Style
by: Gandhi, Sagar, et al.
Published: (2025)
by: Gandhi, Sagar, et al.
Published: (2025)
Pruning the Paradox: How CLIP's Most Informative Heads Enhance Performance While Amplifying Bias
by: Madasu, Avinash, et al.
Published: (2025)
by: Madasu, Avinash, et al.
Published: (2025)
LS-GAN: Human Motion Synthesis with Latent-space GANs
by: Amballa, Avinash, et al.
Published: (2024)
by: Amballa, Avinash, et al.
Published: (2024)
ICSVR: Investigating Compositional and Syntactic Understanding in Video Retrieval Models
by: Madasu, Avinash, et al.
Published: (2023)
by: Madasu, Avinash, et al.
Published: (2023)
CountQA: How Well Do MLLMs Count in the Wild?
by: Tamarapalli, Jayant Sravan, et al.
Published: (2025)
by: Tamarapalli, Jayant Sravan, et al.
Published: (2025)
Bones Can't Be Triangles: Accurate and Efficient Vertebrae Keypoint Estimation through Collaborative Error Revision
by: Kim, Jinhee, et al.
Published: (2024)
by: Kim, Jinhee, et al.
Published: (2024)
FruitProM-V2: Robust Probabilistic Maturity Estimation and Detection of Fruits and Vegetables
by: Cheppally, Rahul Harsha, et al.
Published: (2026)
by: Cheppally, Rahul Harsha, et al.
Published: (2026)
Morphing Through Time: Diffusion-Based Bridging of Temporal Gaps for Robust Alignment in Change Detection
by: Madani, Seyedehanita, et al.
Published: (2025)
by: Madani, Seyedehanita, et al.
Published: (2025)
Your Pre-trained Diffusion Model Secretly Knows Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2026)
by: Rajagopalan, Sudarshan, et al.
Published: (2026)
Vision-based module for accurately reading linear scales in a laboratory
by: Saini, Parvesh, et al.
Published: (2025)
by: Saini, Parvesh, et al.
Published: (2025)
Selective Fine-Tuning for Targeted and Robust Concept Unlearning
by: Mansi, et al.
Published: (2026)
by: Mansi, et al.
Published: (2026)
AgRegNet: A Deep Regression Network for Flower and Fruit Density Estimation, Localization, and Counting in Orchards
by: Bhattarai, Uddhav, et al.
Published: (2024)
by: Bhattarai, Uddhav, et al.
Published: (2024)
CounterCount: A Diagnostic Framework for Counting Bias in Vision Language Models
by: Alzahrani, Reem, et al.
Published: (2026)
by: Alzahrani, Reem, et al.
Published: (2026)
Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors
by: Adjel, Mohamed
Published: (2025)
by: Adjel, Mohamed
Published: (2025)
Equivariant Spherical CNNs for Accurate Fiber Orientation Distribution Estimation in Neonatal Diffusion MRI with Reduced Acquisition Time
by: Snoussi, Haykel, et al.
Published: (2025)
by: Snoussi, Haykel, et al.
Published: (2025)
SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision
by: Rai, Utsav, et al.
Published: (2025)
by: Rai, Utsav, et al.
Published: (2025)
GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations
by: Chen, Boyuan, et al.
Published: (2026)
by: Chen, Boyuan, et al.
Published: (2026)
Transparent Visual Reasoning via Object-Centric Agent Collaboration
by: Teoh, Benjamin, et al.
Published: (2025)
by: Teoh, Benjamin, et al.
Published: (2025)
Quantifying and Enabling the Interpretability of CLIP-like Models
by: Madasu, Avinash, et al.
Published: (2024)
by: Madasu, Avinash, et al.
Published: (2024)
SynthRAR: Ring Artifacts Reduction in CT with Unrolled Network and Synthetic Data Training
by: Yang, Hongxu, et al.
Published: (2026)
by: Yang, Hongxu, et al.
Published: (2026)
MTCNET: Multi-task Learning Paradigm for Crowd Count Estimation
by: Kumar, Abhay, et al.
Published: (2019)
by: Kumar, Abhay, et al.
Published: (2019)
Curriculum for Crowd Counting -- Is it Worthy?
by: Khan, Muhammad Asif, et al.
Published: (2024)
by: Khan, Muhammad Asif, et al.
Published: (2024)
Count What You Want: Exemplar Identification and Few-shot Counting of Human Actions in the Wild
by: Huang, Yifeng, et al.
Published: (2023)
by: Huang, Yifeng, et al.
Published: (2023)
CountFormer: A Transformer Framework for Learning Visual Repetition and Structure in Class-Agnostic Object Counting
by: Hossain, Md Tanvir, et al.
Published: (2025)
by: Hossain, Md Tanvir, et al.
Published: (2025)
Your Vision-Language Model Can't Even Count to 20: Exposing the Failures of VLMs in Compositional Counting
by: Guo, Xuyang, et al.
Published: (2025)
by: Guo, Xuyang, et al.
Published: (2025)
DiffRegCD: Integrated Registration and Change Detection with Diffusion Features
by: Madani, Seyedehanita, et al.
Published: (2025)
by: Madani, Seyedehanita, et al.
Published: (2025)
RestoreVAR: Visual Autoregressive Generation for All-in-One Image Restoration
by: Rajagopalan, Sudarshan, et al.
Published: (2025)
by: Rajagopalan, Sudarshan, et al.
Published: (2025)
Open-World Object Counting in Videos
by: Amini-Naieni, Niki, et al.
Published: (2025)
by: Amini-Naieni, Niki, et al.
Published: (2025)
NEURO-GUARD: Neuro-Symbolic Generalization and Unbiased Adaptive Routing for Diagnostics -- Explainable Medical AI
by: Urooj, Midhat, et al.
Published: (2025)
by: Urooj, Midhat, et al.
Published: (2025)
CAMBench-QR : A Structure-Aware Benchmark for Post-Hoc Explanations with QR Understanding
by: Chakraborty, Ritabrata, et al.
Published: (2025)
by: Chakraborty, Ritabrata, et al.
Published: (2025)
XAI-MeD: Explainable Knowledge Guided Neuro-Symbolic Framework for Domain Generalization and Rare Class Detection in Medical Imaging
by: Urooj, Midhat, et al.
Published: (2026)
by: Urooj, Midhat, et al.
Published: (2026)
Fast & Efficient Normalizing Flows and Applications of Image Generative Models
by: Nagar, Sandeep
Published: (2025)
by: Nagar, Sandeep
Published: (2025)
Accurate and Fast Compressed Video Captioning
by: Shen, Yaojie, et al.
Published: (2023)
by: Shen, Yaojie, et al.
Published: (2023)
Can Deep Learning Trigger Alerts from Mobile-Captured Images?
by: Sarkar, Pritisha, et al.
Published: (2025)
by: Sarkar, Pritisha, et al.
Published: (2025)
Bound Tightening Network for Robust Crowd Counting
by: Wu, Qiming
Published: (2024)
by: Wu, Qiming
Published: (2024)
Privacy Preserving Ordinal-Meta Learning with VLMs for Fine-Grained Fruit Quality Prediction
by: Jain, Riddhi, et al.
Published: (2025)
by: Jain, Riddhi, et al.
Published: (2025)
Similar Items
-
Make It Count: Text-to-Image Generation with an Accurate Number of Objects
by: Binyamin, Lital, et al.
Published: (2024) -
Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras
by: Lu, Yipeng, et al.
Published: (2024) -
Deep Neural Networks for Accurate Depth Estimation with Latent Space Features
by: Yasir, Siddiqui Muhammad, et al.
Published: (2025) -
Fine-Tuning Without Forgetting: Adaptation of YOLOv8 Preserves COCO Performance
by: Gandhi, Vishal, et al.
Published: (2025) -
Personalized Image Generation from an Author Writing Style
by: Gandhi, Sagar, et al.
Published: (2025)