:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kumar, Ashutosh, Chadha, Aman
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2502.02027
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation
by: Chopra, Shivang, et al.
Published: (2023)

Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
by: Lakhanpal, Sanyam, et al.
Published: (2024)

Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
by: Sahoo, Pranab, et al.
Published: (2024)

Into the Fog: Evaluating Robustness of Multiple Object Tracking
by: Kirillova, Nadezda, et al.
Published: (2024)

High-quality Image Dehazing with Diffusion Model
by: Yu, Hu, et al.
Published: (2023)

StableI2I: Spotting Unintended Changes in Image-to-Image Transition
by: Li, Jiayang, et al.
Published: (2026)

Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types
by: Sinha, Neelabh, et al.
Published: (2024)

Fine-Tuning Adversarially-Robust Transformers for Single-Image Dehazing
by: Vasilescu, Vlad, et al.
Published: (2025)

Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification
by: Constantinou, Christos, et al.
Published: (2024)

Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation
by: Chopra, Shivang, et al.
Published: (2024)

Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions
by: Ghosh, Akash, et al.
Published: (2024)

Detecting Object Tracking Failure via Sequential Hypothesis Testing
by: Muñoz, Alejandro Monroy, et al.
Published: (2026)

A Data Efficiency Study of Synthetic Fog for Object Detection Using the Clear2Fog Pipeline
by: Mohamed, Mohamed Ahmed, et al.
Published: (2026)

FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation
by: Velioglu, Riza, et al.
Published: (2024)

Feature Fusion Attention Network with CycleGAN for Image Dehazing, De-Snowing and De-Raining
by: Jain, Akshat
Published: (2025)

CLIP-DQA: Blindly Evaluating Dehazed Images from Global and Local Perspectives Using CLIP
by: Zeng, Yirui, et al.
Published: (2025)

TSNet:A Two-stage Network for Image Dehazing with Multi-scale Fusion and Adaptive Learning
by: Gong, Xiaolin, et al.
Published: (2024)

PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing
by: Chen, Yutong, et al.
Published: (2024)

A Study of Failure Modes in Two-Stage Human-Object Interaction Detection
by: Wang, Lemeng, et al.
Published: (2026)

The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks
by: Barman, Niyar R, et al.
Published: (2024)

How Culturally Aware are Vision-Language Models?
by: Burda-Lassen, Olena, et al.
Published: (2024)

ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models
by: Yin, Hao, et al.
Published: (2025)

Qualitative Failures of Image Generation Models and Their Application in Detecting Deepfakes
by: Borji, Ali
Published: (2023)

Temporal Object-Aware Vision Transformer for Few-Shot Video Object Detection
by: Kumar, Yogesh, et al.
Published: (2025)

Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation
by: Galshetwar, Vijay M., et al.
Published: (2025)

Rice-VL: Evaluating Vision-Language Models for Cultural Understanding Across ASEAN Countries
by: Pranav, Tushar, et al.
Published: (2025)

The Visual Counter Turing Test (VCT2): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (VAI)
by: Imanpour, Nasrin, et al.
Published: (2024)

A Comprehensive Dataset for Human vs. AI Generated Image Detection
by: Roy, Rajarshi, et al.
Published: (2026)

MoonMetaSync: Lunar Image Registration Analysis
by: Kumar, Ashutosh, et al.
Published: (2024)

Spatial-Frequency Aware for Object Detection in RAW Image
by: Ye, Zhuohua, et al.
Published: (2025)

Improving the Detection of Small Oriented Objects in Aerial Images
by: Doloriel, Chandler Timm C., et al.
Published: (2024)

ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models
by: Rawte, Vipula, et al.
Published: (2024)

HazeMatching: Dehazing Light Microscopy Images with Guided Conditional Flow Matching
by: Ray, Anirban, et al.
Published: (2025)

Accelerating Object Detection with YOLOv4 for Real-Time Applications
by: Kumar, K. Senthil, et al.
Published: (2024)

Density Adaptive Attention is All You Need: Robust Parameter-Efficient Fine-Tuning Across Multiple Modalities
by: Ioannides, Georgios, et al.
Published: (2024)

Object Detection Approaches to Identifying Hand Images with High Forensic Values
by: Nguyen, Thanh Thi, et al.
Published: (2024)

OrthoAI v2: From Single-Agent Segmentation to Dual-Agent Treatment Planning for Clear Aligners
by: Edouard, Lansiaux, et al.
Published: (2026)

Multi-Object Tracking based on Imaging Radar 3D Object Detection
by: Palmer, Patrick, et al.
Published: (2024)

STResNet & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs
by: Sah, Sudhakar, et al.
Published: (2026)

The Evolution of Multimodal Model Architectures
by: Wadekar, Shakti N., et al.
Published: (2024)