Saved in:
| Main Authors: | Kumar, Ashutosh, Chadha, Aman |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.02027 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation
by: Chopra, Shivang, et al.
Published: (2023)
by: Chopra, Shivang, et al.
Published: (2023)
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
by: Lakhanpal, Sanyam, et al.
Published: (2024)
by: Lakhanpal, Sanyam, et al.
Published: (2024)
Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
by: Sahoo, Pranab, et al.
Published: (2024)
by: Sahoo, Pranab, et al.
Published: (2024)
Into the Fog: Evaluating Robustness of Multiple Object Tracking
by: Kirillova, Nadezda, et al.
Published: (2024)
by: Kirillova, Nadezda, et al.
Published: (2024)
High-quality Image Dehazing with Diffusion Model
by: Yu, Hu, et al.
Published: (2023)
by: Yu, Hu, et al.
Published: (2023)
StableI2I: Spotting Unintended Changes in Image-to-Image Transition
by: Li, Jiayang, et al.
Published: (2026)
by: Li, Jiayang, et al.
Published: (2026)
Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types
by: Sinha, Neelabh, et al.
Published: (2024)
by: Sinha, Neelabh, et al.
Published: (2024)
Fine-Tuning Adversarially-Robust Transformers for Single-Image Dehazing
by: Vasilescu, Vlad, et al.
Published: (2025)
by: Vasilescu, Vlad, et al.
Published: (2025)
Out-of-Distribution Detection with Attention Head Masking for Multimodal Document Classification
by: Constantinou, Christos, et al.
Published: (2024)
by: Constantinou, Christos, et al.
Published: (2024)
Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation
by: Chopra, Shivang, et al.
Published: (2024)
by: Chopra, Shivang, et al.
Published: (2024)
Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions
by: Ghosh, Akash, et al.
Published: (2024)
by: Ghosh, Akash, et al.
Published: (2024)
Detecting Object Tracking Failure via Sequential Hypothesis Testing
by: Muñoz, Alejandro Monroy, et al.
Published: (2026)
by: Muñoz, Alejandro Monroy, et al.
Published: (2026)
A Data Efficiency Study of Synthetic Fog for Object Detection Using the Clear2Fog Pipeline
by: Mohamed, Mohamed Ahmed, et al.
Published: (2026)
by: Mohamed, Mohamed Ahmed, et al.
Published: (2026)
FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation
by: Velioglu, Riza, et al.
Published: (2024)
by: Velioglu, Riza, et al.
Published: (2024)
Feature Fusion Attention Network with CycleGAN for Image Dehazing, De-Snowing and De-Raining
by: Jain, Akshat
Published: (2025)
by: Jain, Akshat
Published: (2025)
CLIP-DQA: Blindly Evaluating Dehazed Images from Global and Local Perspectives Using CLIP
by: Zeng, Yirui, et al.
Published: (2025)
by: Zeng, Yirui, et al.
Published: (2025)
TSNet:A Two-stage Network for Image Dehazing with Multi-scale Fusion and Adaptive Learning
by: Gong, Xiaolin, et al.
Published: (2024)
by: Gong, Xiaolin, et al.
Published: (2024)
PriorNet: A Novel Lightweight Network with Multidimensional Interactive Attention for Efficient Image Dehazing
by: Chen, Yutong, et al.
Published: (2024)
by: Chen, Yutong, et al.
Published: (2024)
A Study of Failure Modes in Two-Stage Human-Object Interaction Detection
by: Wang, Lemeng, et al.
Published: (2026)
by: Wang, Lemeng, et al.
Published: (2026)
The Brittleness of AI-Generated Image Watermarking Techniques: Examining Their Robustness Against Visual Paraphrasing Attacks
by: Barman, Niyar R, et al.
Published: (2024)
by: Barman, Niyar R, et al.
Published: (2024)
How Culturally Aware are Vision-Language Models?
by: Burda-Lassen, Olena, et al.
Published: (2024)
by: Burda-Lassen, Olena, et al.
Published: (2024)
ClearSight: Visual Signal Enhancement for Object Hallucination Mitigation in Multimodal Large language Models
by: Yin, Hao, et al.
Published: (2025)
by: Yin, Hao, et al.
Published: (2025)
Qualitative Failures of Image Generation Models and Their Application in Detecting Deepfakes
by: Borji, Ali
Published: (2023)
by: Borji, Ali
Published: (2023)
Temporal Object-Aware Vision Transformer for Few-Shot Video Object Detection
by: Kumar, Yogesh, et al.
Published: (2025)
by: Kumar, Yogesh, et al.
Published: (2025)
Clear Roads, Clear Vision: Advancements in Multi-Weather Restoration for Smart Transportation
by: Galshetwar, Vijay M., et al.
Published: (2025)
by: Galshetwar, Vijay M., et al.
Published: (2025)
Rice-VL: Evaluating Vision-Language Models for Cultural Understanding Across ASEAN Countries
by: Pranav, Tushar, et al.
Published: (2025)
by: Pranav, Tushar, et al.
Published: (2025)
The Visual Counter Turing Test (VCT2): A Benchmark for Evaluating AI-Generated Image Detection and the Visual AI Index (VAI)
by: Imanpour, Nasrin, et al.
Published: (2024)
by: Imanpour, Nasrin, et al.
Published: (2024)
A Comprehensive Dataset for Human vs. AI Generated Image Detection
by: Roy, Rajarshi, et al.
Published: (2026)
by: Roy, Rajarshi, et al.
Published: (2026)
MoonMetaSync: Lunar Image Registration Analysis
by: Kumar, Ashutosh, et al.
Published: (2024)
by: Kumar, Ashutosh, et al.
Published: (2024)
Spatial-Frequency Aware for Object Detection in RAW Image
by: Ye, Zhuohua, et al.
Published: (2025)
by: Ye, Zhuohua, et al.
Published: (2025)
Improving the Detection of Small Oriented Objects in Aerial Images
by: Doloriel, Chandler Timm C., et al.
Published: (2024)
by: Doloriel, Chandler Timm C., et al.
Published: (2024)
ViBe: A Text-to-Video Benchmark for Evaluating Hallucination in Large Multimodal Models
by: Rawte, Vipula, et al.
Published: (2024)
by: Rawte, Vipula, et al.
Published: (2024)
HazeMatching: Dehazing Light Microscopy Images with Guided Conditional Flow Matching
by: Ray, Anirban, et al.
Published: (2025)
by: Ray, Anirban, et al.
Published: (2025)
Accelerating Object Detection with YOLOv4 for Real-Time Applications
by: Kumar, K. Senthil, et al.
Published: (2024)
by: Kumar, K. Senthil, et al.
Published: (2024)
Density Adaptive Attention is All You Need: Robust Parameter-Efficient Fine-Tuning Across Multiple Modalities
by: Ioannides, Georgios, et al.
Published: (2024)
by: Ioannides, Georgios, et al.
Published: (2024)
Object Detection Approaches to Identifying Hand Images with High Forensic Values
by: Nguyen, Thanh Thi, et al.
Published: (2024)
by: Nguyen, Thanh Thi, et al.
Published: (2024)
OrthoAI v2: From Single-Agent Segmentation to Dual-Agent Treatment Planning for Clear Aligners
by: Edouard, Lansiaux, et al.
Published: (2026)
by: Edouard, Lansiaux, et al.
Published: (2026)
Multi-Object Tracking based on Imaging Radar 3D Object Detection
by: Palmer, Patrick, et al.
Published: (2024)
by: Palmer, Patrick, et al.
Published: (2024)
STResNet & STYOLO : A New Family of Compact Classification and Object Detection Models for MCUs
by: Sah, Sudhakar, et al.
Published: (2026)
by: Sah, Sudhakar, et al.
Published: (2026)
The Evolution of Multimodal Model Architectures
by: Wadekar, Shakti N., et al.
Published: (2024)
by: Wadekar, Shakti N., et al.
Published: (2024)
Similar Items
-
Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation
by: Chopra, Shivang, et al.
Published: (2023) -
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
by: Lakhanpal, Sanyam, et al.
Published: (2024) -
Enhancing Adverse Drug Event Detection with Multimodal Dataset: Corpus Creation and Model Development
by: Sahoo, Pranab, et al.
Published: (2024) -
Into the Fog: Evaluating Robustness of Multiple Object Tracking
by: Kirillova, Nadezda, et al.
Published: (2024) -
High-quality Image Dehazing with Diffusion Model
by: Yu, Hu, et al.
Published: (2023)