:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Batra, Sarthak, Chakrabarti, Partha P., Hadfield, Simon, Mustafa, Armin
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2408.08086
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal
by: Kubiak, Nikolina, et al.
Published: (2024)

RenDetNet: Weakly-supervised Shadow Detection with Shadow Caster Verification
by: Kubiak, Nikolina, et al.
Published: (2024)

FlowDet: Unifying Object Detection and Generative Transport Flows
by: Baty, Enis, et al.
Published: (2025)

ReFrame: Rectification Framework for Image Explaining Architectures
by: Adhikary, Debjyoti Das, et al.
Published: (2025)

SpaGBOL: Spatial-Graph-Based Orientated Localisation
by: Shore, Tavis, et al.
Published: (2024)

Deep Leakage with Generative Flow Matching Denoiser
by: Baglin, Isaac, et al.
Published: (2026)

DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description
by: Deganutti, Adrienne, et al.
Published: (2025)

Realistic Clothed Human and Object Joint Reconstruction from a Single Image
by: Dutta, Ayushi, et al.
Published: (2025)

TACO: Trajectory Aligning Cross-view Optimisation
by: Shore, Tavis, et al.
Published: (2026)

BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation
by: Shore, Tavis, et al.
Published: (2023)

Efficient Audio-Visual Fusion for Video Classification
by: Awan, Mahrukh, et al.
Published: (2024)

Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison
by: Zhong, Jiageng, et al.
Published: (2025)

PEnG: Pose-Enhanced Geo-Localisation
by: Shore, Tavis, et al.
Published: (2024)

HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications
by: Thirgood, Christopher, et al.
Published: (2025)

FeatureSLAM: Feature-enriched 3D gaussian splatting SLAM in real time
by: Thirgood, Christopher, et al.
Published: (2026)

SpooFL: Spoofing Federated Learning
by: Baglin, Isaac, et al.
Published: (2026)

HyperGS: Hyperspectral 3D Gaussian Splatting
by: Thirgood, Christopher, et al.
Published: (2024)

Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks
by: Baty, Enis, et al.
Published: (2024)

VICI: VLM-Instructed Cross-view Image-localisation
by: Zhang, Xiaohan, et al.
Published: (2025)

Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV
by: Spencer, Jaime, et al.
Published: (2024)

An Effective-Efficient Approach for Dense Multi-Label Action Detection
by: Sardari, Faegheh, et al.
Published: (2024)

CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing
by: Sardari, Faegheh, et al.
Published: (2024)

Reframing Dense Action Detection (RefDense): A Paradigm Shift in Problem Solving & a Novel Optimization Strategy
by: Sardari, Faegheh, et al.
Published: (2025)

ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet
by: Cheong, Soon Yau, et al.
Published: (2023)

From Retinal Pixels to Patients: Evolution of Deep Learning Research in Diabetic Retinopathy Screening
by: Chopra, Muskaan, et al.
Published: (2025)

Catalogue Grounded Multimodal Attribution for Museum Video under Resource and Regulatory Constraints
by: Nanang, Minsak, et al.
Published: (2026)

EvtSlowTV -- A Large and Diverse Dataset for Event-Based Depth Estimation
by: Macaulay, Sadiq Layi, et al.
Published: (2025)

Metadata-enhanced contrastive learning from retinal optical coherence tomography images
by: Holland, Robbie, et al.
Published: (2022)

EOPose : Exemplar-based object reposing using Generalized Pose Correspondences
by: Mehrotra, Sarthak, et al.
Published: (2025)

Deep Unrolled Meta-Learning for Multi-Coil and Multi-Modality MRI with Adaptive Optimization
by: Fouladvand, Merham, et al.
Published: (2025)

Robust Assembly Progress Estimation via Deep Metric Learning
by: Miura, Kazuma, et al.
Published: (2026)

CAST: Cross-modal Alignment Similarity Test for Vision Language Models
by: Dagan, Gautier, et al.
Published: (2024)

Unidirectional imaging with partially coherent light
by: Ma, Guangdong, et al.
Published: (2024)

FEDLAD: Federated Evaluation of Deep Leakage Attacks and Defenses
by: Baglin, Isaac, et al.
Published: (2024)

The Devil is in the Details -- From OCR for Old Church Slavonic to Purely Visual Stemma Reconstruction
by: Hoenen, Armin
Published: (2026)

Action-based image editing guided by human instructions
by: Trusca, Maria Mihaela, et al.
Published: (2024)

Learning human-to-robot handovers through 3D scene reconstruction
by: Wu, Yuekun, et al.
Published: (2025)

Symbolic Graph Inference for Compound Scene Understanding
by: Aryan, FNU, et al.
Published: (2024)

Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
by: Tofik, Ali, et al.
Published: (2024)

Scalable Methods for Brick Kiln Detection and Compliance Monitoring from Satellite Imagery: A Deployment Case Study in India
by: Mondal, Rishabh, et al.
Published: (2024)