Saved in:
| Main Authors: | Batra, Sarthak, Chakrabarti, Partha P., Hadfield, Simon, Mustafa, Armin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.08086 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal
by: Kubiak, Nikolina, et al.
Published: (2024)
by: Kubiak, Nikolina, et al.
Published: (2024)
RenDetNet: Weakly-supervised Shadow Detection with Shadow Caster Verification
by: Kubiak, Nikolina, et al.
Published: (2024)
by: Kubiak, Nikolina, et al.
Published: (2024)
FlowDet: Unifying Object Detection and Generative Transport Flows
by: Baty, Enis, et al.
Published: (2025)
by: Baty, Enis, et al.
Published: (2025)
ReFrame: Rectification Framework for Image Explaining Architectures
by: Adhikary, Debjyoti Das, et al.
Published: (2025)
by: Adhikary, Debjyoti Das, et al.
Published: (2025)
SpaGBOL: Spatial-Graph-Based Orientated Localisation
by: Shore, Tavis, et al.
Published: (2024)
by: Shore, Tavis, et al.
Published: (2024)
Deep Leakage with Generative Flow Matching Denoiser
by: Baglin, Isaac, et al.
Published: (2026)
by: Baglin, Isaac, et al.
Published: (2026)
DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description
by: Deganutti, Adrienne, et al.
Published: (2025)
by: Deganutti, Adrienne, et al.
Published: (2025)
Realistic Clothed Human and Object Joint Reconstruction from a Single Image
by: Dutta, Ayushi, et al.
Published: (2025)
by: Dutta, Ayushi, et al.
Published: (2025)
TACO: Trajectory Aligning Cross-view Optimisation
by: Shore, Tavis, et al.
Published: (2026)
by: Shore, Tavis, et al.
Published: (2026)
BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation
by: Shore, Tavis, et al.
Published: (2023)
by: Shore, Tavis, et al.
Published: (2023)
Efficient Audio-Visual Fusion for Video Classification
by: Awan, Mahrukh, et al.
Published: (2024)
by: Awan, Mahrukh, et al.
Published: (2024)
Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison
by: Zhong, Jiageng, et al.
Published: (2025)
by: Zhong, Jiageng, et al.
Published: (2025)
PEnG: Pose-Enhanced Geo-Localisation
by: Shore, Tavis, et al.
Published: (2024)
by: Shore, Tavis, et al.
Published: (2024)
HYDRA: HYbrid knowledge Distillation and spectral Reconstruction Algorithm for high channel hyperspectral camera applications
by: Thirgood, Christopher, et al.
Published: (2025)
by: Thirgood, Christopher, et al.
Published: (2025)
FeatureSLAM: Feature-enriched 3D gaussian splatting SLAM in real time
by: Thirgood, Christopher, et al.
Published: (2026)
by: Thirgood, Christopher, et al.
Published: (2026)
SpooFL: Spoofing Federated Learning
by: Baglin, Isaac, et al.
Published: (2026)
by: Baglin, Isaac, et al.
Published: (2026)
HyperGS: Hyperspectral 3D Gaussian Splatting
by: Thirgood, Christopher, et al.
Published: (2024)
by: Thirgood, Christopher, et al.
Published: (2024)
Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks
by: Baty, Enis, et al.
Published: (2024)
by: Baty, Enis, et al.
Published: (2024)
VICI: VLM-Instructed Cross-view Image-localisation
by: Zhang, Xiaohan, et al.
Published: (2025)
by: Zhang, Xiaohan, et al.
Published: (2025)
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV
by: Spencer, Jaime, et al.
Published: (2024)
by: Spencer, Jaime, et al.
Published: (2024)
An Effective-Efficient Approach for Dense Multi-Label Action Detection
by: Sardari, Faegheh, et al.
Published: (2024)
by: Sardari, Faegheh, et al.
Published: (2024)
CoLeaF: A Contrastive-Collaborative Learning Framework for Weakly Supervised Audio-Visual Video Parsing
by: Sardari, Faegheh, et al.
Published: (2024)
by: Sardari, Faegheh, et al.
Published: (2024)
Reframing Dense Action Detection (RefDense): A Paradigm Shift in Problem Solving & a Novel Optimization Strategy
by: Sardari, Faegheh, et al.
Published: (2025)
by: Sardari, Faegheh, et al.
Published: (2025)
ViscoNet: Bridging and Harmonizing Visual and Textual Conditioning for ControlNet
by: Cheong, Soon Yau, et al.
Published: (2023)
by: Cheong, Soon Yau, et al.
Published: (2023)
From Retinal Pixels to Patients: Evolution of Deep Learning Research in Diabetic Retinopathy Screening
by: Chopra, Muskaan, et al.
Published: (2025)
by: Chopra, Muskaan, et al.
Published: (2025)
Catalogue Grounded Multimodal Attribution for Museum Video under Resource and Regulatory Constraints
by: Nanang, Minsak, et al.
Published: (2026)
by: Nanang, Minsak, et al.
Published: (2026)
EvtSlowTV -- A Large and Diverse Dataset for Event-Based Depth Estimation
by: Macaulay, Sadiq Layi, et al.
Published: (2025)
by: Macaulay, Sadiq Layi, et al.
Published: (2025)
Metadata-enhanced contrastive learning from retinal optical coherence tomography images
by: Holland, Robbie, et al.
Published: (2022)
by: Holland, Robbie, et al.
Published: (2022)
EOPose : Exemplar-based object reposing using Generalized Pose Correspondences
by: Mehrotra, Sarthak, et al.
Published: (2025)
by: Mehrotra, Sarthak, et al.
Published: (2025)
Deep Unrolled Meta-Learning for Multi-Coil and Multi-Modality MRI with Adaptive Optimization
by: Fouladvand, Merham, et al.
Published: (2025)
by: Fouladvand, Merham, et al.
Published: (2025)
Robust Assembly Progress Estimation via Deep Metric Learning
by: Miura, Kazuma, et al.
Published: (2026)
by: Miura, Kazuma, et al.
Published: (2026)
CAST: Cross-modal Alignment Similarity Test for Vision Language Models
by: Dagan, Gautier, et al.
Published: (2024)
by: Dagan, Gautier, et al.
Published: (2024)
Unidirectional imaging with partially coherent light
by: Ma, Guangdong, et al.
Published: (2024)
by: Ma, Guangdong, et al.
Published: (2024)
FEDLAD: Federated Evaluation of Deep Leakage Attacks and Defenses
by: Baglin, Isaac, et al.
Published: (2024)
by: Baglin, Isaac, et al.
Published: (2024)
The Devil is in the Details -- From OCR for Old Church Slavonic to Purely Visual Stemma Reconstruction
by: Hoenen, Armin
Published: (2026)
by: Hoenen, Armin
Published: (2026)
Action-based image editing guided by human instructions
by: Trusca, Maria Mihaela, et al.
Published: (2024)
by: Trusca, Maria Mihaela, et al.
Published: (2024)
Learning human-to-robot handovers through 3D scene reconstruction
by: Wu, Yuekun, et al.
Published: (2025)
by: Wu, Yuekun, et al.
Published: (2025)
Symbolic Graph Inference for Compound Scene Understanding
by: Aryan, FNU, et al.
Published: (2024)
by: Aryan, FNU, et al.
Published: (2024)
Enhancing Small Object Encoding in Deep Neural Networks: Introducing Fast&Focused-Net with Volume-wise Dot Product Layer
by: Tofik, Ali, et al.
Published: (2024)
by: Tofik, Ali, et al.
Published: (2024)
Scalable Methods for Brick Kiln Detection and Compliance Monitoring from Satellite Imagery: A Deployment Case Study in India
by: Mondal, Rishabh, et al.
Published: (2024)
by: Mondal, Rishabh, et al.
Published: (2024)
Similar Items
-
S3R-Net: A Single-Stage Approach to Self-Supervised Shadow Removal
by: Kubiak, Nikolina, et al.
Published: (2024) -
RenDetNet: Weakly-supervised Shadow Detection with Shadow Caster Verification
by: Kubiak, Nikolina, et al.
Published: (2024) -
FlowDet: Unifying Object Detection and Generative Transport Flows
by: Baty, Enis, et al.
Published: (2025) -
ReFrame: Rectification Framework for Image Explaining Architectures
by: Adhikary, Debjyoti Das, et al.
Published: (2025) -
SpaGBOL: Spatial-Graph-Based Orientated Localisation
by: Shore, Tavis, et al.
Published: (2024)