Saved in:
| Main Author: | Bingham, Joseph |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.19562 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FALCON: Few-Shot Adversarial Learning for Cross-Domain Medical Image Segmentation
by: Fayjie, Abdur R., et al.
Published: (2026)
by: Fayjie, Abdur R., et al.
Published: (2026)
Structures Meet Semantics: Multimodal Fusion via Graph Contrastive Learning
by: Sun, Jiangfeng, et al.
Published: (2025)
by: Sun, Jiangfeng, et al.
Published: (2025)
Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis
by: Lewis, Dylan B., et al.
Published: (2026)
by: Lewis, Dylan B., et al.
Published: (2026)
nuScenes Knowledge Graph -- A comprehensive semantic representation of traffic scenes for trajectory prediction
by: Mlodzian, Leon, et al.
Published: (2023)
by: Mlodzian, Leon, et al.
Published: (2023)
CUBIC: Concept Embeddings for Unsupervised Bias Identification using VLMs
by: Méndez, David, et al.
Published: (2025)
by: Méndez, David, et al.
Published: (2025)
KGTN-ens: Few-Shot Image Classification with Knowledge Graph Ensembles
by: Filipiak, Dominik, et al.
Published: (2022)
by: Filipiak, Dominik, et al.
Published: (2022)
Perceptual Flow Network for Visually Grounded Reasoning
by: Li, Yangfu, et al.
Published: (2026)
by: Li, Yangfu, et al.
Published: (2026)
How to train your VAE
by: Rivera, Mariano
Published: (2023)
by: Rivera, Mariano
Published: (2023)
RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models
by: Ge, Junyao, et al.
Published: (2024)
by: Ge, Junyao, et al.
Published: (2024)
Vertical Federated Image Segmentation
by: Mandal, Paul K., et al.
Published: (2024)
by: Mandal, Paul K., et al.
Published: (2024)
Horizontal Federated Computer Vision
by: Mandal, Paul K., et al.
Published: (2023)
by: Mandal, Paul K., et al.
Published: (2023)
FedHypeVAE: Federated Learning with Hypernetwork Generated Conditional VAEs for Differentially Private Embedding Sharing
by: Gupta, Sunny, et al.
Published: (2026)
by: Gupta, Sunny, et al.
Published: (2026)
Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments
by: Leiva, Mario, et al.
Published: (2025)
by: Leiva, Mario, et al.
Published: (2025)
SynCo: Synthetic Hard Negatives for Contrastive Visual Representation Learning
by: Giakoumoglou, Nikolaos, et al.
Published: (2024)
by: Giakoumoglou, Nikolaos, et al.
Published: (2024)
VHAKG: A Multi-modal Knowledge Graph Based on Synchronized Multi-view Videos of Daily Activities
by: Egami, Shusaku, et al.
Published: (2024)
by: Egami, Shusaku, et al.
Published: (2024)
Enhancing Sports Strategy with Video Analytics and Data Mining: Assessing the effectiveness of Multimodal LLMs in tennis video analysis
by: Teo, Charlton
Published: (2025)
by: Teo, Charlton
Published: (2025)
Enhancing Cross-Modal Contextual Congruence for Crowdfunding Success using Knowledge-infused Learning
by: Padhi, Trilok, et al.
Published: (2024)
by: Padhi, Trilok, et al.
Published: (2024)
Gated Recursive Fusion: A Stateful Approach to Scalable Multimodal Transformers
by: Shihata, Yusuf
Published: (2025)
by: Shihata, Yusuf
Published: (2025)
Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
by: Hansen-Estruch, Philippe, et al.
Published: (2025)
by: Hansen-Estruch, Philippe, et al.
Published: (2025)
GIST: Multimodal Knowledge Extraction and Spatial Grounding via Intelligent Semantic Topology
by: Agrawal, Shivendra, et al.
Published: (2026)
by: Agrawal, Shivendra, et al.
Published: (2026)
CC-SGG: Corner Case Scenario Generation using Learned Scene Graphs
by: Drayson, George, et al.
Published: (2023)
by: Drayson, George, et al.
Published: (2023)
U-SEG: Uncertainty in SEGmentation -- A systematic multi-variable exploration
by: Smith, Michael, et al.
Published: (2026)
by: Smith, Michael, et al.
Published: (2026)
Safeguarding Vision-Language Models Against Patched Visual Prompt Injectors
by: Sun, Jiachen, et al.
Published: (2024)
by: Sun, Jiachen, et al.
Published: (2024)
FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
by: Ma, Jie, et al.
Published: (2025)
by: Ma, Jie, et al.
Published: (2025)
Perceptual Influence: Improving the Perceptual Loss Design for Low-Dose CT Enhancement
by: Viana, Gabriel A., et al.
Published: (2025)
by: Viana, Gabriel A., et al.
Published: (2025)
Edge-Enabled Collaborative Object Detection for Real-Time Multi-Vehicle Perception
by: Richards, Everett, et al.
Published: (2025)
by: Richards, Everett, et al.
Published: (2025)
Content Adaptive based Motion Alignment Framework for Learned Video Compression
by: Zhang, Tiange, et al.
Published: (2025)
by: Zhang, Tiange, et al.
Published: (2025)
Hilbert-Geo: Solving Solid Geometric Problems by Neural-Symbolic Reasoning
by: Xu, Ruoran, et al.
Published: (2026)
by: Xu, Ruoran, et al.
Published: (2026)
Deterministic Event-Graph Substrates as World Models for Counterfactual Reasoning
by: Rovai, Fabio
Published: (2026)
by: Rovai, Fabio
Published: (2026)
Guide-Guard: Off-Target Predicting in CRISPR Applications
by: Bingham, Joseph, et al.
Published: (2026)
by: Bingham, Joseph, et al.
Published: (2026)
PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation
by: Pan, Ting, et al.
Published: (2025)
by: Pan, Ting, et al.
Published: (2025)
Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations
by: Hütten, Nils, et al.
Published: (2025)
by: Hütten, Nils, et al.
Published: (2025)
From Latent to Engine Manifolds: Analyzing ImageBind's Multimodal Embedding Space
by: Hamara, Andrew, et al.
Published: (2024)
by: Hamara, Andrew, et al.
Published: (2024)
GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics
by: Zhang, Yan, et al.
Published: (2026)
by: Zhang, Yan, et al.
Published: (2026)
Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis
by: Umeike, Robinson, et al.
Published: (2025)
by: Umeike, Robinson, et al.
Published: (2025)
A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors
by: Doan, Gia-Bao, et al.
Published: (2026)
by: Doan, Gia-Bao, et al.
Published: (2026)
Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners
by: Narasinghe, N. K. B. M. P. K. B., et al.
Published: (2025)
by: Narasinghe, N. K. B. M. P. K. B., et al.
Published: (2025)
StoryMovie: A Dataset for Semantic Alignment of Visual Stories with Movie Scripts and Subtitles
by: Oliveira, Daniel, et al.
Published: (2026)
by: Oliveira, Daniel, et al.
Published: (2026)
A Two-Stage, Object-Centric Deep Learning Framework for Robust Exam Cheating Detection
by: Le, Van-Truong, et al.
Published: (2026)
by: Le, Van-Truong, et al.
Published: (2026)
A Hybrid Deep Learning and Model-Checking Framework for Accurate Brain Tumor Detection and Validation
by: Elfatimi, Elhoucine, et al.
Published: (2024)
by: Elfatimi, Elhoucine, et al.
Published: (2024)
Similar Items
-
FALCON: Few-Shot Adversarial Learning for Cross-Domain Medical Image Segmentation
by: Fayjie, Abdur R., et al.
Published: (2026) -
Structures Meet Semantics: Multimodal Fusion via Graph Contrastive Learning
by: Sun, Jiangfeng, et al.
Published: (2025) -
Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis
by: Lewis, Dylan B., et al.
Published: (2026) -
nuScenes Knowledge Graph -- A comprehensive semantic representation of traffic scenes for trajectory prediction
by: Mlodzian, Leon, et al.
Published: (2023) -
CUBIC: Concept Embeddings for Unsupervised Bias Identification using VLMs
by: Méndez, David, et al.
Published: (2025)