:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Bingham, Joseph
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence Computer Vision and Pattern Recognition I.2.4
Online Access:	https://arxiv.org/abs/2602.19562
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FALCON: Few-Shot Adversarial Learning for Cross-Domain Medical Image Segmentation
by: Fayjie, Abdur R., et al.
Published: (2026)

Structures Meet Semantics: Multimodal Fusion via Graph Contrastive Learning
by: Sun, Jiangfeng, et al.
Published: (2025)

Representation Selection via Cross-Model Agreement using Canonical Correlation Analysis
by: Lewis, Dylan B., et al.
Published: (2026)

nuScenes Knowledge Graph -- A comprehensive semantic representation of traffic scenes for trajectory prediction
by: Mlodzian, Leon, et al.
Published: (2023)

CUBIC: Concept Embeddings for Unsupervised Bias Identification using VLMs
by: Méndez, David, et al.
Published: (2025)

KGTN-ens: Few-Shot Image Classification with Knowledge Graph Ensembles
by: Filipiak, Dominik, et al.
Published: (2022)

Perceptual Flow Network for Visually Grounded Reasoning
by: Li, Yangfu, et al.
Published: (2026)

How to train your VAE
by: Rivera, Mariano
Published: (2023)

RSTeller: Scaling Up Visual Language Modeling in Remote Sensing with Rich Linguistic Semantics from Openly Available Data and Large Language Models
by: Ge, Junyao, et al.
Published: (2024)

Vertical Federated Image Segmentation
by: Mandal, Paul K., et al.
Published: (2024)

Horizontal Federated Computer Vision
by: Mandal, Paul K., et al.
Published: (2023)

FedHypeVAE: Federated Learning with Hypernetwork Generated Conditional VAEs for Differentially Private Embedding Sharing
by: Gupta, Sunny, et al.
Published: (2026)

Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments
by: Leiva, Mario, et al.
Published: (2025)

SynCo: Synthetic Hard Negatives for Contrastive Visual Representation Learning
by: Giakoumoglou, Nikolaos, et al.
Published: (2024)

VHAKG: A Multi-modal Knowledge Graph Based on Synchronized Multi-view Videos of Daily Activities
by: Egami, Shusaku, et al.
Published: (2024)

Enhancing Sports Strategy with Video Analytics and Data Mining: Assessing the effectiveness of Multimodal LLMs in tennis video analysis
by: Teo, Charlton
Published: (2025)

Enhancing Cross-Modal Contextual Congruence for Crowdfunding Success using Knowledge-infused Learning
by: Padhi, Trilok, et al.
Published: (2024)

Gated Recursive Fusion: A Stateful Approach to Scalable Multimodal Transformers
by: Shihata, Yusuf
Published: (2025)

Learnings from Scaling Visual Tokenizers for Reconstruction and Generation
by: Hansen-Estruch, Philippe, et al.
Published: (2025)

GIST: Multimodal Knowledge Extraction and Spatial Grounding via Intelligent Semantic Topology
by: Agrawal, Shivendra, et al.
Published: (2026)

CC-SGG: Corner Case Scenario Generation using Learned Scene Graphs
by: Drayson, George, et al.
Published: (2023)

U-SEG: Uncertainty in SEGmentation -- A systematic multi-variable exploration
by: Smith, Michael, et al.
Published: (2026)

Safeguarding Vision-Language Models Against Patched Visual Prompt Injectors
by: Sun, Jiachen, et al.
Published: (2024)

FortisAVQA and MAVEN: a Benchmark Dataset and Debiasing Framework for Robust Multimodal Reasoning
by: Ma, Jie, et al.
Published: (2025)

Perceptual Influence: Improving the Perceptual Loss Design for Low-Dose CT Enhancement
by: Viana, Gabriel A., et al.
Published: (2025)

Edge-Enabled Collaborative Object Detection for Real-Time Multi-Vehicle Perception
by: Richards, Everett, et al.
Published: (2025)

Content Adaptive based Motion Alignment Framework for Learned Video Compression
by: Zhang, Tiange, et al.
Published: (2025)

Hilbert-Geo: Solving Solid Geometric Problems by Neural-Symbolic Reasoning
by: Xu, Ruoran, et al.
Published: (2026)

Deterministic Event-Graph Substrates as World Models for Counterfactual Reasoning
by: Rovai, Fabio
Published: (2026)

Guide-Guard: Off-Target Predicting in CRISPR Applications
by: Bingham, Joseph, et al.
Published: (2026)

PairHuman: A High-Fidelity Photographic Dataset for Customized Dual-Person Generation
by: Pan, Ting, et al.
Published: (2025)

Detection Transformers Under the Knife: A Neuroscience-Inspired Approach to Ablations
by: Hütten, Nils, et al.
Published: (2025)

From Latent to Engine Manifolds: Analyzing ImageBind's Multimodal Embedding Space
by: Hamara, Andrew, et al.
Published: (2024)

GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics
by: Zhang, Yan, et al.
Published: (2026)

Scaling Large Vision-Language Models for Enhanced Multimodal Comprehension In Biomedical Image Analysis
by: Umeike, Robinson, et al.
Published: (2025)

A Two-stage Transformer Framework for Temporal Localization of Distracted Driver Behaviors
by: Doan, Gia-Bao, et al.
Published: (2026)

Adapting Multimodal Foundation Models for Few-Shot Learning: A Comprehensive Study on Contrastive Captioners
by: Narasinghe, N. K. B. M. P. K. B., et al.
Published: (2025)

StoryMovie: A Dataset for Semantic Alignment of Visual Stories with Movie Scripts and Subtitles
by: Oliveira, Daniel, et al.
Published: (2026)

A Two-Stage, Object-Centric Deep Learning Framework for Robust Exam Cheating Detection
by: Le, Van-Truong, et al.
Published: (2026)

A Hybrid Deep Learning and Model-Checking Framework for Accurate Brain Tumor Detection and Validation
by: Elfatimi, Elhoucine, et al.
Published: (2024)