Library Catalog

Bibliographic Details
Main Authors:	Douze, Matthijs; Guzhva, Alexandr; Deng, Chengqi; Johnson, Jeff; Szilvasy, Gergely; Mazaré, Pierre-Emmanuel; Lomeli, Maria; Hosseini, Lucas; Jégou, Hervé
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Computer Vision and Pattern Recognition; Software Engineering
Online Access:	https://arxiv.org/abs/2401.08281

Similar Items

Vector search with small radiuses
by: Szilvasy, Gergely, et al.
Published: (2024)

Inference-time sparse attention with asymmetric indexing
by: Mazaré, Pierre-Emmanuel, et al.
Published: (2025)

Self-Pruned Key-Value Attention: Learning When to Write by Predicting Future Utility
by: Szilvasy, Gergely, et al.
Published: (2026)

Short window attention enables long-term memorization
by: Cabannes, Loïc, et al.
Published: (2025)

Stochastic activations
by: Lomeli, Maria, et al.
Published: (2025)

evclust: Python library for evidential clustering
by: Soubeiga, Armel, et al.
Published: (2025)

Cross-Breed Pig Identification Using Auricular Vein Pattern Recognition: A Machine Learning Approach for Small-Scale Farming Applications
by: Nsengiyumvaa, Emmanuel, et al.
Published: (2025)

GUing: A Mobile GUI Search Engine using a Vision-Language Model
by: Wei, Jialiang, et al.
Published: (2024)

Can Vision-Language Models Handle Long-Context Code? An Empirical Study on Visual Compression
by: Zhong, Jianping, et al.
Published: (2026)

BLIP-FusePPO: A Vision-Language Deep Reinforcement Learning Framework for Lane Keeping in Autonomous Vehicles
by: Miangoleh, Seyed Ahmad Hosseini, et al.
Published: (2025)

Terrain characterisation for online adaptability of automated sonar processing: Lessons learnt from operationally applying ATR to sidescan sonar in MCM applications
by: Guerneve, Thomas, et al.
Published: (2024)

SWAN -- Enabling Fast and Mobile Histopathology Image Annotation through Swipeable Interfaces
by: Banerjee, Sweta, et al.
Published: (2025)

DD-CAM: Minimal Sufficient Explanations for Vision Models Using Delta Debugging
by: Khadka, Krishna, et al.
Published: (2026)

Foundation Models in Remote Sensing: Evolving from Unimodality to Multimodality
by: Hong, Danfeng, et al.
Published: (2026)

Technical Report for Argoverse2 Scenario Mining Challenges on Iterative Error Correction and Spatially-Aware Prompting
by: Chen, Yifei, et al.
Published: (2025)

CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent
by: Lin, Haojia, et al.
Published: (2025)

How Far Can VLMs Go for Visual Bug Detection? Studying 19,738 Keyframes from 41 Hours of Gameplay Videos
by: Lu, Wentao, et al.
Published: (2026)

Natural Adversaries: Fuzzing Autonomous Vehicles with Realistic Roadside Object Placements
by: Sun, Yang, et al.
Published: (2024)

Earth Embeddings as Products: Taxonomy, Ecosystem, and Standardized Access
by: Fang, Heng, et al.
Published: (2026)

Interpretable Gallbladder Ultrasound Diagnosis: A Lightweight Web-Mobile Software Platform with Real-Time XAI
by: Bhoyan, Fuyad Hasan, et al.
Published: (2025)

What to Test Next: Interpretable Coverage Gap Discovery in Driving VLMs
by: Aich, Abhishek, et al.
Published: (2026)

ROMAN: Reward-Orchestrated Multi-Head Attention Network for Autonomous Driving System Testing
by: Chi, Jianlei, et al.
Published: (2026)

Effort-Optimized, Accuracy-Driven Labelling and Validation of Test Inputs for DL Systems: A Mixed-Integer Linear Programming Approach
by: Amini, Mohammad Hossein, et al.
Published: (2025)

Ear-Keeper: A Cross-Platform AI System for Rapid and Accurate Ear Disease Diagnosis
by: Lu, Feiyan, et al.
Published: (2023)

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents
by: Meng, Fanqing, et al.
Published: (2026)

Benchmarking Image Perturbations for Testing Automated Driving Assistance Systems
by: Lambertenghi, Stefano Carlo, et al.
Published: (2025)

A Highly Efficient Diversity-based Input Selection for DNN Improvement Using VLMs
by: Abbasishahkoo, Amin, et al.
Published: (2026)

Do Existing Testing Tools Really Uncover Gender Bias in Text-to-Image Models?
by: Lyu, Yunbo, et al.
Published: (2025)

Evaluating and Enhancing Segmentation Model Robustness with Metamorphic Testing
by: Mzoughi, Seif, et al.
Published: (2025)

TigAug: Data Augmentation for Testing Traffic Light Detection in Autonomous Driving Systems
by: Lu, You, et al.
Published: (2025)

VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
by: Chang, Zhiyuan, et al.
Published: (2024)

ARI3D: A Software for Interactive Quantification of Regions in X-Ray CT 3D Images
by: Albrecht, Jan Phillipp, et al.
Published: (2025)

A Plausibility Study of Using Augmented Reality in the Ventriculoperitoneal Shunt Operations
by: Dorji, Tandin, et al.
Published: (2024)

VideoGameBunny: Towards vision assistants for video games
by: Taesiri, Mohammad Reza, et al.
Published: (2024)

Open-source automatic pipeline for efficient conversion of large-scale point clouds to IFC format
by: Zbirovský, Slávek, et al.
Published: (2025)

DOne: Decoupling Structure and Rendering for High-Fidelity Design-to-Code Generation
by: Huang, Xinhao, et al.
Published: (2026)

Investigating Traffic Accident Detection Using Multimodal Large Language Models
by: Skender, Ilhan, et al.
Published: (2025)

A Retrieval-Augmented Generation Approach to Extracting Algorithmic Logic from Neural Networks
by: Khalid, Waleed, et al.
Published: (2025)

MVOS_HSI: A Python Library for Preprocessing Agricultural Crop Hyperspectral Data
by: Aggarwal, Rishik, et al.
Published: (2026)

ITKIT: Feasible CT Image Analysis based on SimpleITK and MMEngine
by: Zhang, Yiqin, et al.
Published: (2026)